Computer Vision Using Pose Estimation
DOI:
https://doi.org/10.31185/wjcm.111Keywords:
computer vision, deep learning, pose estimation, 2D, 3DAbstract
Pose estimation involves estimating the position and orientation of objects in a 3D space, and it has applications in areas such as robotics, augmented reality, and human-computer interaction. There are several methods for pose estimation, including model-based, feature-based, direct, hybrid, and deep learning-based methods. Each method has its own strengths and weaknesses, and the choice of method depends on the specific requirements of the application, object being estimated, and available data. Advancements in computer vision and machine learning have made it possible to achieve high accuracy and robustness in pose estimation, allowing for the development of a wide range of innovative applications. Pose estimation will continue to be an important area of research and development, and we can expect to see further improvements in the accuracy and robustness of pose estimation methods in the future.
References
S. Tulsiani, T. Zhou, A. A. Efros, and J. Malik, “Multi-view supervision for single-view reconstruction via differentiable ray consistency,”
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2623–2631, 2015.
K. Pauwels, M. M. V. Hulle, and B. D. Baets, “Survey of computational stereo vision algorithms: An overview,” Journal of Image and Vision Computing, vol. 21, no. 4, pp. 285–310, 2003.
X. Zhou, K. Xu, and J. Y. Zhu, “Deformable convolutional networks for object detection in a video,” IEEE Transactions on Image Processing, vol. 30, pp. 1865–1877, 2021.
A. Rahmati and J. Lu, “Pose-estimation-free object tracking via attentive feature extraction,” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6910–6919, 2020.
K. He, G. Gkioxari, P. Dollár, and R. Girshick, “Mask R-CNN,” Proceedings of the IEEE International Conference on Computer Vision, pp. 2980– 2988, 2017.
X. Chen, K. Kundu, Z. Zhang, H. Ma, S. Fidler, and R. Urtasun, “Monocular 3D object detection for autonomous driving,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2147–2156, 2016.
O. Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical image segmentation,” in International Conference on Medical image computing and computer-assisted intervention, pp. 234–241, Springer, 2015.
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” Advances in neural information processing systems, pp. 1097–1105, 2012.
K. Zhang, Z. Zhang, Z. Li, and Y. Qiao, “Joint face detection and alignment using multitask cascaded convolutional networks,” IEEE Signal Processing Letters, vol. 23, no. 10, pp. 1499–1503, 2016.
Z. Cao, T. Simon, S. E. Wei, and Y. Sheikh, “Realtime multi-person 2D pose estimation using part affinity fields,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1302–1310, 2017.
A. Krull, N. Buch, and A. Pieropan, “Probabilistic 6D object pose estimation and refinement in RGB-D images,” IEEE Robotics and Automation Letters, vol. 3, no. 4, pp. 3322–3329, 2018.
M. Loper, N. Mahmood, J. Romero, G. Pons-Moll, and M. J. Black, “SMPL: A skinned multi-person linear model,” ACM Transactions on Graphics (TOG), vol. 34, no. 6, pp. 248–248, 2014.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Ghazali Bin Sulong, M . Randles
This work is licensed under a Creative Commons Attribution 4.0 International License.