Abstract
Deep learning methods have revolutionized computer vision since the appearance of AlexNet in 2012. Nevertheless, 6 degrees of freedom pose estimation is still a difficult task to perform precisely. Therefore, we propose 2 ensemble techniques to refine poses from different deep learning 6DoF pose estimation models. The first technique, merge ensemble, combines the outputs of the base models geometrically. In the second, stacked generalization, a machine learning model is trained using the outputs of the base models and outputs the refined pose. The merge method improves the performance of the base models on LMO and YCB-V datasets and performs better on the pose estimation task than the stacking strategy.
Original language | English |
---|---|
Article number | 126270 |
Journal | Neurocomputing |
Volume | 541 |
DOIs | |
Publication status | Published - 7 Jul 2023 |
Funding
This paper has been supported by the project PROFLOW under the Basque program ELKARTEK, grant agreement No. KK-2022/00024.
Keywords
- Deep learning
- Ensemble
- Pose estimation
- Stacked generalization