Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

MVMO: A MULTI-OBJECT DATASET FOR WIDE BASELINE MULTI-VIEW SEMANTIC SEGMENTATION

  • Computer Vision Centre
  • Nankai University

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

We present MVMO (Multi-View, Multi-Object dataset): a synthetic dataset of 116, 000 scenes containing randomly placed objects of 10 distinct classes and captured from 25 camera locations in the upper hemisphere. MVMO comprises photorealistic, path-traced image renders, together with semantic segmentation ground truth for every view. Unlike existing multi-view datasets, MVMO features wide baselines between cameras and high density of objects, which lead to large disparities, heavy occlusions and view-dependent object appearance. Single view semantic segmentation is hindered by self and inter-object occlusions that could benefit from additional viewpoints. Therefore, we expect that MVMO will propel research in multi-view semantic segmentation and cross-view semantic transfer. We also provide baselines that show that new research is needed in such fields to exploit the complementary information of multi-view setups.

Idioma originalInglés
Título de la publicación alojada2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings
EditorialIEEE Computer Society
Páginas1166-1170
Número de páginas5
ISBN (versión digital)9781665496209
DOI
EstadoPublicada - 2022
Evento29th IEEE International Conference on Image Processing, ICIP 2022 - Bordeaux, Francia
Duración: 16 oct 202219 oct 2022

Serie de la publicación

NombreProceedings - International Conference on Image Processing, ICIP
ISSN (versión impresa)1522-4880

Conferencia

Conferencia29th IEEE International Conference on Image Processing, ICIP 2022
País/TerritorioFrancia
CiudadBordeaux
Período16/10/2219/10/22

Huella

Profundice en los temas de investigación de 'MVMO: A MULTI-OBJECT DATASET FOR WIDE BASELINE MULTI-VIEW SEMANTIC SEGMENTATION'. En conjunto forman una huella única.

Citar esto