Habitat-Matterport 3D Semantics Dataset

Karmesh Yadav, Ram Ramrakhya, Santhosh K. Ramakrishnan, Theo Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel X. Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot

June 2023

HM3DSem Annotations

Abstract

We present the Habitat-Matterport 3D Semantics (HM3DSEM) dataset. HM3DSEM is the largest dataset of 3D real-world spaces with densely annotated semantics that is currently available to the academic community. It consists of 142,646 object instance annotations across 216 3D spaces and 3,100 rooms within those spaces. The scale, quality, and diversity of object annotations far exceed those of prior datasets. A key difference that sets HM3DSEM apart from other datasets is the use of texture information to annotate pixel-accurate object boundaries. We demonstrate the effectiveness of the HM3DSEM dataset for the Object Goal Navigation task using different methods. Policies trained using HM3DSEM outperform those trained on prior datasets. The introduction of HM3DSEM in the Habitat ObjectNav Challenge led to an increase in participation from 400 submissions in 2021 to 1,022 submissions in 2022.

Type

Conference paper

Publication

In the Conference on Computer Vision and Pattern Recognition 2023

Embodied AI

Karmesh Yadav

Ph.D. Student at Georgia Tech
