GithubHelp home page GithubHelp logo

nucraft_api's Introduction

logo

Project Page

Download: [ Kaggle ] [ HuggingFace ]

nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding

Benjin Zhu1, Zhe Wang2, Hongsheng Li1,3

1 MMLab, The Chinese University of Hong Kong 2 SenseTime Research 3 Shanghai AI Laboratory

News

  • 2024/07/15 [email protected] dataset v1.0 released.
  • 2024/07/01 nuCraft is accepted by ECCV 2024.

Overview

nuCraft is a cutting-edge, high-resolution 3D semantic occupancy dataset designed to meet the increasing demand for accurate and comprehensive 3D scene understanding in autonomous driving. Derived from the widely-utilized nuScenes dataset, nuCraft distinguishes itself by offering an 8x increase in resolution over previous benchmarks, with grid resolution increased to 0.1 meters.

The motivation behind nuCraft stems from the inherent limitations present in existing 3D semantic occupancy datasets, such as low resolution and inaccurate annotations. These limitations hinder the effective unification and understanding of complex urban scenes needed for innovative autonomous driving technologies. In response, nuCraft addresses these issues by ensuring more precise semantic annotations and higher resolution, thus facilitating more refined and accurate occupancy predictions.

Wrong Ego Poses

logo

Inconsistent Semantic Labels

logo

Visual Comparison

logo

nuCraft represents a significant step forward in 3D scene understanding, integrating high-resolution data with advanced processing techniques to provide a superior dataset for researchers and developers. Its release is poised to drive innovations and improve the accuracy of autonomous driving technologies.

Key Features

  • High Resolution: nuCraft provides semantic occupancy grids at a resolution of [1024 x 1024 x 80] with a voxel size of 0.1m, compared to the [512 x 512 x 40] 0.2m grids in existing datasets.
  • Enhanced Data Quality: It mitigates the impact of noisy and sparse raw data, delivering more accurate and consistent annotations.
  • Comprehensive Labels: The dataset includes fine-grained semantic labels for both static elements (like buildings and roads) and dynamic elements (such as vehicles and pedestrians). It also provides accurate and dense occupancy flow and depth label (in future release).

logo

Data Generation Pipeline

The creation of nuCraft involves several meticulous steps:

  1. Pre-Processing: This includes static/moving parts separation to enhance the clarity of static and dynamic scene components and continuous scenes grouping to form longer, cohesive sequences.
  2. LiDAR Sequence Aggregation: Advanced pose estimation techniques like Kiss-ICP are employed to ensure better alignment of LiDAR frames, resulting in more reliable aggregated point clouds.
  3. Mesh Reconstruction: Multi-level octree structures and semantic mesh reconstruction techniques ensure high-quality and noise-free occupancy grids.
  4. Post-Processing: Further data refinement is done to remove outliers, reduce noise, and generate visibility masks for both LiDAR and camera sensors.

Acknowledgement

nuCraft is built upon the groundwork laid by the nuScenes dataset and integrates advanced methodologies elaborated in papers dealing with 3D occupancy prediction like SemanticKITTI, OpenOccupancy and Occ3D. The design and implementation of nuCraft are aimed at overcoming the specific deficiencies observed in these early datasets, inspired by academic research and practical requirements of modern autonomous driving systems.

License

The nuCraft dataset is under CC BY-NC-ND 4.0 license. All assets and code within this repository are under the Apache 2.0 license unless specified otherwise.

Citation

If this work is helpful for your research, please consider citing the following BibTeX entry.

@inproceedings{zhu2024nucraft,
  title={nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding},
  author={Zhu, Benjin and Wang, Zhe and Li, Hongsheng},
  booktitle={Proceedings of the IEEE/CVF Conference on European Conference on Computer Vision},
  year={2024}
}

nucraft_api's People

Contributors

poodarchu avatar

Stargazers

chenhaomingbob avatar Zhimin-Liao avatar hak-kyoung.kim avatar Daisuke Nishimatsu avatar lvchuandong avatar  avatar Yuxuan Liu avatar Liheng avatar zoy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.