GithubHelp home page GithubHelp logo

yinwu33 / maptr Goto Github PK

View Code? Open in Web Editor NEW

This project forked from hustvl/maptr

0.0 0.0 0.0 6.04 MB

[ICLR'23 Spotlight] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

License: MIT License

Shell 0.62% C++ 4.57% Python 91.09% CSS 0.01% Cuda 3.64% Makefile 0.03% Batchfile 0.03% Dockerfile 0.02%

maptr's Introduction

MapTR

Structured Modeling and Learning for Online Vectorized HD Map Construction

Bencheng Liao1,2,3 *, Shaoyu Chen1,3 *, Xinggang Wang1 ๐Ÿ“ง, Tianheng Cheng1,3, Qian Zhang3, Wenyu Liu1, Chang Huang3

1 School of EIC, HUST, 2 Institute of Artificial Intelligence, HUST, 3 Horizon Robotics

(*) equal contribution, (๐Ÿ“ง) corresponding author.

ArXiv Preprint (arXiv 2208.14437)

openreview ICLR'23, accepted as ICLR Spotlight

News

  • May. 12th, 2023: MapTR now support various bevencoder, such as BEVFormer encoder and BEVFusion bevpool. Check it out!
  • Apr. 20th, 2023: Extending MapTR to a general map annotation framework (paper), with high flexibility in terms of spatial scale and element type.
  • Mar. 22nd, 2023: By leveraging MapTR, VAD (paper, code) models the driving scene as fully vectorized representation, achieving SoTA end-to-end planning performance!
  • Jan. 21st, 2023: MapTR is accepted to ICLR 2023 as Spotlight Presentation!
  • Nov. 11st, 2022: We release an initial version of MapTR.
  • Aug. 31st, 2022: We released our paper on Arxiv. Code/Models are coming soon. Please stay tuned! โ˜•๏ธ

Introduction

MapTR is a simple, fast and strong online vectorized HD map construction framework.

framework

We present MapTR, a structured end-to-end framework for efficient online vectorized HD map construction. We propose a unified permutation-based modeling approach, ie, modeling map element as a point set with a group of equivalent permutations, which avoids the definition ambiguity of map element and eases learning. We adopt a hierarchical query embedding scheme to flexibly encode structured map information and perform hierarchical bipartite matching for map element learning. MapTR achieves the best performance and efficiency among existing vectorized map construction approaches on nuScenes dataset. In particular, MapTR-nano runs at real-time inference speed ( $25.1$ FPS ) on RTX 3090, $8\times$ faster than the existing state-of-the-art camera-based method while achieving $3.3$ higher mAP. MapTR-tiny significantly outperforms the existing state-of-the-art multi-modality method by $13.5$ mAP while being faster. Qualitative results show that MapTR maintains stable and robust map construction quality in complex and various driving scenes. MapTR is of great application value in autonomous driving.

Models

Results from the paper

Method Backbone BEVEncoder Lr Schd mAP FPS memroy
MapTR-nano R18 GKT 110ep 44.2 25.1 11907M (bs 24)
MapTR-tiny R50 GKT 24ep 50.3 11.2 10287M (bs 4)
MapTR-tiny R50 GKT 110ep 58.7 11.2 10287M (bs 4)

Notes:

  • FPS is measured on NVIDIA RTX3090 GPU with batch size of 1 (containing 6 view images).
  • All the experiments are performed on 8 NVIDIA GeForce RTX 3090 GPUs.

Results from this repo. FPSs are much higher.

Method Backbone BEVEncoder Lr Schd mAP FPS memroy Config Download
MapTR-nano R18 GKT 110ep 46.3 48.2 11907M (bs 24) config model / log
MapTR-tiny R50 GKT 24ep 50.0 18.4 10287M (bs 4) config model / log
MapTR-tiny R50 GKT 110ep 59.3 18.4 10287M (bs 4) config model / log
MapTR-tiny Camera & LiDAR GKT 24ep 62.7 6.0 11858M (bs 4) config model / log
MapTR-tiny R50 bevpool 24ep 50.1 17.2 9817M (bs 4) config model / log
MapTR-tiny R50 bevformer 24ep 48.7 18.1 10219M (bs 4) config model / log

Qualitative results on nuScenes val set

MapTR maintains stable and robust map construction quality in various driving scenes.

visualizations

Sunny&Cloudy

sunny.cloudy_short.mp4

Rainy

rainy_short.mp4

Night

night_short.mp4

End-to-end Planning

e2e_planning.mp4

Getting Started

Catalog

  • centerline detection & topology support
  • multi-modal checkpoints
  • multi-modal code
  • lidar modality code
  • argoverse2 dataset
  • Nuscenes dataset
  • MapTR checkpoints
  • MapTR code
  • Initialization

Acknowledgements

MapTR is based on mmdetection3d. It is also greatly inspired by the following outstanding contributions to the open-source community: BEVFusion, BEVFormer, HDMapNet, GKT, VectorMapNet.

Citation

If you find MapTR is useful in your research or applications, please consider giving us a star ๐ŸŒŸ and citing it by the following BibTeX entry.

@inproceedings{MapTR,
  title={MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction},
  author={Liao, Bencheng and Chen, Shaoyu and Wang, Xinggang and Cheng, Tianheng, and Zhang, Qian and Liu, Wenyu and Huang, Chang},
  booktitle={International Conference on Learning Representations},
  year={2023}
}

maptr's People

Contributors

legendbc avatar outsidercsy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.