GithubHelp home page GithubHelp logo

zhangyp15 / simmod Goto Github PK

View Code? Open in Web Editor NEW
47.0 8.0 1.0 60.62 MB

Implementation of SimMOD: A Simple Baseline for Multi-Camera 3D Object Detection

Python 90.43% Shell 0.75% Jupyter Notebook 0.05% Dockerfile 0.02% Makefile 0.03% Batchfile 0.04% CSS 0.01% C++ 5.18% Cuda 3.49%

simmod's Introduction

SimMOD

Implementation of SimMOD: A Simple Baseline for Multi-Camera 3D Object Detection. (AAAI 2023)

Arxiv

framework

Installation

Check installation for installation.

Data Preparation

Check data_preparation for preparing the nuScenes dataset.

Getting Started

To train SimMOD with 8 GPUs, run:

bash tools/dist_train.sh $CONFIG 8

For evaluation, use:

bash tools/dist_test.sh $CONFIG $CKPT 8 --eval=bbox

We provide the pretrained models for SimMOD.

Method Pretrain mAP NDS Log Weights
SimMOD-r50 ImageNet 33.1 42.7 log model
SimMOD-r101 ImageNet 34.9 43.1 log model
SimMOD-r101 FCOS3D 37.0 45.4 log model
SimMOD-r101 NuImg 37.6 46.1 log model

Acknowledgement

This project is mainly based on DETR3D. Thanks for their great work.

If you find this repo useful for your research, please consider citing the paper:

@article{zhang2022simple,
  title={A Simple Baseline for Multi-Camera 3D Object Detection},
  author={Zhang, Yunpeng and Zheng, Wenzhao and Zhu, Zheng and Huang, Guan and Zhou, Jie and Lu, Jiwen},
  journal={arXiv preprint arXiv:2208.10035},
  year={2022}
}

simmod's People

Contributors

yunpengzhangphigent avatar zhangyp15 avatar

Stargazers

 avatar Xiaobing Han avatar Szymon Kwiatkowski avatar Xu CAO avatar Ruizi Yang avatar Liu Xiaolu avatar huawen avatar Zichen Yu avatar Jokester avatar dxjforyou avatar Junming Zhang avatar Peize Sun avatar Mose. avatar Lufan Ma avatar  avatar Wayne Mao avatar Andy.Chen avatar yangkang avatar  avatar Haisong Liu avatar  avatar JLQ avatar  avatar Hongyang Tang avatar LiuFG avatar  avatar probability avatar Chenming Wu avatar  avatar savoki avatar  avatar 爱可可-爱生活 avatar  avatar Kinke Kabingila avatar vasgaowei avatar 一条冻狗 avatar Patrick Liu avatar Lu Ming avatar Wenzhao Zheng avatar KeningChai avatar Yineng Xiong avatar Xinwei He avatar  avatar  avatar Jiazhi Yang avatar LeiZHang avatar Egqawkq avatar

Watchers

savoki avatar Yineng Xiong avatar LeiZHang avatar LiuFG avatar KeningChai avatar JLQ avatar hiyyg avatar  avatar

Forkers

whuhxb

simmod's Issues

config中LoadAnnotations3D的with_tokens与mmdet3d不匹配

您好,感谢您出色的工作。运行代码时报错TypeError: LoadAnnotations3D: init() got an unexpected keyword argument 'with_tokens'

config中LoadAnnotations3D有with_tokens参数,而对应版本的mmdet3d中的LoadAnnotations3D没有这个参数,我的mmdet和mmdet3d是按照MMDetection==2.14.0,MMDetection3D==0.17.2配置的。
所以会报错TypeError: LoadAnnotations3D: init() got an unexpected keyword argument 'with_tokens',所以这个参数是多余的吗?或者是mmdet3d要选择另外的版本呢?
期待您的回复。

@PIPELINES.register_module()

class LoadAnnotations3D(LoadAnnotations):

def __init__(self,
             with_bbox_3d=True,
             with_label_3d=True,
             with_attr_label=False,
             with_mask_3d=False,
             with_seg_3d=False,
             with_bbox=False,
             with_label=False,
             with_mask=False,
             with_seg=False,
             with_bbox_depth=False,
             poly2mask=True,
             seg_3d_dtype='int',
             file_client_args=dict(backend='disk')):
    super().__init__(
        with_bbox,
        with_label,
        with_mask,
        with_seg,
        poly2mask,
        file_client_args=file_client_args)
    self.with_bbox_3d = with_bbox_3d
    self.with_bbox_depth = with_bbox_depth
    self.with_label_3d = with_label_3d
    self.with_attr_label = with_attr_label
    self.with_mask_3d = with_mask_3d
    self.with_seg_3d = with_seg_3d
    self.seg_3d_dtype = seg_3d_dtype

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.