LiDAR R-CNN: An Efficient and Universal 3D Object Detector

Introduction

This is the official code of LiDAR R-CNN: An Efficient and Universal 3D Object Detector. In this work, we present LiDAR R-CNN, a second stage detector that can generally improve any existing 3D detector. We find a common problem in Point-based RCNN, which is the learned features ignore the size of proposals, and propose several methods to remedy it. Evaluated on WOD benchmarks, our method significantly outperforms previous state-of-the-art.

中文介绍：https://zhuanlan.zhihu.com/p/359800738

News

We provide the training code for multi-frame setting, and show 3 frame results based PointPillars.

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 16.04)
Python 3.6+
PyTorch 1.5 or higher (tested on PyTorch 1.5, 6, 7)
CUDA 10.1

To install pybind11:

git clone [email protected]:pybind/pybind11.git
cd pybind11
mkdir build && cd build
cmake .. && make -j 
sudo make install

To install requirements:

pip install -r requirements.txt
apt-get install ninja-build libeigen3-dev

Install LiDAR_RCNN library:

python setup.py develop --user

Cuda Extensions:

# Rotated IOU
cd src/LiDAR_RCNN/ops/iou3d/
python setup.py build_ext --inplace

Preparing Data

Please refer to data processer to generate the proposal data.

Training

After preparing WOD data, we can train the vehicle only model in the paper, run this command:

python -m torch.distributed.launch --nproc_per_node=4 tools/train.py --cfg config/lidar_rcnn.yaml --name lidar_rcnn

For 3 class in WOD:

python -m torch.distributed.launch --nproc_per_node=8 tools/train.py --cfg config/lidar_rcnn_all_cls.yaml --name lidar_rcnn_all

The models and logs will be saved to work_dirs/outputs.

NOTE: for multi-frame training, please set MODEL.Frame = n in config.

Evaluation

To evaluate, run distributed testing with 4 gpus:

python -m torch.distributed.launch --nproc_per_node=4 tools/test.py --cfg config/lidar_rcnn.yaml --checkpoint outputs/lidar_rcnn/checkpoint_lidar_rcnn_59.pth.tar
python tools/create_results.py --cfg config/lidar_rcnn.yaml

Note that, you should keep the nGPUS in config equal to nproc_per_node .This will generate a val.bin file in the work_dir/results. You can create submission to Waymo server using waymo-open-dataset code by following the instructions here.

Results

Our model achieves the following performance on:

Waymo Open Dataset Challenges (3D Detection)

Proposals from	Class	Frame/Channel	3D AP L1 Vehicle	3D AP L1 Pedestrian	3D AP L1 Cyclist
PointPillars	Vehicle	1 / 1x	75.6	-	-
PointPillars	Vehicle	1 / 2x	75.6	-	-
PointPillars	Vehicle	3 / 2x	77.8	-	-
SST	Vehicle	3 / 2x	78.6	-	-
PointPillars	3 Class	1 / 1x	73.4	70.7	67.4
PointPillars	3 Class	1 / 2x	73.8	71.9	69.4

Proposals from	Class	Frame/Channel	3D AP L2 Vehicle	3D AP L2 Pedestrian	3D AP L2 Cyclist
PointPillars	Vehicle	1 / 1x	66.8	-	-
PointPillars	Vehicle	1 / 2x	67.9	-	-
PointPillars	Vehicle	3 / 2x	69.1	-	-
SST	Vehicle	3 / 2x	69.9	-	-
PointPillars	3 Class	1 / 1x	64.8	62.4	64.8
PointPillars	3 Class	1 / 2x	65.1	63.5	66.8

Note: The proposals provided by PointPillars are detected on 1 frame points cloud.

Citation

If you find our paper or repository useful, please consider citing

@article{li2021lidar,
  title={LiDAR R-CNN: An Efficient and Universal 3D Object Detector},
  author={Li, Zhichao and Wang, Feng and Wang, Naiyan},
  journal={CVPR},
  year={2021},
}

Acknowledgement

This project draws on the following codebases.

dimens66 / lidar_rcnn Goto Github PK

lidar_rcnn's Introduction

LiDAR R-CNN: An Efficient and Universal 3D Object Detector

Introduction

News

Requirements

Preparing Data

Training

Evaluation

Results

Citation

Acknowledgement

lidar_rcnn's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs