
lzgmatrix / cdpn_iccv2019_zhigangli


The official training code of ICCV2019 paper "CDPN: Coordinates-based Disentangled Pose Network for Real-time RGB-based 6-DoF Object Pose Estimation".

License: Apache License 2.0


cdpn_iccv2019_zhigangli's Introduction


CDPN: Coordinates-based Disentangled Pose Network for Real-time RGB-based 6-DoF Object Pose Estimation
Zhigang Li, Gu Wang, Xiangyang Ji
ICCV 2019 (Oral) paper, supplement, oral

We provide the cleaned-up training code of our ICCV 2019 paper "CDPN: Coordinates-based Disentangled Pose Network for Real-time RGB-based 6-DoF Object Pose Estimation". This code reproduces the results reported in the paper.

If you find this code useful for your research, please cite our paper:

@InProceedings{Li_2019_ICCV,
author = {Li, Zhigang and Wang, Gu and Ji, Xiangyang},
title = {CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-DoF Object Pose Estimation},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2019}
}

Overview

6-DoF object pose estimation from a single RGB image is a fundamental and long-standing problem in computer vision. Current leading approaches train deep networks either to regress both rotation and translation directly from the image, or to construct 2D-3D correspondences and solve the pose indirectly via PnP. We argue that rotation and translation should be treated differently because of their significantly different properties. In this work, we propose a novel 6-DoF pose estimation approach: the Coordinates-based Disentangled Pose Network (CDPN), which disentangles the pose and predicts rotation and translation separately to achieve highly accurate and robust pose estimation. Our method is flexible, efficient, highly accurate, and can handle texture-less and occluded objects. Extensive experiments on the LINEMOD and Occlusion datasets demonstrate the superiority of our approach. Concretely, our approach significantly exceeds the state-of-the-art RGB-based methods on commonly used metrics.

(Figure: CDPN overview)
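
In CDPN, rotation is estimated indirectly from predicted dense 2D-3D coordinate correspondences via PnP, while translation is regressed directly by a separate head. The following is a minimal conceptual sketch of that disentangled idea in Python (illustrative only, not the repository's actual inference code; all function and variable names are made up for this example):

```python
# Conceptual sketch of the disentangled pose idea: rotation from PnP on
# predicted 2D-3D correspondences, translation from a direct regression head.
import numpy as np
import cv2

def rotation_from_coordinates(coords_3d, coords_2d, K):
    """Solve for rotation from dense 2D-3D correspondences with PnP + RANSAC.

    coords_3d: (N, 3) predicted object-space coordinates
    coords_2d: (N, 2) corresponding pixel locations
    K: (3, 3) camera intrinsic matrix
    """
    ok, rvec, _tvec, _inliers = cv2.solvePnPRansac(
        coords_3d.astype(np.float64), coords_2d.astype(np.float64), K, None)
    R, _ = cv2.Rodrigues(rvec)   # rotation vector -> 3x3 rotation matrix
    return R                     # only the rotation is taken from PnP

def disentangled_pose(coords_3d, coords_2d, K, t_regressed):
    """Final pose: rotation solved indirectly, translation regressed directly."""
    R = rotation_from_coordinates(coords_3d, coords_2d, K)
    return R, t_regressed
```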

Results in our paper

(Figure: results reported in the paper)

Environment

Set up a Python 3.6.7 environment with:

pytorch==1.4.0
torchvision==0.5.0
numpy==1.19.3
opencv==4.5.0
tensorboardx==2.1

Other dependencies: 
yaml, pickle, pyparsing, progress, plyfile, scipy, tqdm, glob, os, sys...
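
A quick way to confirm that the environment matches the versions above is a short check script (a minimal sketch; the versions in the comments are simply the ones listed above):

```python
# Print the installed versions of the main dependencies listed above.
import torch
import torchvision
import numpy as np
import cv2
import tensorboardX

print("torch        :", torch.__version__)        # expected 1.4.0
print("torchvision  :", torchvision.__version__)  # expected 0.5.0
print("numpy        :", np.__version__)           # expected 1.19.3
print("opencv       :", cv2.__version__)          # expected 4.5.0
print("tensorboardX :", tensorboardX.__version__) # expected 2.1
print("CUDA available:", torch.cuda.is_available())
```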

Prepare the dataset

  1. Download the training and test data of the LINEMOD dataset:

    • The OpenGL training data can be found here (password: b7kt).
    • The real training data can be found here (password: sesw).
    • The test data can be found here (password: mhko).
    • The 3D object models can be found here (password: ba4f).
  2. Download VOC2012 dataset from official website (http://host.robots.ox.ac.uk/pascal/VOC/index.html).

    Note: VOC2012 is only used as background data; it can be replaced with other real-world image datasets such as MS COCO or SUN.

  3. Prepare the dataset as follows:

Root
├── dataset
│   ├── bg_images
│   │   └── VOC2012
│   └── lm
│       ├── models
│       │   ├── ape
│       │   └── ...
│       ├── imgn
│       │   ├── ape
│       │   └── ...
│       ├── real_test
│       │   ├── ape
│       │   └── ...
│       └── real_train
│           ├── ape
│           └── ...
├── asserts
├── tools
├── lib
├── dataset_cache (created automatically on the first run)
└── exp (created automatically during training)
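
Before training, the layout can be sanity-checked with a short script such as the sketch below (the object folder names are assumed to be the 13 LINEMOD objects listed in the results tables; adjust them to the objects you actually use):

```python
# Check that the dataset folders from the layout above exist under the repo root.
import os

ROOT = "."  # path to the repository root
OBJECTS = ["ape", "benchvise", "camera", "can", "cat", "driller", "duck",
           "eggbox", "glue", "holepuncher", "iron", "lamp", "phone"]

required = ["dataset/bg_images/VOC2012"]
for split in ("models", "imgn", "real_test", "real_train"):
    required += [os.path.join("dataset", "lm", split, obj) for obj in OBJECTS]

missing = [d for d in required if not os.path.isdir(os.path.join(ROOT, d))]
if missing:
    print("missing directories:")
    for d in missing:
        print("  ", d)
else:
    print("dataset layout looks complete")
```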

Training

1. Stage 1:

  • cd tools and run:
sh train_stage1.sh

2. Stage 2:

  • Edit the stage-2 config file at ./tools/exps_cfg/config_trans.yaml:

    Replace the 'path to the trained model of step1' placeholder with the actual path to your stage-1 checkpoint (see the sketch after this step).

  • cd tools and run:

sh train_stage2.sh
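
If you prefer to set the checkpoint path programmatically instead of editing the YAML by hand, a minimal sketch is shown below. Note that the key name load_model and the flat structure are assumptions borrowed from the Testing section; check the actual keys in config_trans.yaml before relying on this:

```python
# Point the stage-2 config at the stage-1 checkpoint (illustrative only).
import yaml

CFG_PATH = "exps_cfg/config_trans.yaml"          # run from the tools/ directory
STAGE1_CKPT = "/path/to/your/stage1/checkpoint"  # hypothetical path

with open(CFG_PATH) as f:
    cfg = yaml.safe_load(f)

cfg["load_model"] = STAGE1_CKPT   # key name assumed; verify against the config
with open(CFG_PATH, "w") as f:
    yaml.safe_dump(cfg, f, default_flow_style=False)
```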

3. Stage 3:

  • Edit the stage-3 config file at ./tools/exps_cfg/config_rot_trans.yaml:

    Replace the 'path to the trained model of step2' placeholder with the actual path to your stage-2 checkpoint.

  • cd tools and run:

sh train_stage3.sh

Testing

To test a model trained in stage 1, stage 2, or stage 3:

  1. Edit the corresponding config file by:
    • Setting "test" from False to True.
    • Setting "load_model" to the path of the model to be tested.
  2. cd tools and run:
python main.py --cfg=exps_cfg/path_to_your_config_file.yaml

Our trained models

Our trained model (download link, password: 92e2) and log file (download link, password: n792) of stage 1:

| Object/Metric | 5° 5cm | ADD | Proj. 2D |
| --- | --- | --- | --- |
| ape | 84.76 | 15.24 | 95.9 |
| benchvise | 97.38 | 94.86 | 95.44 |
| camera | 98.04 | 71.76 | 98.53 |
| can | 98.33 | 87.6 | 97.54 |
| cat | 94.71 | 54.89 | 98.7 |
| driller | 96.83 | 91.97 | 94.55 |
| duck | 90.33 | 25.54 | 97.93 |
| eggbox | 96.62 | 99.25 | 98.69 |
| glue | 80.31 | 90.35 | 97.59 |
| holepuncher | 98 | 74.6 | 99.81 |
| iron | 94.99 | 94.08 | 95.71 |
| lamp | 97.22 | 96.83 | 92.9 |
| phone | 91.12 | 83.76 | 98.3 |
| Average | 93.74 | 75.44 | 97.05 |
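
For reference, these are the standard 6-DoF pose metrics: 5° 5cm counts a pose as correct when the rotation error is below 5 degrees and the translation error is below 5 cm, ADD measures the average distance of model points transformed by the estimated versus ground-truth pose (commonly thresholded at 10% of the object diameter), and Proj. 2D measures the average 2D reprojection error (commonly thresholded at 5 pixels). Below is a minimal sketch of the ADD computation, not the repository's evaluation code:

```python
# Minimal sketch of the ADD metric: mean distance between model points under
# the estimated pose and under the ground-truth pose.
import numpy as np

def add_metric(pts, R_est, t_est, R_gt, t_gt):
    """pts: (N, 3) model points; R_*: (3, 3) rotations; t_*: (3,) translations."""
    est = pts @ R_est.T + t_est
    gt = pts @ R_gt.T + t_gt
    return np.linalg.norm(est - gt, axis=1).mean()

# A pose is commonly counted as correct if add_metric(...) < 0.1 * object_diameter.
```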

Our trained model (download link, password: q8rq) and log file (download, password: mdle) of stage 2:

| Object/Metric | 2cm | 5cm |
| --- | --- | --- |
| ape | 89.71 | 99.14 |
| benchvise | 92.82 | 99.9 |
| camera | 95.1 | 99.9 |
| can | 94.78 | 99.8 |
| cat | 93.11 | 99.8 |
| driller | 83.55 | 99.31 |
| duck | 91.08 | 99.72 |
| eggbox | 95.02 | 99.62 |
| glue | 82.24 | 99.42 |
| holepuncher | 94.01 | 99.71 |
| iron | 89.48 | 99.49 |
| lamp | 89.83 | 99.71 |
| phone | 83.66 | 98.87 |
| Average | 90.34 | 99.57 |

Our trained model (download, password: ksoo) and log file (download, password: tb1q) of stage 3:

| Object/Metric | 5° 5cm | ADD | Proj. 2D |
| --- | --- | --- | --- |
| ape | 86.67 | 67.33 | 97.52 |
| benchvise | 98.35 | 98.74 | 98.74 |
| camera | 98.73 | 92.84 | 98.63 |
| can | 98.92 | 96.56 | 99.61 |
| cat | 95.61 | 86.63 | 99.30 |
| driller | 96.93 | 95.14 | 94.85 |
| duck | 92.30 | 75.21 | 98.40 |
| eggbox | 97.84 | 99.62 | 99.06 |
| glue | 82.24 | 99.61 | 98.36 |
| holepuncher | 98.48 | 89.72 | 99.52 |
| iron | 95.71 | 97.85 | 97.85 |
| lamp | 97.79 | 97.79 | 95.68 |
| phone | 92.16 | 90.65 | 96.79 |
| Average | 94.75 | 91.36 | 98.02 |

Acknowledgement

This work is affiliated with Xiangyang Ji's Lab at Tsinghua University. For business cooperation, please contact: [email protected] & [email protected].

Copyright (c) Tsinghua University Xiangyang Ji's Lab. All Rights Reserved.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.


cdpn_iccv2019_zhigangli's Issues

extend to Occlusion dataset

Dear author,

As the title says, if I want to train the model on the LINEMOD-Occlusion dataset, how should I prepare the dataset for the training stage?

Thanks

Experiments on Occlusion dataset issue

I'm wondering how to run experiments on the Occlusion dataset, but this doesn't seem to be covered in your readme.md file.

Could the authors share the detailed experimental steps for the Occlusion dataset used in the paper, including dataset preparation, training, testing, and so on? I would be grateful if you could tell me.

RuntimeError: received 0 items of ancdata

Hello, I ran a test using config_rot.yaml. I changed 'test' from 'False' to 'True', 'classes' from 'all' to ['cat'], 'img_type' from 'real_imgn' to '', and 'test_mode' from 'all_fast' to 'all'. I ran python main.py --cfg=exps_cfg/config_rot.yaml. It runs for a while and then I get this error:
Traceback (most recent call last):
  File "main.py", line 106, in <module>
    main()
  File "main.py", line 75, in main
    _, preds = test(0, cfg, test_loader, network, obj_vtx, obj_info, criterions)
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/tools/../lib/test.py", line 72, in test
    for i, (obj, obj_id, inp, pose, c_box, s_box, box, trans_local) in enumerate(data_loader):
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 345, in __next__
    data = self._next_data()
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 841, in _next_data
    idx, data = self._get_data()
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 808, in _get_data
    success, data = self._try_get_data()
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 761, in _try_get_data
    data = self._data_queue.get(timeout=timeout)
  File "/usr/lib/python3.8/multiprocessing/queues.py", line 116, in get
    return _ForkingPickler.loads(res)
  File "/home/xxx/CDPN_ICCV2019_ZhigangLi/venv/lib/python3.8/site-packages/torch/multiprocessing/reductions.py", line 294, in rebuild_storage_fd
    fd = df.detach()
  File "/usr/lib/python3.8/multiprocessing/resource_sharer.py", line 58, in detach
    return reduction.recv_handle(conn)
  File "/usr/lib/python3.8/multiprocessing/reduction.py", line 189, in recv_handle
    return recvfds(s, 1)[0]
  File "/usr/lib/python3.8/multiprocessing/reduction.py", line 164, in recvfds
    raise RuntimeError('received %d items of ancdata' %
RuntimeError: received 0 items of ancdata

Thanks for your reply!

About the BOP format dataset

Thank you so much for such a great project!
I saw that you trained a model for the BOP Challenge, so I wanted to ask how to train on a BOP-format dataset. I have now synthesized my own BOP-format dataset using BlenderProc.

Looking forward to your reply!
Thank you very much!


create coordinate pkl file

I tried to project the 3D model (point-cloud points) onto the 2D plane and create three maps with the x, y, z coordinates of those points in the world coordinate frame, but I get somewhat different results from what's in the pkl file.

Can someone help me with creating the dataset? Thanks!

Not up to the official performance.

Hello, I used your code for training, but my final ADD performance is only 86.00, which is far from the number in the paper (89.86) and lower than the value provided on GitHub (91.36). All training steps were run according to the readme file, using the LM dataset and a Tesla K80.
(attached image)
Can you tell me how to improve the performance until the ADD matches the numbers provided in the paper or on GitHub?

the code for solving the pose through PnP issue

I would like to know where the code that solves the pose via PnP is located, after the cc_maps are obtained through the resnet_rot_head. Thank you very much!

cc_maps = self.rot_head_net(features)

Training on other datasets

Hello, your project is still the foundation of many methods today. How should I organize my dataset to work with your project? I would like to run experiments on an industrial dataset.

About object detector

Hello, author

Thanks for your great work and for sharing it.

  1. I noticed that in your paper the object detector seems to be used only in the testing stage. May I ask how the object's position is determined and the image cropped during the training stage?
  2. I would like to test the pose estimation results using other object detectors; could you give me some suggestions?

Best,
Yuning

The final result

Sorry to bother you. I used the stage3 checkpoint from your link, but I got the result below, and I think it is not correct.
(attached image)

Object detector training/inference code

Hi there,
Thanks for sharing all the work and code; it's very interesting.

I was wondering if you have any code for training the object detectors mentioned in your papers? This repo only contains the code for the "second" stage of your work.

Kind regards,
Chris

Trained models

Hi! Thank you for making your great work available!

I was wondering if you could also make your trained models available on another platform besides Baidu. It is quite difficult to download files from there.

Thank you in advance!
