GithubHelp home page GithubHelp logo

matrixgame2018 / monocular-depth-estimation-toolbox Goto Github PK

View Code? Open in Web Editor NEW

This project forked from zhyever/monocular-depth-estimation-toolbox

0.0 0.0 0.0 6.34 MB

Monocular Depth Estimation Toolbox based on MMSegmentation.

License: Apache License 2.0

Shell 0.37% Python 99.63%

monocular-depth-estimation-toolbox's Introduction

Monocular-Depth-Estimation-Toolbox

Introduction

Monocular-Depth-Estimation-Toolbox is an open source monocular depth estimation toolbox based on PyTorch and MMSegmentation v0.16.0.

It aims to benchmark MonoDepth methods and provides effective supports for evaluating and visualizing results.

Major features

  • Unified benchmark

    Provide a unified benchmark toolbox for various depth estimation methods.

  • Modular design

    Depth estimation frameworks are decomposed into different components. One can easily construct a customized framework by combining different modules.

  • Support of multiple methods out of box

    I would like to reproduce some of the most excellent depth estimation methods based on this toolbox.

  • High efficiency

    It seems that there are few depth estimation benchmarks, so I start this project and hope it is helpful for research.

Thanks to MMSeg, we own these major features. ๐Ÿ˜Š

Benchmark and model zoo

Results and models are available in the model zoo.

Supported backbones (partially release):

  • ResNet (CVPR'2016)
  • EfficientNet (ICML'2019)
  • Vision Transformer (ICLR'2021)
  • Swin Transformer (ICCV'2021)
  • I recommend cross-package import in config, so that you can utilize other backbone in MMcls, MMseg, etc. Refer to introduction. I will add more backbones in the future.

Supported methods:

Supported datasets:

Installation

Please refer to get_started.md for installation and dataset_prepare.md for dataset preparation.

Get Started

We provide train.md and inference.md for the usage of this toolbox.

License

This project is released under the Apache 2.0 license.

Acknowledgement

This repo benefits from awesome works of mmsegmentation, Adabins, BTS. Please also consider citing them.

Cite

@misc{lidepthtoolbox2022,
  title={Monocular Depth Estimation Toolbox},
  author={Zhenyu Li},
  howpublished = {\url{https://github.com/zhyever/Monocular-Depth-Estimation-Toolbox}},
  year={2022}
}

Changelog

  • Nov. 29, 2022: Release the config for BinsFormer on KITTI. Release configs for cross-dataset inference (nyu2sunrgbd). Fix NYUv2 download bugs as discussed in Issue #66 and #36. I add several codes for NeWCRFs but have no plan to conduct detailed experiments for it due to the current limited resource and time.
  • Jul. 23, 2022: Release codes for BinsFormer. Potential bugs or performance drops can exist, waiting for completion.
  • Jun. 5, 2022: Add support for custom dataset training. Add a warmup interface for sigloss to help convergence as discussed in Issue #20. Enhance the DPT support and fix bugs in provided pre-trained models as reported in Issue #23.
  • Apr. 16, 2022: Finish most of docs and provide all pre-trained parameters. Release codes about BTS, Adabins, DPT, SimIPU, and DepthFormer. Support KITTI, NYU-v2, SUN RGB-D(eval), and CityScapes.

TODO

  • I will complete the release of BinsFormer models on KITTI dataset (currently limited by the GPU resource).
  • I would like to include self-supervised depth estimation methods, such as MonoDepth2.

monocular-depth-estimation-toolbox's People

Contributors

zhyever avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.