GithubHelp home page GithubHelp logo

sarvan0506 / yolo-midas Goto Github PK

View Code? Open in Web Editor NEW
23.0 3.0 8.0 10.84 MB

Combine YOLOv3 with MiDaS with a single Resnext101 backbone for Autonomous Navigation

Jupyter Notebook 82.37% Python 17.63%

yolo-midas's Introduction

yolo-midas for Autonomous Navigation

Combine YOLOv3 with MiDaS with a single Resnext101 backbone. The idea is to use a single feature extractor for two different applications, in this case, Object detection and Monocular Depth Estimation.

Please read the article to see more details https://sarvan0506.medium.com/yolo-v3-and-midas-from-a-single-resnext101-backbone-8ba42948bf65

structure

The model architecture change can be seen in model/mde_net.py

Training

The model is trained on Construction Safety Gear Data which can be found here https://github.com/sarvan0506/EVA5-Vision-Squad/tree/Saravana/14_Construction_Safety_Dataset. If training need to done on custom datasets refer the data preparation steps mentioned in the page.

Place the data inside data/customdata/custom.data folder

python3.6 train.py --data data/customdata/custom.data --batch 8 --cache --cfg cfg/mde.conf --epochs 50 --img-size 512

Please refer the config file cfg/mde.cfg to change the network configuration, freeze different branches. The model is an extension of YOLOv3 and MiDaS networks. Most of the configurations can be understood if familiar with

  1. https://github.com/ultralytics/yolov3
  2. https://github.com/intel-isl/MiDaS

Inference

Download the weights from https://drive.google.com/file/d/1LZoWaZbsD4gG4xgWQ4cW-ezyhmaHXV1O/view?usp=sharing and place it under weights folder

Place the images on which inference need to be run, inside input folder

python3.6 detect.py --source input --conf-thres 0.1 --output output --weights weights/best.pt

The inferred images will be stored inside output folder

Inference Result Sample

result

yolo-midas's People

Contributors

sarvan0506 avatar

Stargazers

 avatar wyz avatar AI/ML Engineer avatar Akshat avatar  avatar Sangmin Song  avatar Philipp Schmidt avatar  avatar ANUJ RATHORE avatar Murat Toprak avatar Nikita Ermolenko avatar Hwi avatar  avatar Marouan  avatar Raphael Marco avatar Luiz Vitor Martinez Cardoso avatar  avatar Sameera Sandaruwan avatar Shaswata Das avatar Shawn Charles avatar Florian Bruggisser avatar Alexey avatar Yanwei Liu avatar

Watchers

Luiz Vitor Martinez Cardoso avatar James Cloos avatar  avatar

yolo-midas's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.