GithubHelp home page GithubHelp logo

cai648971528 / dl.eyesight Goto Github PK

View Code? Open in Web Editor NEW

This project forked from liuguiyangnwpu/dl.eyesight

0.0 0.0 0.0 871 KB

Mainly use SSD, YOLO and other models to solve the target detection problem in image and video !

License: GNU General Public License v3.0

Python 100.00%

dl.eyesight's Introduction

图像视频中的目标检测

本工程主要目的是集成深度学习中常用的目标检测模型,并利用目标检测模型进行图像和视频中的检测!

开发环境

OS: Ubuntu 16.04
Python: Python 3.6.0
Tensorflow: 1.4.1 version
Opencv: 3.2.0 version for python

实现模型介绍

  • SSD+VGG
  • SSD+Res
  • SSD+Inception
  • SSD+SqueezeNet
  • SSD+Deconvolution
  • YOLO

运行说明

  1. 先准备好数据集
cat /Volumes/projects/DataSets/VOC2007/voc_train.txt
image_path01 xmin ymin xmax ymax class_id xmin ymin xmax ymax class_id
image_path02 xmin ymin xmax ymax class_id xmin ymin xmax ymax class_id
image_path03 xmin ymin xmax ymax class_id xmin ymin xmax ymax class_id

PS:Class_id从0开始编号,顺序同cfg文件中的label顺序一致
  1. 修改配置文件 配置文件存放在根目录下:conf/ssd_train.cfg 其中还有若干配置项,进行修改

  2. 运行程序 进入到example/ssd目录中

python vgg_trainer.py -c ../../conf/ssd_train.cfg

TODOLISTS

  • 整理文件目录结构,按照设计模式进行
  • 增加数据预处理的PipeLine
    • 图像插值
    • 图像镜像操作(左右,上下)
    • 添加随机噪声(各种模糊操作)
    • 对比度拉伸
    • 饱和度变化
    • 图像锐化
  • 提高模型训练速度
    • RawData ---> TFRecords
    • Single Process ---> Multi Processes
  • 检测过程的可视化
  • 编写检测网络结构模型文件
  • 对数据集的处理结构的统一接口
  • 编写对模块的测试文件

实验结果

  • YOLOv1模型在Pascal VOC数据集上的表现

联系我

dl.eyesight's People

Contributors

liuguiyangnwpu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.