GithubHelp home page GithubHelp logo

computer-vision-tasks's Introduction

Computer Vision Tasks

CV Tasks 包含Classification、Object Detection、Semantic Segmentation、Instance Segmentation,以下為一些應用範例。

1 keras-facenet.ipynb

此範例說明如何使用Pre-trained的model來做Face Verification和Face Recognition應用。[1]

2 Mask R-CNN for Object Detection.ipynb

此範例說明如何用Pre-trained的Mask RCNN來做Object Detection和Instance Segmentation。[2]

3 Transfer Learning by TensorFlow.ipynb

此範例說明如何用TensorFlow內建的Pre-trained model來做Classification的Transfer Learning應用。[3]
transfer learning

4 retrain customized object detection with tensorflow API.ipynb

此範例說明如何用TensorFlow API訓練自己的Object Detection應用。[4]

5 retrain semantic segmentation.ipynb

此範例說明如何使用Pre-trained的model來訓練自己的Semantic Segmentation應用。[5]

6 detectron2.ipynb

此範例說明如何使用Detectron2(Pre-trained model)來做Object Detection、Instance Segmentation、Keypoint Detection、Panoptic segmentation。
Keypoint Detection為體態辨識。[6]
img02
Panoptic segmentation為instance segmentation和semantic segmentation的結合。在圖中可以被數數量出來的物件 (例:bicycle, dog, car, person)被稱為‘things’,難以被計數的區域(例:pavement, ground, dirt, wall)稱為‘stuff’。[7]
img03

7 U-Net segmentation.ipynb

此範例說明如何使用U-Net來訓練自己的Semantic Segmentation應用。[8]

8 Style Transfer.ipynb

此範例介紹圖片Style Transfer應用,輸入原圖和風格圖即可將原圖改變風格。[9]
img04

9 retrain Poly-YOLO.ipynb

此範例說明如何使用Poly-YOLO來訓練自己的Object Detection應用。[10]

10 retrainYOLOv5.ipynb

此範例說明如何使用YOLOv5來訓練自己的Object Detection應用。[11]

11 StyleGAN pre-trained model exploration.ipynb

此範例說明如何使用StyleGAN來產生超擬真的人臉,並在fidelity和diversity之間做trade-off。[12][13][14]
img05

12 BigGAN Pre-trained model exploration.ipynb

此範例說明如何使用BigGAN來產生上百種的物品圖。[15]

References

[1] https://github.com/nyoki-mtl/keras-facenet
[2] https://github.com/matterport/Mask_RCNN
[3] https://www.tensorflow.org/tutorials/images/transfer_learning
[4] https://github.com/EdjeElectronics/TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10
[5] https://github.com/GeorgeSeif/Semantic-Segmentation-Suite.git
[6] https://github.com/facebookresearch/detectron2
[7] https://medium.com/@danielmechea/what-is-panoptic-segmentation-and-why-you-should-care-7f6c953d2a6a
[8] https://github.com/qubvel/segmentation_models
[9] https://github.com/GlebBrykin/SANET
[10] https://gitlab.com/irafm-ai/poly-yolo
[11] https://github.com/ultralytics/yolov5
[12] https://github.com/NVlabs/stylegan
[13] https://jonathan-hui.medium.com/gan-stylegan-stylegan2-479bdf256299
[14] https://arxiv.org/pdf/1812.04948.pdf
[15] https://tfhub.dev/deepmind/biggan-deep-256/1

computer-vision-tasks's People

Contributors

pttrazavi avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.