GithubHelp home page GithubHelp logo

ssenichev / deep_vision_and_graphics Goto Github PK

View Code? Open in Web Editor NEW

This project forked from yandexdataschool/deep_vision_and_graphics

0.0 0.0 0.0 46.07 MB

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

License: MIT License

C++ 0.02% Python 1.71% Cuda 0.12% Jupyter Notebook 98.16%

deep_vision_and_graphics's Introduction

Deep Vision and Graphics

This repo supplements course "Deep Vision and Graphics" taught at YSDA @fall'22. The course is the successor of "Deep Learning" course taught at YSDA in 2015-2021. New course focuses more on applications of deep learning for computer vision.

Lecture and seminar materials for each week are in ./week* folders. Homeworks are in ./homework* folders.

General info

  • Telegram chat room (russian).
  • YSDA deadlines & admin stuff can be found at the YSDA LMS (ysda students only).
  • Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue

Syllabus

  • week01 Intro, recap of Neural network basics, optimization, backprop, biological networks
  • week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
  • week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
  • week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
  • week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
  • week06 Visualizing and understanding deep architectures, adversarial examples
  • week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
  • week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
  • week09 Generative adversarial networks
  • week10 Latent models (GLO, AEs, VQ-VAE, generative transformers)
  • week11 Flow models, diffusion models, generative transformers, CLIP, DALL-E
  • week12 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
  • week13 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching performed by

deep_vision_and_graphics's People

Contributors

sayankotor avatar justheuristic avatar simflin avatar atmyre avatar chiyar avatar markovka17 avatar redhash avatar vovaf709 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.