GithubHelp home page GithubHelp logo

detr's Introduction

1. Resume

RESUME

2. 'Baekjoon Online Judge' Solved Rank

hyp3rflow's solved.ac stats

3. PyTorch Implementations From Scratch

Vision
2014 VAE Kingma and Welling [✓] Training on MNIST
[✓] Encoder output visualization
[✓] Decoder output visualization
2015 CAM Zhou et al. [✓] Application to GoogleNet
2016 Gatys et al., 2016 Gatys et al. [✓] Application to VGGNet-19
YOLO Redmon et al. [✗] Training on VOC 2012
[✗] Class probability map
[✗] Ground truth vlisualization on grid
DCGAN Radford et al. [✓] Training on CelebA at 64 × 64
[✓] Sampling
[✓] Latent space interpolation
Noroozi et al., 2016 Noroozi et al. [✓] Architecture
[✓] Chromatic aberration
[✓] Permutation set
Zhang et al., 2016 Zhang et al. [✓] Empirical probability distribution
[✗] Color space
2014
2017
Conditional GAN
WGAN-GP
Mirza et al.
Gulrajani et al.
[✓] Training on MNIST
2016
2017
PixelCNN
VQ-VAE
Oord et al.
Oord et al.
[✓] Training on Fashion MNIST
[✓] Training on CIFAR-10
2017 Pix2Pix Isola et al. [✓] Training on Google Maps
[✓] Training on Facades
[✗] Inference on larger resolution
CycleGAN Zhu et al. [✓] Training on Monet to photo
[✓] Training on Vangogh to photo
[✓] Training on Cezanne to photo
[✓] Training on Ukiyo-e to photo
[✓] Training on Horse to zebra
[✓] Training on Summer to winter
Noroozi et al., 2017 Noroozi et al. [✓] Constrastive loss
2018 PGGAN Karras et al. [✓] Training on CelebA-HQ at 512 × 512
DeepLab v3 Chen et al. [✓] Training on VOC 2012
[✓] Prediction on VOC 2012 validation set
[✓] Average mIoU
PixelLink Deng et al. [✓] Architecture
[✓] Instance-balanced cross entropy loss
[✓] Post-processing
RotNet Gidaris et al [✓] Attention map visualization
2020 STEFANN Roy et al. [✓] FANnet architecture
[✓]Training FANnet on Google Fonts
[✓] Custom Google Fonts dataset
[✓] Average SSIM
DDPM Ho et al. [✓] Training on CelebA at 32 × 32
[✓] Training on CelebA at 64 × 64
[✓] Denoising process visualization
[✓] Linear interpolation sampling
[✓] Coarse-to-fine sampling
DDIM Song et al. [✓] Sampling
[✓] Spherical interpolation sampling
[✓] Interpolation on grid sampling
[✓] Truncated normal
ViT Dosovitskiy et al. [✓] Training on CIFAR-10
[✓] Training on CIFAR-100
[✓] Attention Roll-out
[✓] Position embedding similarity
[✓] Position embedding interpolation
Extra
[✓] CutOut
[✓] Hide-and-Seek
[✓] CutMix
SimCLR Chen et al. [✓] Normalized temperature-scaled cross entropy loss
[✓] Data augmentation
[✓] Pixel intensity histogram
DETR Carion et al. [✓] Architecture
[✗] Batch normalization freezing
[✗] Data preparation
[✗] Training on COCO 2017
2021 Improved DDPM Nichol and Dhariwal [✓] Cosine diffusion schedule
Classifier-Guidance Dhariwal and Nichol [✗] AdaGN
[✗] BiGGAN Upsample/Downsample
[✗] Improved DDPM sampling
[✗] Conditional/Unconditional models
[✗] Super-resolution model
[✗] Interpolation
ILVR Choi et al. [✓] Sampling from single reference
[✓] Sampling from various scale factors
[✓] Sampling from various conditioning range
SDEdit Meng et al. [✓] User input stroke simulation
MAE He et al. [✓] MAE architecture for pre-training
[✗] MAE architecture for self-supervised learning
[✗] Training on ImageNet-1K
[✗] Fine-tuning
[✗] Linear probing
Copy-Paste Ghiasi et al. [✓] Large scale jittering
[✓] Copy-Paste (within mini-batch)
[✗] Gaussian filter
ViViT Arnab et al.
2022 CFG Ho et al.
Language
2017 Transformer Vaswani et al. [✓] Architecture
[✓] Position encoding visualization
2019 BERT Devlin et al. [✓] BookCorpus data pre-processing
[✓] Architecture
[✓] Masked language modeling
[✓] SQuAD data pre-processing
[✓]SWAG data pre-processing
Sentence-BERT Reimers et al. [✓] Classification loss
[✓] Regression loss
[✓] Constrastive loss
[✓] STSb data pre-processing
[✓] WikiSection data pre-processing
[✗] NLI data pre-processing
RoBERTa Liu et al. [✓] BookCorpus data pre-processing
[✓] Masked language modeling
[✗] BookCorpus data pre-processing
(SEGMENT-PAIR + NSP)
[✗] BookCorpus data pre-processing
(SENTENCE-PAIR + NSP)
[✓] BookCorpus data pre-processing
(FULL-SENTENCES)
[✗] BookCorpus data pre-processing
(DOC-SENTENCES)
Vision-Language
2021 CLIP Radford et al. [✓] Training on Flickr8k + Flickr30k
[✓] Zero-shot classification on ImageNet1k (mini)
[✓] Linear classification on ImageNet1k (mini)

detr's People

Contributors

kimrass avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.