GithubHelp home page GithubHelp logo

misery0424's Projects

clip icon clip

Contrastive Language-Image Pretraining

clip-1 icon clip-1

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

clip-featurevis icon clip-featurevis

code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"

clipbert icon clipbert

[CVPR 2021 Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning for image-text and video-text tasks.

decord icon decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

diffae icon diffae

Official implementation of Diffusion Autoencoders

dit icon dit

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

ernie icon ernie

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

lostgans icon lostgans

Official implementation of our ICCV19 paper "Image Synthesis From Reconfigurable Layout and Style"

ml-visuals icon ml-visuals

šŸŽØ ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

mmf icon mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

mxnet icon mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

navit icon navit

My implementation of "Patch nā€™ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

ofa icon ofa

Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

open-sora icon open-sora

Open-Sora: Democratizing Efficient Video Production for All

poster_template icon poster_template

some academic posters as references. May we have in-person poster session soon!

pytorch-image-models icon pytorch-image-models

PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more

pytorchvideo icon pytorchvideo

A deep learning library for video understanding research.

rq-transformer icon rq-transformer

Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"

rq-vae-transformer icon rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

slowfast icon slowfast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

soho icon soho

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    šŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. šŸ“ŠšŸ“ˆšŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ā¤ļø Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.