GithubHelp home page GithubHelp logo

diffusers-papers's Introduction

Denoising Diffusion Probabilistic Models Papers

Papers club from the AI team in D-ID - this time Diffusion Model(DM).

Diffusion Models were first introduced in Deep Unsupervised Learning using Nonequilibrium Thermodynamics. However, it took until Generative Modeling by Estimating Gradients of the Data Distribution (Song et al., 2019, Stanford University), and then Denoising Diffusion Probabilistic Models (Ho et al., 2020, Google Brain) who independently improved the approach.

A good explnantion on what are Diffusion Models and why they are intresting can be found in Diffusion-Models Tutorial (CVPR 2022).

מועדון קריאת מאמרים שלנו - כל ההרצאות בעיברית

Title Paper / Resource Year Why is it interesting? Asignee Recording Slides
Denoising Diffusion Probabilistic Models Denoising Diffusion Probabilistic Models 2020
read whyhigh quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics.
@talbenha zoom(@NnH10JK) slides
The Annotated Diffusion Model The Annotated Diffusion Model
read why
self-work -- --
Colorization, Inpainting, Uncropping, and JPEG restoration Palette: Image-to-Image Diffusion Models 2021
read why A unified framework for image-to-image translation based on conditional diffusion models and evaluates this framework on four challenging image-to-image translation tasks, namely colorization, inpainting, uncropping, and JPEG restoration
@ArnoBen zoom (6CbWY6e*) slides
Rethinking Diffusion Models Design Elucidating the Design Space of Diffusion-Based Generative Models 2020
read whyKarras, the StyleGAN author is doing a back to the roots rethinking design choices of diffusion models, creating a well justified baseline archtecture
@orgoro zoom1(.m0gN7.?) zoom2(S^*c0ai3) slides
Super-Resolution Image Super-Resolution via Iterative Refinement 2021
read whyhigh quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics.
self-work -- --
Classifier (+ Classifier-Free) Diffusion Guidance Diffusion Models Beat GANs on Image Synthesis & Classifier-Free Diffusion Guidance 2021
read why DM achieve image sample quality superior to the current SOTA GAN models by improving the U-Net architecture, as well as introducing classifier (+calssifier free) guidance
@talbenha zoom(?JS330&C) slides
Text2Image ImageGen 2022
read why text-to-image synthesis
@alon.mengi zoom(7hB61@CU) slides
Efficient DM (Stable Diffusion) High-Resolution Image Synthesis with Latent Diffusion Models 2022
read why Apply DM in the latent space of powerful pretrained autoencoders to enable training on limited computational resources while retaining their quality and flexibility
@ShiraBaronn zoom(U!+B+7g+) slides
Imagic Imagic: Text-Based Real Image Editing with Diffusion Models 2022
read whyApply complex (e.g., non-rigid) text-guided semantic edits to a single real image
@Ganitk zoom(%1x7WWl*) slides
Text2Video Imagen Video: High Definition Video Generation with Diffusion Models 2022
read whya text-conditional video generation system based on a cascade of video diffusion models
@maysteinfeld zoom($Y=U45cT) slides
TTS-Diffusion Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech 2021
read whyText-to-speech model with score-based decoder producing mel-spectrograms by gradually transforming noise predicted by encoder and aligned with text input by means of Monotonic Alignment Search.
@amitay-nachmani zoom(@3yMN0gC) slides
3D Shape Synthesis LION: Latent Point Diffusion Models for 3D Shape Generation 2022
read whyHierarchical Latent Point Diffusion Model for 3D shape generation. LION is set up as a variational autoencoder (VAE) with a hierarchical latent space that combines a global shape latent representation with a point-structured latent space.
@matan-feldman zoom(q=v@4WYg) slides
DreamFusion DreamFusion: Text-to-3D using 2D Diffusion 2022
read whyDreamFusion use a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis
@ShiraBaronn zoom(9gZqV*2Y) slides
FMRI-to-Image with SD High-resolution image reconstruction with latent diffusion models from human brain activity 2023
read whyReconstruct images from FMRI using stable diffusion
@Ganitk zoom(B5J0vf?+) slides
Few cool papers 😎 Control Net, InstructPix2Pix, DreamBooth, Textual-Inversion, Prompt-to-Prompt 2023
read whyClosing the seminar with 5 cool papers
@talbenha zoom(r+52hd5@) slides

diffusers-papers's People

Contributors

orgoro avatar ganitk avatar shirabaronn avatar alon-mengi avatar amitay-nachmani avatar leong-deid avatar talbenha avatar maysteinfeld avatar matan-feldman avatar

Stargazers

 avatar  avatar Paul Fahnestock avatar  avatar  avatar  avatar Alexey Zagalsky avatar Guy Kabiri avatar  avatar sagi hammer avatar  avatar  avatar  avatar  avatar  avatar  avatar Dror Weiss avatar Arik Sfaradi avatar

Watchers

 avatar Kostas Georgiou avatar

Forkers

pdragonlabs

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.