GithubHelp home page GithubHelp logo

ddddm-pr / dafc Goto Github PK

View Code? Open in Web Editor NEW

This project forked from doulujiyao/dafc

0.0 1.0 0.0 351 KB

ICMEW:A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference

Python 100.00%

dafc's Introduction

ICMEW:A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference

This repository contains the source code for the generation based video compression method described in the paper:

A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference
IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
Feng, Y. Huang, Y. Zhang, J. Ling, A. Tang and L. Song
Paper

Abstract: Video conferences introduce a new scenario for video transmission, which focuses on keeping the fidelity of faces even in the low bandwidth network environment. In this work, we propose VSBNet, one of the frameworks to utilize face landmarks in video compression. Our method utilizes the adversarial learning to reconstruct origin frames from the landmarks. To recover more details and keep the consistency of identity, we propose the concept of visual sensitivity to separate the contour of the face from the fast-moving parts, such as eyes and mouth. Experimental results demonstrate the superiority of our framework with a low bit rate of around 1KB/s.

Important note

THE METHODS PROVIDED IN THIS REPOSITORY ARE NOT TO BE USED FOR MALICIOUS OR INAPPROPRIATE USE CASES.
We release this code in order to help facilitate research of technical counter-measures for detecting this kind of forgeries. Suppressing this kind of publications will not stop their development but will only make it more difficult to detect them.

Please note this is a work in progress, while we make every effort to improve the results of this method, not every pair of faces can produce a high quality face swap.

Requirements

  • High-end NVIDIA GPUs with at least 11GB of DRAM.
  • Either Linux or Windows. We recommend Linux for better performance.
  • CUDA Toolkit 10.1, CUDNN 7.5, and the latest NVIDIA driver.
  • Python 3.6+ and PyTorch 1.4.0+.

Data preparation and train

Download the [RAVDESS] (used in the paper) or alternatively any other source of high resolution videos will be fine as well.

The videos should be placed in a flat directory structure.

Run the following command to train all the videos:

cd dafc
python train_large_v2.py

dafc's People

Contributors

doulujiyao avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.