GithubHelp home page GithubHelp logo

Xiao Yu's Projects

tablemaster-mmocr icon tablemaster-mmocr

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

tenginekit icon tenginekit

TengineKit - Free, Fast, Easy, Real-Time Face Detection & Face Landmarks & Face Attributes & Hand Detection & Hand Landmarks & Body Detection & Body Landmarks & Iris Landmarks & Yolov5 SDK On Mobile.

train-clip icon train-clip

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

transformers icon transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

transnetv2 icon transnetv2

TransNet V2: Shot Boundary Detection Neural Network

tts icon tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

u-2-net icon u-2-net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

ugatit-pytorch icon ugatit-pytorch

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

unisal icon unisal

Unified Image and Video Saliency Modeling (ECCV 2020)

upscale-a-video icon upscale-a-video

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

vall-e-x icon vall-e-x

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

var icon var

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

videomamba icon videomamba

VideoMamba: State Space Model for Efficient Video Understanding

videos icon videos

Code for the manim-generated scenes used in 3blue1brown videos

videosuperresolution icon videosuperresolution

A collection of state-of-the-art video or single-image super-resolution architectures, reimplemented in tensorflow.

vins-mono icon vins-mono

A Robust and Versatile Monocular Visual-Inertial State Estimator

vits_chinese icon vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

vtoonify icon vtoonify

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

woodpecker icon woodpecker

✨✨Woodpecker: Hallucination Correction for MLLMs. The first work to correct hallucination in multimodal large language models.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.