yhzhai Goto Github PK
Name: Yuanhao Zhai
Type: User
Company: State University of New York at Buffalo
Bio: PhD student in computer vision
Location: New York
Blog: yhzhai.com
Name: Yuanhao Zhai
Type: User
Company: State University of New York at Buffalo
Bio: PhD student in computer vision
Location: New York
Blog: yhzhai.com
[ACM MM 2023] Official implementation of paper "Language-guided Human Motion Synthesis with Atomic Actions".
Pytorch Implementation of 'Background Suppression Network for Weakly-supervised Temporal Action Localization' (AAAI-20)
A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is accepted in ICCV 2019.
Codes of our paper: "BSN: Boundary Sensitive Network for Temporal Action Proposal Generation"
Code for CVPR2020 paper: Conditional Gaussian Distribution Learning for Open Set Recognition
Temporal Segment Networks (TSN) in PyTorch
Demo code for paper "Learning optical flow from still images", CVPR 2021.
This repository is intended to host the diagnosis tool for analyzing temporal action localization algorithms. This tool is first presented as part of our DETAD paper.
End-to-End Object Detection with Transformers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Simple script for watching GPU usage on both system-wide and per-user basis.
HACS: Human Action Clips and Segments Dataset
[ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
MATLAB implementations of popular Image Forensic algorithms
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
An open source implementation of CLIP.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The project is an official implementation of our paper "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation".
[Doc] Productive Deep Learner
[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376
Official code for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection (CVPR 2022 oral)
LaTeX sleek beamer template
[ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.