happyxy
Name: Xiao Yu
Type: User
[CVPR-2024] One Model For Image/Video/Interactive/Open-Vocabulary Segmentation
[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Open Source Computer Vision Library
This is a markerless augmented reality application developed using OpenCV. The code can be used under the MIT license. Demo movie: http://youtu.be/KgQguj78qMA
Instant voice cloning by MyShell
OpenVSLAM: A Versatile Visual SLAM Framework
ORB-SLAM-3 (released in 2020), built on Windows 10 with Visual Studio 2019. Zip size: 319 MB
ORB_SLAM2 with Map Save/Load Function. Without ROS Environment
👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AIGC system, etc.
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
Paragraph-by-paragraph close reading of classic and new deep learning papers
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
PhotoMaker
Efficient facial landmark detector
Extended Kalman Filter for 6D pose estimation using GPS, IMU, magnetometer, and sonar sensors.
(ICCV 2019) Uncertainty-aware Face Representation and Recognition
A very long list of English profanity.
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Translate a video from one language to another and add dubbing.
Implementation of "Quadratic video interpolation", NeurIPS 2019.
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
The repo for the RankIQA paper in ICCV 2017
(ECCV 2020) RANSAC-Flow: generic two-stage image alignment
PyTorch code for our ECCV 2018 paper "Image Super-Resolution Using Very Deep Residual Channel Attention Networks"