happyxy Goto Github PK

followers: 1.0 following: 3.0 repos: 254.0 gists: 0.0

Name: Xiao Yu

Type: User

Xiao Yu's Projects

omg-seg

[CVPR-2024] One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

one-shot-human-parsing

[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing

ootdiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

opencv-marker-less-ar

This is a marker less augmented reality application developed using OpenCV. This code can be used under MIT license. Demo movie: http://youtu.be/KgQguj78qMA

openvslam

OpenVSLAM: A Versatile Visual SLAM Framework

orbslam3-windows

ORB-SLAM-3 (released in 2020), built in Windows 10, Visual Studio 2019. Zip size 319 Mb

orbslam_mapsave

ORB_SLAM2 with Map Save/Load Function. Without ROS Environment

👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AICG system etc.

paddleocr

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

paddleseg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

paper-reading

深度学习经典、新论文逐段精读

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

personalize-sam

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

photomaker

PhotoMaker

pipnet

Efficient facial landmark detector

pose_ekf

Extented Kalman Filter for 6D pose estimation using gps, imu, magnetometer and sonar sensor.

probabilistic-face-embeddings

(ICCV 2019) Uncertainty-aware Face Representation and Recognition

profane-words

A very long list of English profanity.

pulid

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

pulse

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

qvi

Implementation of "Quadratic video interpolation", NeurIPS 2019.

qwen-audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

rankiqa

The rep for the RankIQA paper in ICCV 2017

ransac-flow

(ECCV 2020) RANSAC-Flow: generic two-stage image alignment

rcan

PyTorch code for our ECCV 2018 paper "Image Super-Resolution Using Very Deep Residual Channel Attention Networks"

happyxy Goto Github PK

Xiao Yu's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs