ruoyxue Goto Github PK

followers: 0.0 following: 5.0 repos: 34.0 gists: 0.0

Name: ruoyxue

Type: User

ruoyxue's Projects

advanced-css-project

Advanced CSS project for practice, which includes Natours, Trillo and Nexter.

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

av-relscore

Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23

avec

[WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition

cnvsrc2023baseline

Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)

conformer

Implementation of the convolutional module from the Conformer paper, for use in Transformers

cpp-machine-learning

Linear Regression, Perceptron using C++. Repository for machine learning.

csapp-labs

cmu csapp labs answer

deep_avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

digital-image-processing-lecture-notes

DIP lecture notess

e-business-react

An electronic business website using react

efficient-pytorch

My best practice of training large dataset using PyTorch.

face-occlusion-generation

[CVPRW 2022] Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

hippogriff

Griffin MQA + Hawk Linear RNN Hybrid

ila

kdgraph

official code for KDGraph

learn-an-effective-lip-reading-model-without-pains

The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.

lip-reading-deeplearning

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

lipreading_using_temporal_convolutional_networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

mssdmpa-net

muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

rewrite-the-stars

[CVPR 2024] Rewrite the Stars

rnn-transducer

A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition

semantic-segmentation-pytorch

semantic segmentation framework, pytorch FCN, unet, with self-constructed data iterator which is a light-weight substitute of dataloader+dataset

sub-word-level-lip-reading-with-visual-attention

Official Implementation of Visual Transformer Pooling for Lip reading

topo-boundary

A public available dataset for road boundary detection in aerial images

transformer_based_geo-localization

This is the repository for ECCV2022 paper titled: "Where in the World is this Image? Transformer-based Geo-localization in the Wild".

vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

ruoyxue Goto Github PK

ruoyxue's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs