shenggan's Projects
Adaptive Tensor Parallelism for Foundation Models
A curated list of awesome projects and papers for distributed training or inference
BCCD (Blood Cell Count and Detection) Dataset is a small-scale dataset for blood cells detection.
A Template Repository for LaTex Beamer Presentation
手持端app,主要负责视频显示和蓝牙控制
小车端app,主要负责视频录制和传输。
Making large AI models cheaper, faster and more accessible
Performance benchmarking with ColossalAI
Examples of training models with hybrid parallelism using ColossalAI
Reimplement Deep Cell with Keras and Horovod.
Automated Parallelization System and Infrastructure for Multiple Ecosystems
CS 6208 Group Project
Endue the car with Intelligent
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
CS420 Project: MNIST Handwritten-Digits Recognition
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP.
A Survey and Benchmark of QUIC
A CPP implement of Twitter SnowFlake Algorithm.
Pipeline Parallelism for PyTorch
Development repository for the Triton language and compiler