Nam D. Tran's Projects
All papers, notes and things for Active Learning.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Common util for many type of databases.
A Python wrapper for gemma.cpp
Predict Churn
Port of Facebook's LLaMA model in C/C++
LLM training in simple, raw C/CUDA
A framework for few-shot evaluation of language models.
OpenMMLab Detection Toolbox and Benchmark
Notes and Notebooks for the Bishop's book: Pattern Recognition And Machine Learning.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Friendly Terminal Assistant for Developers
Train transformer language models with reinforcement learning.
5X faster 60% less memory QLoRA finetuning
A Simple extension for Website's Assistant.
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/