xuanlinli17 Goto Github PK
Name: Xuanlin (Simon) Li
Type: User
Bio: Researcher in computer vision, robotics, NLP | PhD student at UCSD CSE @haosulab | Alumni of Berkeley AI
Twitter: XuanlinLi2
Location: San Diego, CA
Name: Xuanlin (Simon) Li
Type: User
Bio: Researcher in computer vision, robotics, NLP | PhD student at UCSD CSE @haosulab | Alumni of Berkeley AI
Twitter: XuanlinLi2
Location: San Diego, CA
Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)
[CoRL22] Frame Mining - a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds
My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments
EfficientViT is a new family of vision models for efficient high-resolution vision.
Toolbox for our GraspNet-1Billion dataset.
Regularization Matters in Policy Optimization
Neural Surface reconstruction based on Instant-NGP. Efficient and customizable boilerplate for your research projects. Train NeuS in 10min!
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Strong and Open Vision Language Assistant for Mobile Devices
A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.
Python bindings to the pointcloud library (pcl)
ManiSkill2 RLDS dataset builder for X-embodiment dataset conversion.
Tracking Any Point (TAP)
Distributed machine learning infrastructure for large-scale robotics research
[ECCV 2022] Tensorial Radiance Fields, a novel approach to model and reconstruct radiance fields
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.