Hi! This is Xu CAO.
- Recently, I’m working on image generation and multi-modal tasks.
- 📫
xucaotju[AT]gmail.com
Name: Xu CAO
Type: User
Company: Tongji Univ
Bio: Computer Vision, Autonomous Driving
Location: Shanghai, China
Blog: [email protected]
Hi! This is Xu CAO.
xucaotju[AT]gmail.com
The official repository for BEVerse
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Image to prompt with BLIP and CLIP
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
This project is the official implementation of 'Diffir: Efficient diffusion model for image restoration', ICCV2023
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Tutorial on Laplacian Image Blending
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Minimal is a Jekyll theme for GitHub Pages
OpenMMLab Image and Video Restoration, Editing and Generation Toolbox
OpenMMLab optical flow toolbox and benchmark
OpenMMLab Pre-training Toolbox and Benchmark
OpenMMLab Rotated Object Detection Toolbox and Benchmark
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Model Predictive Contouring Controller (MPCC) for Autonomous Racing
Multimodal-GPT
CVPR2023-Occupancy-Prediction-Challenge
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
This is Pytorch re-implementation of our CVPR 2020 paper "Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation" (https://arxiv.org/abs/1911.10194)
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
Open-vocabulary Semantic Segmentation
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.