An Minh Hùng's Projects
Stream video on a local network. Uses Flask and OpenCV.
Counts cars in highway footage using MobileNetSSD, Dlib, and OpenCV
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
A PyTorch implementation of CGD based on the paper "Combination of Multiple Global Descriptors for Image Retrieval"
Synthetic Dataset used in the ICDAR2019 Competition on HArvesting Raw Tables from Infographics (CHART-Infographics)
Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers, using tools such as OpenCV and AWS Rekognition.
Lightweight package for interacting with ChatGPT's API by OpenAI. Uses a reverse-engineered official API.
A tool for generating a Chinese license plate dataset for plate detection
Contrastive Language-Audio Pretraining
Automatically find issues in image datasets and practice data-centric computer vision.
ClearML - Auto-Magical CI/CD to streamline your ML workflow. Experiment Manager, MLOps and Data-Management
Contrastive Language-Image Pretraining
Easily compute clip embeddings and build a clip retrieval system with them
[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features
CLIP-like model evaluation
PyTorch implementation of "Compact Global Descriptor for Neural Networks" (CGD).
Train neural networks up to 7x faster
Leading free and open-source face recognition system
Conditional implementation for NVIDIA's StyleGAN architecture
Convert YOLO v4/v3/v2 .weights models to other model formats.
This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.