mmaaz60 Goto Github PK

followers: 111.0 following: 4.0 repos: 60.0 gists: 0.0

Name: Muhammad Maaz

Type: User

Company: @mbzuai

Bio: An Electrical Engineer with experience in Computer Vision software development. Skilled in Machine Learning, Deep Learning and Computer Vision.

Location: Abu Dhabi, UAE

Blog: https://www.muhammadmaaz.com

Hi there 👋

🔭 I’m currently working on multi-modal transformers and multi-task learning
🌱 I’m currently learning to play Table Tennis 🏓
📫 How to reach me: [email protected]

Muhammad Maaz's Projects

autogpt

An experimental open-source attempt to make GPT-4 fully autonomous.

bash-handbook

:book: For those who wanna learn Bash

contrastive_self-supervised_learning_for_visual_recognition_a-survey

Contrastive Self-Supervised Learning for Visual Recognition: A Survey

cvat_id_switch_solution

The repository contains the code to solve the id switches of tracks labelled using Intel's CVAT tool.

darknet

Windows and Linux version of Darknet Yolo v3 & v2 Neural Networks for object detection (Tensor Cores are used)

dcl

Destruction and Construction Learning for Fine-grained Image Recognition

deformable-detr

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

detectron2

Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.

detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

detreg

Official implementation of the paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

disciplineviolationdetection

Discipline Anomaly Detection using Audio and Video Processing

easy-faster-rcnn.pytorch

An easy implementation of Faster R-CNN (https://arxiv.org/pdf/1506.01497.pdf) in PyTorch.

edgenext

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

edges

Structured Edge Detection Toolbox

facial_keypoints_detection

Facial Keypoints detection using CNN modified from https://github.com/udacity/P1_Facial_Keypoints

groundinglmm

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].

image_captioning_using_encoder-decoder_architecture

Image Captioning Using CNN-RNN Architecture modified from https://github.com/udacity/CVND---Image-Captioning-Project

intriguing-properties-of-vision-transformers

Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)

lighthead-rcnn-in-pytorch0.4.1

Pytorch0.4.1 codes for Lighthead-RCNN

llava-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

llava-pp-hf-demo

maxvit

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].

mdef_detr

mdetr

mgan

Mask-Guided Attention Network for Occluded Pedestrian Detection. (ICCV'19)

mmaaz60

mmaaz60.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

mobilevit

multimodal-prompt-learning

Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

mmaaz60 Goto Github PK

Hi there 👋

Muhammad Maaz's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs