GithubHelp home page GithubHelp logo

Hi there šŸ‘‹

  • šŸ”­ Iā€™m currently working on multi-modal transformers and multi-task learning
  • šŸŒ± Iā€™m currently learning to play Table Tennis šŸ“
  • šŸ“« How to reach me: [email protected]

Muhammad Maaz's Projects

autogpt icon autogpt

An experimental open-source attempt to make GPT-4 fully autonomous.

cvat_id_switch_solution icon cvat_id_switch_solution

The repository contains the code to solve the id switches of tracks labelled using Intel's CVAT tool.

darknet icon darknet

Windows and Linux version of Darknet Yolo v3 & v2 Neural Networks for object detection (Tensor Cores are used)

dcl icon dcl

Destruction and Construction Learning for Fine-grained Image Recognition

deformable-detr icon deformable-detr

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

detectron2 icon detectron2

Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.

detic icon detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

detreg icon detreg

Official implementation of the paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".

dino icon dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

edgenext icon edgenext

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

edges icon edges

Structured Edge Detection Toolbox

groundinglmm icon groundinglmm

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].

llava-pp icon llava-pp

šŸ”„šŸ”„ LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

maxvit icon maxvit

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].

mgan icon mgan

Mask-Guided Attention Network for Occluded Pedestrian Detection. (ICCV'19)

mmaaz60.github.io icon mmaaz60.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    šŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. šŸ“ŠšŸ“ˆšŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ā¤ļø Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.