Topic: multi-modality Goto Github
Some thing interesting about multi-modality
Some thing interesting about multi-modality
multi-modality,CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
Organization: amazon-science
multi-modality,:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
User: bradyfu
multi-modality,An official PyTorch implementation of the CRIS paper
User: derrickwang005
multi-modality,Algorithms and Publications on 3D Object Tracking
Organization: dlr-rm
multi-modality,[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Organization: dvlab-research
Home Page: https://julianjuaner.github.io/projects/PromptHighlighter
multi-modality,Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)
Organization: dvlab-research
multi-modality,Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
User: ecom-research
multi-modality,[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
User: haotian-liu
Home Page: https://llava.hliu.cc
multi-modality,InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Organization: internlm
multi-modality,This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.
User: jackyjsy
multi-modality,🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Organization: jina-ai
Home Page: https://clip-as-service.jina.ai
multi-modality,An open-source cloud-native of large multi-modal models (LMMs) serving framework.
Organization: jina-ai
multi-modality,An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multi-modality,The World's First AI-Enabled Multi-Modality Native Search Engine
User: kyegomez
Home Page: https://search.apac.ai
multi-modality,A forest of autonomous agents.
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,Implementation of Adepts Fuyu all-new Multi-Modality model in pytorch
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multi-modality,Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
User: kyegomez
Home Page: https://discord.gg/Czg5rpMZaC
multi-modality,Simple Implementation of a Transformer in the new framework MLX by Apple
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
User: kyegomez
Home Page: https://discord.gg/7VckQVxvKk
multi-modality,Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Zeta
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,Implementation of Qformer from BLIP2 in Zeta Lego blocks.
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality, Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multi-modality,An all-new OS that orchestrates autonomous agents as workers to execute tasks.
User: kyegomez
Home Page: https://discord.gg/cX5ttFP3eU
multi-modality,The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503
User: kyegomez
Home Page: https://docs.swarms.world
multi-modality,Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multi-modality,Simple Implementation of TinyGPTV in super simple Zeta lego blocks
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multi-modality,(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"
User: lee-gihun
multi-modality,Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
User: lucidrains
multi-modality,🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
User: luodian
Home Page: https://otter-ntu.github.io/
multi-modality,Cross-Modality Mutual Learning for Smart Contract Vulnerability Detection
User: messi-q
multi-modality,
Organization: mit-acl
multi-modality,Long-Term Rhythmic Video Soundtracker, ICML2023
Organization: opengvlab
Home Page: https://justinyuu.github.io/LORIS/
multi-modality,Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
Organization: opengvlab
multi-modality,This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations and images
User: oztobuzz
Home Page: https://huggingface.co/datasets/Vi-VLM/Vista
multi-modality,[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
User: rentainhe
multi-modality,[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Organization: researchmm
multi-modality,[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
User: rlhf-v
Home Page: https://rlhf-v.github.io
multi-modality,[TCSVT] CorrI2P: Deep Image-to-Point Cloud Registration via Dense CorrespondenceThe code of CorrI2P
User: rsy6318
Home Page: https://rsy6318.github.io/CorrI2P/
multi-modality,This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingface.
Organization: skit-ai
multi-modality,Multi-modal Graph learning for Disease Prediction (IEEE Trans. on Medical imaging, TMI2022)
User: ssgood
multi-modality,Embed arbitrary modalities (images, audio, documents, etc) into large language models.
User: sshh12
multi-modality,Fusion ICA Toolbox (MATLAB)
Organization: trendscenter
multi-modality,An open-source implementation for training LLaVA-NeXT.
User: xiaoachen98
multi-modality,Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
User: yangcaoai
Home Page: https://yangcaoai.github.io/publications/CoDA.html
multi-modality,[CVPR 2023] Collaborative Diffusion
User: ziqihuangg
Home Page: https://ziqihuangg.github.io/projects/collaborative-diffusion.html
multi-modality,[Paper][LREC-COLING 2024] Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion
Organization: zjukg
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.