Topic: multimodality Goto Github
Some thing interesting about multimodality
Some thing interesting about multimodality
multimodality,A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
User: afiaka87
multimodality,Automated modeling and machine learning framework FEDOT
Organization: aimclub
Home Page: https://fedot.readthedocs.io
multimodality,Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
User: akashe
multimodality,A library of transformer models for computer vision and multi-modality research
Organization: amazon-science
Home Page: https://github.com/amazon-research/gluonmm
multimodality,A Multi-modal Framework for Sentimental Analysis of Meme
User: ambityga
Home Page: https://openreview.net/forum?id=Okmqu6xqXK¬eId=HT6W-zLdjTD
multimodality,An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
User: arrowluo
Home Page: https://arxiv.org/abs/2104.08860
multimodality,Deploy Stable Diffusion Model on Amazon SageMaker Endpont
Organization: aws-samples
multimodality,This guidance creates a scalable environment in AWS to prepare genomic, clinical, mutation, expression and imaging data for large-scale analysis and perform interactive queries against a data lake. The solution also demonstrates the use of Amazon Omics for multi-modal analysis.
Organization: awslabs
Home Page: https://aws.amazon.com/solutions/guidance/multi-omics-and-multi-modal-data-integration-and-analysis/
multimodality,The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
Organization: baai-agents
Home Page: https://baai-agents.github.io/Cradle/
multimodality,A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)
Organization: biomedsciai
multimodality,✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
User: bradyfu
multimodality,Resources (conference/journal publications, references to dataset) for harmful memes detection.
User: firojalam
multimodality,multimodal social media content (text, image) classification
User: firojalam
Home Page: https://crisisnlp.qcri.org/crisismmd.html
multimodality,A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evaluation - fusilli's got you covered 🌸
User: florencejt
Home Page: https://fusilli.readthedocs.io/en/latest/
multimodality,[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
User: fnzhan
multimodality,[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
Organization: foundationvision
multimodality,A knowledge base construction engine for richly formatted data
Organization: hazyresearch
Home Page: https://fonduer.readthedocs.io/
multimodality,Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
User: hymie122
multimodality,GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
User: jshilong
multimodality,TCyb 2018: Graph learning for multiview clustering
User: kunzhan
Home Page: https://doi.org/10.1109/TCYB.2017.2751646
multimodality,An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multimodality,Towards Generalist Biomedical AI
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multimodality,My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multimodality,Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
multimodality,Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
multimodality,Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
User: kyegomez
Home Page: https://swarmstorch.readthedocs.io/en/latest/
multimodality,Mode normalization (ICLR 2019).
User: ldeecke
multimodality,Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”
User: liang-zx
multimodality,Sequence-to-Sequence Framework in PyTorch
Organization: lium-lst
multimodality,A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
User: lucidrains
multimodality,Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
User: lucidrains
multimodality,中文领域的多模态Bert
User: luka0612
multimodality,An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Organization: microsoft
Home Page: https://arxiv.org/abs/2002.06353
multimodality,This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Organization: mmmu-benchmark
Home Page: https://mmmu-benchmark.github.io/
multimodality,This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"
Organization: mmstar-benchmark
Home Page: https://mmstar-benchmark.github.io
multimodality,DANCE: a deep learning library and benchmark platform for single-cell analysis
Organization: omicsml
Home Page: https://pydance.readthedocs.io
multimodality,A Comparative Framework for Multimodal Recommender Systems
Organization: preferredai
Home Page: https://cornac.preferred.ai
multimodality,Prognostically Relevant Subtypes and Survival Prediction for Breast Cancer Based on Multimodal Genomics Data
User: rezacsedu
multimodality,Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Organization: roboflow
Home Page: https://maestro.roboflow.com
multimodality,This repository contains the source code for the paper "Improving the performance of unimodal dynamic hand gesture recognition with multimodal training"
User: sbhonde1
Home Page: https://arxiv.org/abs/1812.06145
multimodality,A deep learning framework for building multimodal multi-task learning systems.
User: senwu
Home Page: https://emmental.readthedocs.io
multimodality,Attention-based multimodal fusion for sentiment analysis
User: soujanyaporia
multimodality,This repository contains code and metadata of How2 dataset
Organization: srvk
Home Page: https://srvk.github.io/how2-dataset/
multimodality,Behavioral data analysis and plotting in Python.
User: thechymera
multimodality,Fast regression and mediation analysis of vertex or voxel MRI data with TFCE
User: trislett
multimodality,Implementations of parallel tempering algorithms to augment samplers with tempering capabilities
Organization: turinglang
Home Page: https://turinglang.org/MCMCTempering.jl/
multimodality,Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268
User: xf-zhao
Home Page: https://matcha-agent.github.io/
multimodality,PyTorch implementation of LIMoE
User: yeonwoosung
multimodality,A Survey on multimodal learning research.
User: yutong-zhou-cv
multimodality,X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
User: zengyan-97
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.