Topic: cross-modal Goto Github
Some thing interesting about cross-modal
Some thing interesting about cross-modal
cross-modal,DSCNet Visible-Infrared Person ReID (TIFS 2022)
User: bitreidgroup
Home Page: https://ieeexplore.ieee.org/document/9963944
cross-modal,The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"
User: caoyue10
cross-modal,Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)
User: catalina17
cross-modal,Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]
User: clt29
Home Page: http://www.cs.pitt.edu/~chris/semantic_neighborhoods
cross-modal,Website for Cross Modal Learning and Application workshop - ACM ICMR 2019
User: crossmodallearning
cross-modal,Represent, send, store and search multimodal data
Organization: docarray
Home Page: https://docs.docarray.org/
cross-modal,[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
User: drsy
cross-modal,Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]
User: eaphan
cross-modal,Implementation of "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives" in Tensorflow.
User: gorjanradevski
cross-modal,Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Organization: gt-ripl
Home Page: https://sites.google.com/view/xmodal-context
cross-modal,The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)
User: haihuangcode
cross-modal,🪩 Create Disco Diffusion artworks in one line
Organization: jina-ai
cross-modal,[CVPR 2023] Referring Image Matting
User: jizhizili
cross-modal,A curated list of different papers and datasets in various areas of audio-visual processing
User: krantiparida
cross-modal,PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
User: kuanghuei
cross-modal,Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
User: kywen1119
cross-modal,Code, dataset and models for our CVPR 2022 publication "Text2Pos"
User: mako443
cross-modal,This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl.acm.org/doi/abs/10.1145/3617833 .
User: marslanm
cross-modal,Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
User: mesnico
cross-modal,MMAct: A Large-Scale Dataset for Cross Modal Learning on Human Action Understanding
User: mmact19
Home Page: https://mmact19.github.io/2019/
cross-modal,cDCGAN model for audio-to-image generation: a cross-modal analysis using deep-learning techniques
User: nataliakoliou
cross-modal,A hub hosting essential remote sensing datasets.
Organization: neuronelab
cross-modal,Code for COBRA: Contrastive Bi-Modal Representation Algorithm (https://arxiv.org/abs/2005.03687)
User: ovshake
cross-modal,[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
User: paranioar
cross-modal,Cross-modal convolutional neural networks
User: petarv-
cross-modal,An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.
User: prithivirajdamodaran
cross-modal,DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
User: qcraftai
cross-modal,BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (EMNLP 2023)
User: qizhipei
Home Page: https://arxiv.org/abs/2310.07276
cross-modal,Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Organization: roboflow
Home Page: https://maestro.roboflow.com
cross-modal,Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
User: rohitrango
cross-modal,Implementation of `Objects that Sound` and `Look, Listen, and Learn` papers by Relja Arandjelovi´c and Andrew Zisserman
User: rsmbyk
Home Page: https://deepmind.com/blog/objects-that-sound/
cross-modal,A collection of research on knowledge graphs
User: shaoxiongji
Home Page: https://shaoxiongji.github.io/knowledge-graphs/
cross-modal,Code for paper "direct speech-to-image translation"
User: smallflyingpig
Home Page: https://smallflyingpig.github.io/speech-to-image/main
cross-modal,Create Disco Diffusion artworks in one line
User: superdev0909
cross-modal,Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
Organization: towhee-io
cross-modal,Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label Cross-Modal Retrieval"
User: viresh-r
cross-modal,Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
User: yangli18
cross-modal,[IEEE T-IP 2020] Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition
User: yangliu9208
Home Page: https://yangliu9208.github.io/DIVAFN/
cross-modal,[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition
User: yangliu9208
cross-modal,Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类
User: yisun98
cross-modal,Python implementation of cross-modal hashing algorithms
User: yolo2233
cross-modal,Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
User: zengyi-qin
cross-modal,[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
User: zerovl
cross-modal,Search targeted pedestrians with the text.
Organization: zilliz-bootcamp
cross-modal,[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Organization: zjukg
Home Page: https://arxiv.org/abs/2207.01328
cross-modal,[Paper][ICANN 2023] Target-oriented Sentiment Classification with Sequential Cross-modal Semantic Graph
Organization: zjukg
Home Page: https://arxiv.org/abs/2208.09417
cross-modal,[Paper][AAAI 2024] Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
Organization: zjukg
Home Page: https://arxiv.org/abs/2305.06152
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.