Topic: blip2
Something interesting about blip2
blip2,Finetuning Large Visual Models on Visual Question Answering
Organization: aws-samples
blip2,[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
User: buaadreamer
Home Page: https://arxiv.org/abs/2404.11317
blip2,Uses AI to scare people...more.
User: craigsdennis
blip2,[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Organization: damo-nlp-sg
blip2,Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Organization: eric-ai-lab
Home Page: https://sites.google.com/view/comclip
blip2,Caption images across your datasets with state of the art models from Hugging Face and Replicate!
User: jacobmarks
blip2,Implementation of Qformer from BLIP2 in Zeta Lego blocks.
User: kyegomez
Home Page: https://discord.gg/GYbXvDGevY
blip2,caption generator using lavis and argostranslate
User: leeyunjai
blip2,This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for risk(s).
Organization: matlok-ai
Home Page: https://bampe-weights.readthedocs.io/en/latest/
blip2,Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
Organization: michigannlp
blip2,(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Organization: mlpc-ucsd
Home Page: https://arxiv.org/abs/2308.09936
blip2,The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
User: nngocson2002
blip2,An end-to-end deep learning tool for image caption generation.
User: notslok
blip2,Too lazy to organize my desktop, make gpt + BLIP-2 do it
User: otdavies
blip2,Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Organization: paddlepaddle
blip2,Creating stylish social media captions for an Image using Multi Modal Models and Reinforcement Learning
User: shreyassks
blip2,Chat with NeRF enables users to interact with a NeRF model by typing in natural language.
Organization: sled-group
Home Page: https://chat-with-nerf.github.io
blip2,Automate fashion image captioning using BLIP-2. Automatically generating descriptions of clothes on shopping websites can help customers without fashion knowledge better understand the features (attributes, style, functionality, etc.) of the items and increase online sales by enticing more customers.
User: smithaupadhyaya
blip2,Modifying LAVIS' BLIP2 Q-former with models pretrained on Japanese datasets.
User: zhaopeiduo