yangyi-chen / multimodal-and-large-language-models
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
Can you rename the paper "Aligning Large Multi-Modal Model with Robust Instruction Tuning" to "Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning"? Thanks a lot!
Hi, we recently finished a paper, "Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond", which we think is highly related to this repo. We would be glad if you could add it, and we are always open to discussion.
Dear Yangyi,
We have recently released a survey on MLLMs, titled "The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective". We believe this survey is highly relevant to your repo and would like to kindly request, if possible, to have it added to this repo. We look forward to discussing and potentially collaborating with you.
Best,
Zhen Qin
I'm thinking about enhancing this reading list. What if we include hyperlinks to the papers?
This addition could significantly improve the usability of the list.
Alternatively, we could consider uploading the PDFs of the papers to this repository.
That way, users can directly download the repository and begin their reading journey immediately.
Another option is to store the papers in Google Drive for easy access.
This is an awesome paper reading list! It really helps a lot!