rongtongxueya / awesome-remote-sensing-multimodal-large-language-model

This project forked from zhanyang-nwpu/awesome-remote-sensing-multimodal-large-language-model

Multimodal Large Language Model for Remote Sensing (Vision-Language)

Awesome-Remote-Sensing-Multimodal-Large-Language-Model (Vision-Language)

📒 A collection of remote sensing multimodal large language model papers focusing on the vision-language domain.

Author: Yang Zhan

School of Artificial Intelligence, OPtics, and ElectroNics (iOPEN), Northwestern Polytechnical University

Please give this project a STAR ⭐ if it helps you.

📒 Latest Updates

This repository collects and documents researchers and their outstanding work on remote sensing multimodal large language models (vision-language).

  • The list will be continuously updated 🔥🔥
  • 📦 Coming soon! 🚀

Content

Papers

  • 🔥 Feb-4-24: LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model

arXiv 2024 (arXiv:2402.02544). D. Muhtar, Z. Li, F. Gu, X. Zhang, and P. Xiao. [Paper][Code]

  • 🔥 Jan-30-24: EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain

arXiv 2024 (arXiv:2401.16822). W. Zhang, M. Cai, T. Zhang, Y. Zhuang, and X. Mao. [Paper][Code: Null]

  • 🔥 Jan-18-24: SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model

arXiv 2024 (arXiv:2401.09712). Y. Zhan, Z. Xiong, and Y. Yuan. [Paper][Code]

  • 🔥 Nov-30-23: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs

arXiv 2023 (arXiv:2311.14656). J. Roberts, T. Lüddecke, R. Sheikh, K. Han, and S. Albanie. [Paper][Code]

  • 🔥 Nov-28-23: GeoChat: Grounded Large Vision-Language Model for Remote Sensing

arXiv 2023 (arXiv:2311.15826). K. Kuckreja, M. S. Danish, M. Naseer, A. Das, S. Khan, and F. S. Khan. [Paper][Code]

  • 🔥 Jul-28-23: RSGPT: A Remote Sensing Vision Language Model and Benchmark

arXiv 2023 (arXiv:2307.15266). Y. Hu, J. Yuan, and C. Wen. [Paper][Code]

Remote Sensing Vision-Language Dataset

  • 🔥 Feb-17-24: ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing

arXiv 2024 (arXiv:2402.11325). Z. Yuan, Z. Xiong, L. Mou, and X. X. Zhu. [Paper][Code: Null]

  • 🔥 Jan-2-24: RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

arXiv 2023 (arXiv:2306.11300). Z. Zhang, T. Zhao, Y. Guo, and J. Yin. [Paper][Code]

  • 🔥 Dec-20-23: SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing

AAAI 2024 (arXiv:2312.12856). Z. Wang, R. Prabha, T. Huang, J. Wu, and R. Rajagopal. [Paper][Code]

Related: Remote Sensing Vision-Language Foundation Models

  • 🔥 Jan-2-24: RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

arXiv 2023 (arXiv:2306.11300). Z. Zhang, T. Zhao, Y. Guo, and J. Yin. [Paper][Code]

  • 🔥 Dec-12-23: Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment

arXiv 2023 (arXiv:2312.06960). U. Mall, C. P. Phoo, M. K. Liu, C. Vondrick, B. Hariharan, and K. Bala. [Paper][Code: Null]

  • 🔥 Aug-10-23: RemoteCLIP: A Vision Language Foundation Model for Remote Sensing

arXiv 2023 (arXiv:2306.11029). F. Liu, D. Chen, Z. Guan, X. Zhou, J. Zhu, and J. Zhou. [Paper][Code]
