eric-ai-lab Goto Github PK

repos: 25.0 gists: 0.0

Name: UCSC ERIC Lab

Type: Organization

Bio: UCSC Embodied and Responsible Interaction and Communication (ERIC) Lab

Blog: http://eric-lab.soe.ucsc.edu/home

UCSC ERIC Lab's Projects

acltoolbox

aerial-vision-and-dialog-navigation

Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

awesome-vision-language-navigation

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

comclip

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

cpl

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

discffusion

Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"

fedvln

[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"

llm_coordination

Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models"

minigpt-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

minigpt-5.github.io

mitigate-gender-bias-in-image-search

Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arxiv.org/abs/2109.05433

mmworld

Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

multipanelvqa

Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"

naivgation-as-wish

Official implementation of the NAACL 2024 paper "Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning"

pectvlm

Code implementation for Findings of EMNLP 2023 paper "Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment"

pevit

Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

photoswap

Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

probmed

"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"

r2h

Official implementation of the EMNLP 2023 paper "R2H: Building Multimodal Navigation Helpers that Respond to Help Requests"

screen-point-and-read

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

swap-anything

"SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"

t2iat

T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation

via-video

vicor

This is the implementation of ACL 2024 Findings paper ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models

vlmbench

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

eric-ai-lab Goto Github PK

UCSC ERIC Lab's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs