Yipo Huang's Projects
An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.
[ACMMM 2024] AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception
Multimodality Multi-attribute Contrastive Pre-training for Image Aesthetics Computing
Awesome_Multimodel is a curated GitHub repository that collects resources for Multimodal Large Language Models (MLLMs). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
[TCSVT] Predicting the Quality of View Synthesis With Color-Depth Image Fusion
[SPL] Blind Quality Index of Depth Images Based on Structural Statistics for View Synthesis
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, NIMA, DBCNN, WaDIQaM, BRISQUE, PI and more...
[TMM] Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning
[TCSVT] Theme-aware Visual Attribute Reasoning for Image Aesthetics Assessment
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks
Config files for my GitHub profile.