GithubHelp home page GithubHelp logo

vztu / bvqa_benchmark Goto Github PK

View Code? Open in Web Editor NEW
113.0 4.0 14.0 893 KB

A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Home Page: https://ieeexplore.ieee.org/document/9405420

License: MIT License

Python 100.00%
ugc-vqa ugc-datasets performance-benchmark video-quality-assessment image-quality-assessment picture-quality youtube-dataset vqa-dataset bvqa-benchmark bvqa-models

bvqa_benchmark's Introduction

BVQA_Benchmark

This is a resource list for blind video quality assessment (BVQA) models on user-generated databases, i.e., the UGC-VQA problem studied in our paper [IEEE TIP2021] UGC-VQA: Benchmarking blind video quality assessment for user generated content. IEEEXplore

The following content include datasets, models & codes, performance benchmark and leaderboard.

Maintained by: Zhengzhong Tu

๐Ÿ‘ Any suggestion or idea is welcomed. Please see Contributing

  • Updates [02-31-2021] Added VQA mainpage at papers-with-code here: video-quality-assessment
  • Updates [10-21-2021] All the features I used in the paper can be downloaded here: Google Drive

Contents

Evaluate Your Own Model

Extract features in the form of NxM matrix (N:#samples, M:#features) on a given VQA dataset and save it in data/ directory. Let metadata file be stored in the same folder with MOSs in the same order as your feature matrix (We have already provided the MOS arrays of three UGC datasets). The evaluate_bvqa_features.py evaluates the extracted features via 100 random train-test splits and reports the median (std) SRCC/KRCC/PLCC/RMSE performances. Note that it is not applicable to deep learning models (feature-based model only).

Pre-requisites

  • python3
  • sklearn

Demo evaluations (BRISUQE on KoNViD-1k)

$ python3 src/evaluate_bvqa_features.py

Custom usage with your own model on given dataset

$ python3 src/evaluate_bvqa_features.py [-h] [--model_name MODEL_NAME]
                                   [--dataset_name DATASET_NAME]
                                   [--feature_file FEATURE_FILE]
                                   [--mos_file MOS_FILE] [--out_file OUT_FILE]
                                   [--color_only] [--log_short] [--use_parallel]
                                   [--num_iterations NUM_ITERATIONS]
                                   [--max_thread_count MAX_THREAD_COUNT]

UGC-VQA Datasets

BVQA Dataset Download Paper
KoNViD-1k (2017) KoNViD-1k Hosu et al. QoMEX'17
LIVE-VQC (2018) LIVE-VQC Sinno et al. TIP'19
YouTube-UGC (2019) YouTube-UGC Wang et al. MMSP'19
LIVE-FB-LSVQ (2021) LIVE-FB-LSVQ Ying et al. CVPR'21

BIQA / BVQA Models

BIQA

Model Download Paper
BRISQUE BRISQUE Mittal et al. TIP'12
NIQE NIQE Mittal et al. TIP'13
ILNIQE ILNIQE Zhang et al. TIP'15
GM-LOG GM-LOG Xue et al. TIP'14
HIGRADE HIGRADE Kundu et al. TIP'17
FRIQUEE FRIQUEE Ghadiyaram et al. JoV'17
CORNIA BIQA_Toolbox Ye et al. CVPR'12
HOSA BIQA_Toolbox Xu et al. TIP'16
KonCept 512 koniq, koniq-PyTorch Hosu et al. TIP'20
PaQ-2-PiQ PaQ-2-PiQ, paq2piq-PyTorch Ying et al. CVPR'20

BVQA

Model Download Paper
VIIDEO VIIDEO Mittal et al. TIP'16
V-BLIINDS V-BLIINDS Saad et al. TIP'14
TLVQM nr-vqa-consumervideo Korhenen et al. TIP'19
VSFA VSFA Li et al. MM'19
NSTSS NRVQA-NSTSS Dendi et al. TIP'20
VIDEVAL VIDEVAL Tu et al. TIP'21
MDTVSFA MDTVSFA Li et al. IJCV'21
RAPIQUE RAPIQUE Tu et al. OJSP'21
PatchVQ PatchVQ Ying et al. CVPR'21
CoINVQ CoINVQ Wang et al. CVPR'21

Performance Benchmark

Regression Results

Median SRCC (std SRCC) of 100 random train-test (80%-20%) splits.

Methods KoNViD-1k LIVE-VQC YouTube-UGC All-Combined
BRISQUE 0.6567 (0.0351) 0.5925 (0.0681) 0.3820 (0.0519) 0.5695 (0.0289)
NIQE 0.5417 (0.0347) 0.5957 (0.0571) 0.2379 (0.0487) 0.4622 (0.0313)
IL-NIQE 0.5264 (0.0294) 0.5037 (0.0712) 0.2918 (0.0502) 0.4592 (0.0307)
GM-LOG 0.6578 (0.0324) 0.5881 (0.0683) 0.3678 (0.0589) 0.5650 (0.0295)
HIGRADE 0.7206 (0.0302) 0.6103 (0.0680) 0.7376 (0.0338) 0.7398 (0.0189)
FRIQUEE 0.7472 (0.0263) 0.6579 (0.0536) 0.7652 (0.0301) 0.7568 (0.0237)
CORNIA 0.7169 (0.0245) 0.6719 (0.0473) 0.5972 (0.0413) 0.6764 (0.0216)
HOSA 0.7654 (0.0224) 0.6873 (0.0462) 0.6025 (0.0344) 0.6957 (0.0180)
VGG-19 0.7741 (0.0288) 0.6568 (0.0536) 0.7025 (0.0281) 0.7321 (0.0180)
ResNet-50 0.8018 (0.0255) 0.6636 (0.0511) 0.7183 (0.0281) 0.7557 (0.0177)
KonCept512 0.7349 (0.0252) 0.6645 (0.0523) 0.5872 (0.0396) 0.6608 (0.0221)
PaQ-2-PiQ 0.6130 (0.0325) 0.6436 (0.0457) 0.2658 (0.0473) 0.4727 (0.0298)
VIIDEO 0.2988 (0.0561) 0.0332 (0.0856) 0.0580 (0.0536) 0.1039 (0.0349)
V-BLIINDS 0.7101 (0.0314) 0.6939 (0.0502) 0.5590 (0.0496) 0.6545 (0.0232)
TLVQM 0.7729 (0.0242) 0.7988 (0.0365) 0.6693 (0.0306) 0.7271 (0.0189)
VIDEVAL 0.7832 (0.0216) 0.7522 (0.0390) 0.7787 (0.0254) 0.7960 (0.0151)
VSFA 0.755 (0.025) - - -
NSTSS 0.6417 - - -
VIDEVAL+KonCept512 0.8149 (0.0194) 0.7849 (0.0440) 0.8083 (0.0232) 0.8123 (0.0163)
MDTVSFA 0.7812 (0.0278) 0.7382 (0.0357) - -
RAPIQUE 0.8031 0.7548 0.7591 0.8070
PatchVQ 0.791 0.827 - -
CoINVQ 0.767 - 0.816 -

The median PLCC (std PLCC) of 100 random train-test (80%-20%) splits.

Model KoNViD-1k LIVE-VQC YouTube-UGC All-Combined
BRISQUE 0.6576 (0.0342) 0.6380 (0.0632) 0.3952 (0.0486) 0.5861 (0.0272)
NIQE 0.5530 (0.0337) 0.6286 (0.0512) 0.2776 (0.0431) 0.4773 (0.0287)
IL-NIQE 0.5400 (0.0337) 0.5437 (0.0707) 0.3302 (0.0579) 0.4741 (0.0280)
GM-LOG 0.6636 (0.0315) 0.6212 (0.0636) 0.3920 (0.0549) 0.5942 (0.0306)
HIGRADE 0.7269 (0.0287) 0.6332 (0.0652) 0.7216 (0.0334) 0.7368 (0.0190)
FRIQUEE 0.7482 (0.0257) 0.7000 (0.0587) 0.7571 (0.0324) 0.7550 (0.0226)
CORNIA 0.7135 (0.0236) 0.7183 (0.0420) 0.6057 (0.0399) 0.6974 (0.0202)
HOSA 0.7664 (0.0207) 0.7414 (0.0410) 0.6047 (0.0347) 0.7082 (0.0167)
VGG-19 0.7845 (0.0246) 0.7160 (0.0481) 0.6997 (0.0281) 0.7482 (0.0176)
ResNet-50 0.8104 (0.0229) 0.7205 (0.0434) 0.7097 (0.0276) 0.7747 (0.0167)
KonCept512 0.7489 (0.0240) 0.7278 (0.0464) 0.5940 (0.0412) 0.6763 (0.0227)
PaQ-2-PiQ 0.6014 (0.0338) 0.6683 (0.0445) 0.2935 (0.0490) 0.4828 (0.0293)
VIIDEO 0.3002 (0.0539) 0.2146 (0.0903) 0.1534 (0.0498) 0.1621 (0.0355)
V-BLIINDS 0.7037 (0.0301) 0.7178 (0.0500) 0.5551 (0.0465) 0.6599 (0.0234)
TLVQM 0.7688 (0.0238) 0.8025 (0.0360) 0.6590 (0.0302) 0.7342 (0.0180)
VIDEVAL 0.7803 (0.0223) 0.7514 (0.0420) 0.7733 (0.0257) 0.7939 (0.0157)}
VSFA 0.744 (0.029) - - -
NSTSS 0.6531 - - -
VIDEVAL+KonCept512 0.8169 (0.0179) 0.8010 (0.0398) 0.8028 (0.0234) 0.8168 (0.0128)
MDTVSFA 0.7856 (0.0240) 0.7728 (0.0351) - -
RAPIQUE 0.8175 0.7863 0.7684 0.8229
PatchVQ 0.786 0.837 - -
CoINVQ 0.764 - 0.802 -

Contributing

Please feel free to send an issue or pull requests or email me to add links or new results.

Citation

Should you find this repo useful to your research, we sincerely appreciate it if you cite our papers:blush::

@article{tu2020ugc,
  title={UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content},
  author={Tu, Zhengzhong and Wang, Yilin and Birkbeck, Neil and Adsumilli, Balu and Bovik, Alan C},
  journal={arXiv preprint arXiv:2005.14354},
  year={2020}
}

@inproceedings{tu2020comparative,
  title={A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment}, 
  author={Z. {Tu} and C. -J. {Chen} and L. -H. {Chen} and N. {Birkbeck} and B. {Adsumilli} and A. C. {Bovik}},
  booktitle={2020 IEEE International Conference on Image Processing (ICIP)},  
  year={2020},
  pages={141-145},
  doi={10.1109/ICIP40778.2020.9191169}
}

bvqa_benchmark's People

Contributors

vztu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

bvqa_benchmark's Issues

Youtube datasets

Hello, can you help upload the YouTube dataset to Baidu Cloud Disk? I met some problems (some videos failed to download) while downloading these videos from the Google cloud platform. Thank you very much!

Youtube datasets

Could you please share the 1380 videos you used with H264 format via Baidu Disk ?

CoINVQ

CoINVQ can not be connected

CNN feature

I saw the pertained ResNet-50/VGG-19 are used as a baseline CNN feature extractor but the results shown in Table V shows that they are very effective. Did you re-train it on other datasets or it just the fixed pertained model from imagenet?Thanks!

A plead for the implenmentation of EPooling method

Dear Dr. Tu,

Sorry to be a bother.
Thanks for your work in VQA! I am very interested in your work "EPooling". Could you please share me with the codes of your mentioned VQA pooling methods in "A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment" ?
Thank you very much !

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.