Comments (4)
Hi,
Is there a way to access the confidence of the generated caption?
@jhwang7628 I have a similar question. Did you figure out any way to get the confidence level?
Currently this is not natively supported. I'll have to look into whether this is possible with the HF BERT implementation, so please expect some delay.
One alternative might be to forward the generated caption and the original image into the BLIP ITM model to get a matching score.
Thanks.
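For reference, a minimal sketch of that ITM workaround in LAVIS (the API follows the LAVIS image-text-matching examples; the image path and caption string are placeholders for your own inputs):

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Load the BLIP image-text matching (ITM) model from LAVIS.
model, vis_processors, text_processors = load_model_and_preprocess(
    name="blip_image_text_matching", model_type="base", is_eval=True, device=device
)

raw_image = Image.open("example.jpg").convert("RGB")  # placeholder: the original image
caption = "a photo of a dog on the beach"             # placeholder: the generated caption

img = vis_processors["eval"](raw_image).unsqueeze(0).to(device)
txt = text_processors["eval"](caption)

# The ITM head produces two-class logits (no-match / match); a softmax over
# them gives a match probability that can serve as a confidence proxy.
itm_output = model({"image": img, "text_input": txt}, match_head="itm")
match_prob = torch.nn.functional.softmax(itm_output, dim=1)[:, 1].item()
print(f"caption matches image with probability {match_prob:.3f}")
```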
Thanks @dxli94 for answering.
How can I get a confidence score for VQA using BLIP?
Also, how can I compare the results of VQA and report the accuracy of the model on my dataset? (I am only using a pre-trained model for now.)
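The accuracy question isn't answered in this thread, but if your dataset follows the VQAv2 convention of multiple human answers per question, the standard VQA accuracy metric is straightforward to compute yourself. A sketch with hypothetical predictions and annotations (if your dataset has a single ground-truth answer per question, plain exact-match accuracy is enough):

```python
def vqa_accuracy(predicted: str, human_answers: list[str]) -> float:
    """Standard VQA accuracy: an answer counts as fully correct if at
    least 3 of the human annotators gave it, i.e. min(#matches / 3, 1)."""
    matches = sum(a.strip().lower() == predicted.strip().lower() for a in human_answers)
    return min(matches / 3.0, 1.0)

# Hypothetical example: model predictions vs. per-question human answers.
predictions = ["yes", "2", "red"]
annotations = [["yes"] * 10, ["2"] * 6 + ["two"] * 4, ["blue"] * 10]

acc = sum(vqa_accuracy(p, a) for p, a in zip(predictions, annotations)) / len(predictions)
print(f"VQA accuracy: {acc:.2%}")
```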
@dxli94 How can I use BLIP ITM on text and image to predict the confidence? Can you share any resource or notebook? Is it possible using LAVIS?
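This is doable entirely in LAVIS; the ITM sketch earlier in this thread shows the match-probability head. Per the LAVIS examples, the same model also exposes an ITC head that returns a raw image-text cosine similarity, which can be useful for ranking candidate captions. Continuing from that sketch (same `model`, `img`, `txt`):

```python
# ITC head: returns an image-text cosine similarity instead of a
# classifier probability.
itc_score = model({"image": img, "text_input": txt}, match_head="itc")
print(f"image-text cosine similarity: {itc_score.item():.3f}")
```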