Dear Hila, Thank you for your work, I really like it. <p dir="au

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

attn_grad about transformer-mm-explainability HOT 2 CLOSED

hila-chefer commented on May 20, 2024

attn_grad

from transformer-mm-explainability.

Comments (2)

hila-chefer commented on May 20, 2024 3

Hi @betterze, thanks for your kind words and for your interest in our work!

Let me try to clarify, but if my answer isn't good enough please let me know:
let's take the ViT example because it's simpler.
If we look at the ViT_new.py code, we can see the lines that are responsible for saving the gradients and attention maps:
self.save_attention_map(attn) attn.register_hook(self.save_attn_gradients)
If you notice, the first thing we do in the ViT notebook's generate_relevance function is to run a forward pass with the example we wish to explain:
output = model(input, register_hook=True), this triggers the "forward hook " that we set in this line: self.save_attention_map(attn), i.e. for each self-attention block this forward pass saves the self-attention matrix in self.attention_map.
After this forward pass, each attention block stores its attention map in self.attention_map.

Moving to the gradients, this line: attn.register_hook(self.save_attn_gradients) registers a backward hook on the attn tensor, such that when we backpropagate the gradients, the hook will be called for the attention map. This hook saves the attention-map's gradient in self.attn_gradients. The backward pass is also triggered from the generate_relevance function in the ViT notebook in line: one_hot.backward(retain_graph=True).
After this step, we have both gradients and attention maps saved in each self-attention block. All we need to do is iterate over the blocks and apply the rules on the maps + gradients, as you mentioned:
grad = blk.attn.get_attn_gradients() cam = blk.attn.get_attention_map()
I really hope this helps, but if somehow I failed to answer your question, please let me know and I'll clarify as needed.

Thanks :)

from transformer-mm-explainability.

betterze commented on May 20, 2024 1

Dear Hila,

Thank you very much for your detail answer. It is really helpful.

After a few experiments, I believe I understand it now.

Thank you again for your help, I really appreciate it.

Best Wishes,

Alex

from transformer-mm-explainability.

Recommend Projects

attn_grad about transformer-mm-explainability HOT 2 CLOSED

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs