In this work, we experimented with visualizing the hidden states of the BERT and T5 architectures. We also probed a number of models on different datasets.
The models we fine-tuned and used can be found at the following link: Link to the models
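
For readers unfamiliar with probing, the following is a minimal sketch of the general idea: a small linear classifier is trained on frozen hidden-state vectors to check whether a property can be read out of them. It does not reproduce the setup used in this work. The vectors here are synthetic stand-ins rather than real BERT or T5 hidden states, and every name (`make_toy_hidden_states`, `train_linear_probe`, `probe_accuracy`) and every hyperparameter is an assumption made for illustration.

```python
import math
import random


def make_toy_hidden_states(n_per_class=50, dim=8, seed=0):
    """Create synthetic vectors standing in for frozen hidden states.

    Assumption: real hidden states would come from a model such as BERT or T5;
    here the class label is encoded only in the first dimension.
    """
    rng = random.Random(seed)
    features, labels = [], []
    for label in (0, 1):
        shift = 1.5 if label == 1 else -1.5
        for _ in range(n_per_class):
            vec = [rng.gauss(0.0, 1.0) for _ in range(dim)]
            vec[0] += shift
            features.append(vec)
            labels.append(label)
    return features, labels


def train_linear_probe(features, labels, epochs=200, lr=0.5):
    """Fit a logistic-regression probe with full-batch gradient descent."""
    dim = len(features[0])
    n = len(features)
    weights = [0.0] * dim
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * dim
        grad_b = 0.0
        for x, y in zip(features, labels):
            z = sum(w * xi for w, xi in zip(weights, x)) + bias
            p = 1.0 / (1.0 + math.exp(-z))  # predicted probability of class 1
            err = p - y
            for i in range(dim):
                grad_w[i] += err * x[i]
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def probe_accuracy(weights, bias, features, labels):
    """Return the fraction of vectors the probe classifies correctly."""
    correct = 0
    for x, y in zip(features, labels):
        z = sum(w * xi for w, xi in zip(weights, x)) + bias
        correct += int((z > 0) == (y == 1))
    return correct / len(labels)


features, labels = make_toy_hidden_states()
weights, bias = train_linear_probe(features, labels)
accuracy = probe_accuracy(weights, bias, features, labels)
print(f"probe accuracy on the toy data: {accuracy:.2f}")
```

With real models, the feature vectors would be replaced by hidden states extracted from a chosen layer, and accuracy would be measured on held-out data rather than on the training set.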