Comments (9)
Thanks for your answer; that may be one possible reason. I also have a question about max-pooling: on SSL-pretrained ViT features, its performance even surpasses ABMIL, and I observed the same phenomenon in my own experiments. Can this be explained by the feature representations already being good enough that there is no need for additional learnable parameters?
from acmil.
Certainly, I share your viewpoint. I have obtained similar results on the CAMELYON dataset with SSL-pretrained ViT embeddings.
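To make the comparison concrete, here is a minimal sketch of parameter-free max-pooling aggregation over instance embeddings. All shapes and names are illustrative (e.g. 500 patches with 384-dim ViT-S features), not taken from the acmil code:

```python
import numpy as np

rng = np.random.default_rng(0)

def max_pool_bag(instance_feats):
    # instance_feats: (num_instances, feat_dim) -> (feat_dim,) bag embedding
    # Note: no learnable parameters at all -- the aggregator cannot overfit;
    # everything rests on the quality of the pretrained instance features.
    return instance_feats.max(axis=0)

bag = rng.normal(size=(500, 384))  # a bag of 500 patch embeddings, dim 384
z = max_pool_bag(bag)              # bag-level embedding of shape (384,)
```

If the SSL-pretrained features are already strongly discriminative per instance, this parameter-free reduction can match or beat learned pooling.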
I share a similar question and haven't fully unraveled it. My attempt at an explanation is as follows: while TransMIL makes strides in modeling the correlation between instances, its two stacked attention layers introduce additional parameters, making TransMIL more susceptible to overfitting. In such cases, the quality of the representation becomes pivotal for using TransMIL effectively, and a better representation benefits TransMIL more. Notably, the three groups of results used ResNet18 (our paper), ResNet50 (TransMIL paper), and SSL-pretrained ViT (our paper) backbones, and this choice significantly impacted the final outcomes.
Furthermore, it's crucial to note that the implementation details and the selection of hyperparameters can also influence performance. In response to this, I've updated the implementations of ABMIL, TransMIL, and other baselines. I hope this provides some clarity.
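For contrast with parameter-free max-pooling, here is a rough numpy sketch of the extra learnable parameters that ABMIL-style attention pooling introduces (TransMIL's stacked self-attention layers add far more). The matrices V and w and all dimensions are illustrative, not the acmil implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def abmil_pool(H, V, w):
    # H: (N, D) instance embeddings
    # V: (L, D) and w: (L,) are the learnable attention parameters
    scores = np.tanh(H @ V.T) @ w       # one unnormalized score per instance
    a = np.exp(scores - scores.max())
    a = a / a.sum()                     # softmax attention weights over instances
    return a @ H                        # (D,) attention-weighted bag embedding

H = rng.normal(size=(500, 384))         # 500 patch embeddings, dim 384
V = rng.normal(size=(128, 384)) * 0.01  # hidden dim L = 128, illustrative
w = rng.normal(size=128)
z = abmil_pool(H, V, w)
```

These extra parameters must be fit on typically small WSI datasets, which is one way to see why weaker features plus more parameters can underperform max-pooling.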
Also, could the authors open-source their splits for reproducing the results? Since the original work only says "randomly split", the results are not easy to reproduce.
Thank you for your suggestion. I have uploaded the split files with five seeds for the CAMELYON dataset; please check the files in splits/camelyon.
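For anyone who wants to regenerate comparable per-seed splits rather than use the uploaded files, a sketch of seeded random splitting. The `make_split` helper and the train/val/test fractions are hypothetical, not the repo's actual procedure:

```python
import numpy as np

def make_split(slide_ids, seed, val_frac=0.1, test_frac=0.2):
    # Deterministic random split for a given seed; fractions are illustrative.
    rng = np.random.default_rng(seed)
    ids = np.array(slide_ids)
    rng.shuffle(ids)
    n = len(ids)
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    return {"test": ids[:n_test].tolist(),
            "val": ids[n_test:n_test + n_val].tolist(),
            "train": ids[n_test + n_val:].tolist()}

# One split per seed, mirroring the five-seed protocol mentioned above.
splits = {s: make_split([f"slide_{i}" for i in range(100)], seed=s)
          for s in range(5)}
```

Fixing the seed makes each split reproducible, which is exactly what sharing split files achieves without requiring readers to rerun this.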
Hi, I have one more question: what learning rate should be used to reproduce the results of ABMIL and TransMIL?
We set the learning rate to 0.0001 for ViT-based features and 0.0002 for ResNet-based features, for all MIL models.
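In code form, the reported settings amount to something like the following; `pick_lr` and the lookup table are hypothetical helpers, not part of the acmil repo:

```python
# Learning rates reported in this thread:
# 1e-4 for ViT-based features, 2e-4 for ResNet-based features, for all MIL models.
LEARNING_RATES = {"vit": 1e-4, "resnet": 2e-4}

def pick_lr(feature_type):
    # feature_type: "vit" or "resnet"; raises KeyError otherwise
    return LEARNING_RATES[feature_type]
```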
Thank you for your reply. I notice the paper mentions training the model for 100 epochs, so which model do you use for testing: the one with the minimal loss, or the one from the last epoch?
We observed that the model with the minimal validation loss and the one from the last epoch exhibit similar performance across three datasets. Following established studies, we chose to use the model with minimal validation loss. Thank you.
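The selection rule described above can be sketched as follows; `select_checkpoint` is an illustrative helper, not the repo's code:

```python
def select_checkpoint(val_losses):
    # Index of the epoch with minimal validation loss, e.g. over the
    # 100 training epochs mentioned in the thread. Ties go to the
    # earliest epoch, matching Python's min() behavior.
    return min(range(len(val_losses)), key=val_losses.__getitem__)
```

In practice one would save a checkpoint per epoch (or only when the validation loss improves) and load the checkpoint at this index for testing.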