Hi. A few days ago, I encountered an error while attempting to run the pretrained_ViT model. I managed to resolve it through another issue. Actually, the reason I attempted to run the pretrained_ViT model was because the results of the non-pretrained model were inconsistent with the results in the paper provided in this GitHub repository. Therefore, after resolving the pretrained issue, I trained the model with pretrained_ViT set to True, and obtained results for sequences 01, 03, 04, 05, 06, 07, and 10 as follows:
args = {
"data_dir": "data",
"bsize": 4, # batch size
"val_split": 0.1, # percentage to use as validation data
"window_size": 2, # number of frames in window
"overlap": 1, # number of frames overlapped between windows
"optimizer": "Adam", # optimizer [Adam, SGD, Adagrad, RAdam]
"lr": 1e-5, # learning rate
"momentum": 0.9, # SGD momentum
"weight_decay": 1e-4, # SGD momentum
"epoch": 100, # train iters each timestep
"weighted_loss": None, # float to weight angles in loss function
"pretrained_ViT": True, # load weights from pre-trained ViT
"checkpoint_path": "checkpoints/Exp_vit_base_2", # path to save checkpoint
"checkpoint": None, # checkpoint
}
# tiny - patch_size=16, embed_dim=192, depth=12, num_heads=3
# small - patch_size=16, embed_dim=384, depth=12, num_heads=6
# base - patch_size=16, embed_dim=768, depth=12, num_heads=12
model_params = {
"dim": 768,
"image_size": (192, 640), #(192, 640),
"patch_size": 16,
"attention_type": 'divided_space_time', # ['divided_space_time', 'space_only','joint_space_time', 'time_only']
"num_frames": args["window_size"],
"num_classes": 6 * (args["window_size"] - 1), # 6 DoF for each frame
"depth": 12,
"heads": 12,
"dim_head": 64,
"attn_dropout": 0.1,
"ff_dropout": 0.1,
"time_only": False,
}
The results are similar to the pretrained models provided on GitHub, namely Model1, Model2, and Model3.
It seems like I might have made a mistake somewhere. Could you kindly advise on what I should correct?