
qamface's People

Contributors

mccreezhao

qamface's Issues

High learning rate

I haven't read your paper yet, but I was wondering why an initial learning rate of 0.2 was chosen instead of the typical 0.1.
While training the IR_152 backbone, I saw the training loss spike sharply to around 100.
Also, have you tried retraining from a pretrained backbone?
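
A common way to tame that kind of early spike at a large initial LR is a short linear warmup. A minimal PyTorch sketch, not the repo's actual schedule; the model and optimizer settings here are illustrative only:

    import torch

    # Hypothetical model/optimizer; the 0.2 initial LR mirrors the question above.
    model = torch.nn.Linear(512, 1000)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.2, momentum=0.9)

    # Ramp the LR linearly from near 0 up to 0.2 over the first 1000 steps;
    # call scheduler.step() once per optimizer step.
    warmup_steps = 1000
    scheduler = torch.optim.lr_scheduler.LambdaLR(
        optimizer, lr_lambda=lambda step: min(1.0, (step + 1) / warmup_steps))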

about loss

The loss function used in the project is loss = self.focalLoss(thetas, labels). Is that focal loss rather than the Additive Angular Margin Loss?
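
For context, the two are not mutually exclusive: in ArcFace-style code the additive angular margin is applied inside the head to produce the logits (thetas), and focal loss is then the classification loss on top of those logits. A simplified sketch of that composition, assuming a standard ArcFace-style head rather than the repo's exact code:

    import torch
    import torch.nn.functional as F

    def aam_logits(embeddings, weight, labels, s=64.0, m=0.5):
        # Additive angular margin: cos(theta + m) for the target class,
        # plain cos(theta) for all other classes, scaled by s.
        cos = F.linear(F.normalize(embeddings), F.normalize(weight))
        theta = torch.acos(cos.clamp(-1 + 1e-7, 1 - 1e-7))
        target = F.one_hot(labels, num_classes=weight.size(0)).bool()
        return s * torch.cos(torch.where(target, theta + m, theta))

    # The margin lives in the logits; focal loss (or cross-entropy) is
    # applied on top, e.g. loss = focal_loss(aam_logits(...), labels).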

Support for multiple GPUs

I noticed there's no code to select which GPUs are used. There's also no synchronisation code in the loss function for multiple GPUs.

How did you train on 4 GPUs without this, or is it no longer needed in modern PyTorch?

I modified your QAMFace head to add the support:

        if self.device_id is None:
            # Single-device path: one matrix multiply against the
            # normalised class-weight matrix.
            kernel_norm = l2_norm(self.weight, axis=0)
            cos_theta = torch.mm(embbedings, kernel_norm)
        else:
            # Model-parallel path: split the class-weight matrix
            # column-wise across the listed GPUs, compute each partial
            # logit block on its own device, then concatenate the
            # blocks back on the first device.
            x = embbedings
            sub_kernels = torch.chunk(self.weight, len(self.device_id), dim=1)
            temp_x = x.cuda(self.device_id[0])
            kernel_norm = l2_norm(sub_kernels[0], axis=0).cuda(self.device_id[0])
            cos_theta = torch.mm(temp_x, kernel_norm)
            for i in range(1, len(self.device_id)):
                temp_x = x.cuda(self.device_id[i])
                kernel_norm = l2_norm(sub_kernels[i], axis=0).cuda(self.device_id[i])
                cos_theta = torch.cat(
                    (cos_theta, torch.mm(temp_x, kernel_norm).cuda(self.device_id[0])),
                    dim=1)
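
For comparison, a data-parallel alternative replicates the whole head on every GPU and splits the batch instead of the weight matrix, which avoids the manual chunking and concatenation above. A sketch with hypothetical stand-in modules; whether this matches the original training setup is an assumption:

    import torch
    import torch.nn as nn

    # Hypothetical stand-ins for the repo's backbone and QAMFace head.
    backbone = nn.Linear(112 * 112 * 3, 512)
    head = nn.Linear(512, 1000)

    # DataParallel replicates each module across the listed GPUs and
    # splits every batch along dim 0, so no manual weight chunking or
    # cross-GPU concatenation is needed.
    backbone = nn.DataParallel(backbone, device_ids=[0, 1, 2, 3]).cuda()
    head = nn.DataParallel(head, device_ids=[0, 1, 2, 3]).cuda()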

Not normalised -1 to 1?

Ideally the input data is normalised to the range [-1, 1], since fp32 and fp16 represent values in that range with better precision.
Is there a reason the normalisation is commented out in the code?
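
For reference, the usual torchvision way to map images into that range, assuming [0, 1] tensors from ToTensor (the exact mean/std the repo intends is an assumption):

    from torchvision import transforms

    # ToTensor yields values in [0, 1]; Normalize with mean=std=0.5 per
    # channel maps them to [-1, 1] via (x - 0.5) / 0.5.
    transform = transforms.Compose([
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),
    ])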

Batch size in config not used

The batch size in config.py is not used for anything; the -b command-line option is used instead.
This confused me at first. Perhaps the config value could at least serve as the default.
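
One way to reconcile the two would be to feed the config value in as the default for -b. A sketch, assuming config.py exposes a batch_size value (hypothetical name):

    import argparse

    from config import batch_size  # assumption: config.py defines batch_size

    parser = argparse.ArgumentParser()
    # -b still overrides on the command line; otherwise the config value applies.
    parser.add_argument('-b', '--batch-size', type=int, default=batch_size)
    args = parser.parse_args()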

focal loss

Hello and thanks for your work!

Did you try cross-entropy loss with your QAMFace head? If so, how did the results compare to focal loss?
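
As a side note, focal loss with gamma = 0 reduces exactly to cross-entropy, so the comparison can be run as a sweep over gamma. A minimal sketch of a standard focal loss, not necessarily the repo's implementation:

    import torch
    import torch.nn.functional as F

    def focal_loss(logits, labels, gamma=2.0):
        # log p_t for each sample's target class.
        logp_t = F.log_softmax(logits, dim=1).gather(1, labels.unsqueeze(1)).squeeze(1)
        # (1 - p_t)^gamma down-weights easy examples; gamma=0 gives plain CE.
        return (-(1 - logp_t.exp()) ** gamma * logp_t).mean()

    logits = torch.randn(8, 10)
    labels = torch.randint(0, 10, (8,))
    assert torch.allclose(focal_loss(logits, labels, gamma=0.0),
                          F.cross_entropy(logits, labels))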
