Hi, I use the L-Softmax loss on the new dataset, the accuarcy of m=1

Test the L-Softmax loss on new dataset about largemargin_softmax_loss HOT 6 CLOSED

wy1iu commented on May 23, 2024

Test the L-Softmax loss on new dataset

from largemargin_softmax_loss.

Comments (6)

wy1iu commented on May 23, 2024

What is your final loss value for m=4? Without knowing your settings, my best guess is that the m=4 loss have not been properly optimized. If your lambda is close to 0 (when the optimization is over) and the m=4 loss is reasonably low, it is highly unlikely the performance will go down.

from largemargin_softmax_loss.

dldl3D commented on May 23, 2024

My settings is as follows:
################## train ##################
layer {
name: "ip2"
type: "LargeMarginInnerProduct"
bottom: "bn_ip"
bottom: "label"
top: "ip2"
top: "lambda"
param {
name: "ip2"
lr_mult: 1
}
largemargin_inner_product_param {
num_output: 4
#type: SINGLE
#base: 0
type: QUADRUPLE
base: 900
gamma: 0.0000055
power: 40
iteration: 0
lambda_min: 0
weight_filler {
type: "msra"
}
}
include {
phase: TRAIN
}
}
################# test ##################
layer {
name: "ip2"
type: "LargeMarginInnerProduct"
bottom: "bn_ip"
bottom: "label"
top: "ip2"
top: "lambda"
param {
name: "ip2"
lr_mult: 0
}
largemargin_inner_product_param {
num_output: 4
type:SINGLE
base: 0
gamma: 0.0000055
power: 40
iteration: 0
lambda_min: 0
weight_filler {
type: "msra"
}
}
include {
phase: TEST
}
}
layer {
name: "accuracy"
type: "Accuracy"
bottom: "ip2"
bottom: "label"
top: "accuracy"
include {
phase: TEST
}
}
layer {
name: "loss"
type: "SoftmaxWithLoss"
bottom: "ip2"
bottom: "label"
top: "loss"
}
The lambda is 0.9726 when the optimization.

from largemargin_softmax_loss.

wy1iu commented on May 23, 2024

It seems your dataset is quite easy, since you can achieve quite good results with m=1. The final lambda is actually too high for such easy dataset. You should consider increase the parameter power or change the gamma a little bit, or even increase the iteration number in order to make lambda decrease more rapidly. I think the final lambda should be nearly 0, say below 10^-5, to achieve the best results.

from largemargin_softmax_loss.

dldl3D commented on May 23, 2024

When the lambda decrease rapidly, the network diverges. How can I fix it?

from largemargin_softmax_loss.

wy1iu commented on May 23, 2024

You should change gamma to a a smaller value, or set a small lambda_min.

from largemargin_softmax_loss.

wy1iu commented on May 23, 2024

BTW, I just fix a bug about lambda_min. You should use the new version

from largemargin_softmax_loss.

Test the L-Softmax loss on new dataset about largemargin_softmax_loss HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs