Softmax vs AM-Softmax Loss
Softmax loss is typically good at optimizing the inter-class difference (separating different classes), but it is not good at reducing the intra-class variation (making features of the same class compact). Additive Margin Softmax (AM-Softmax) is a more interpretable way to introduce an angular margin into the softmax loss.
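
The effect of an additive margin can be sketched in a few lines of plain Python. This is an illustration only, assuming the common formulation in which features and class weights are L2-normalized, a margin m is subtracted from the target-class cosine, and all logits are multiplied by a scale s. The function names and the default values s=30 and m=0.35 are assumptions chosen for the example, not something stated in the text above.

```python
import math


def l2_normalize(v):
    """Return v scaled to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def am_softmax_loss(features, weights, labels, s=30.0, m=0.35):
    """Mean additive-margin softmax loss over a batch (illustrative sketch).

    features: list of feature vectors, one per sample
    weights:  list of class weight vectors, one per class
    labels:   list of integer class indices, one per sample
    s, m:     scale and margin (assumed hyperparameters)
    """
    unit_weights = [l2_normalize(w) for w in weights]
    total = 0.0
    for f, y in zip(features, labels):
        f_hat = l2_normalize(f)
        # Cosine similarity between the feature and every class weight.
        cosines = [sum(a * b for a, b in zip(f_hat, w)) for w in unit_weights]
        # Subtract the margin from the target class only, then scale.
        logits = [s * (c - m) if j == y else s * c
                  for j, c in enumerate(cosines)]
        # Numerically stable log-sum-exp for the cross-entropy term.
        top = max(logits)
        lse = top + math.log(sum(math.exp(z - top) for z in logits))
        total += lse - logits[y]
    return total / len(features)


# Two classes with orthogonal weights; the sample is 45 degrees from each.
features = [[1.0, 1.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]
labels = [0]
plain = am_softmax_loss(features, weights, labels, m=0.0)
with_margin = am_softmax_loss(features, weights, labels, m=0.35)
print(plain < with_margin)
```

With m=0 the function reduces to an ordinary softmax cross-entropy over scaled cosines. A positive margin raises the loss for any sample that is not clearly closer to its own class than to the others, which is what pushes features of the same class to be more compact.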