Description
InfoNCE variants demonstrate direct and indirect coupling between the alignment and uniformity terms thus hurting optimisation. The Decoupled Hyperspherical Energy Loss (DHEL) is an NT-Xent variant that completly decouples alignment from uniformity by discarding the corresponding terms from the denominator. In this way optimisation is more efficient and robust to hyper parameter changes.