Back to Explorer
Research PaperResearchia:202602.27031[Data Science > Machine Learning]

Differentiable Zero-One Loss via Hypersimplex Projections

Camilo Gomez

Abstract

Recent advances in machine learning have emphasized the integration of structured optimization components into end-to-end differentiable models, enabling richer inductive biases and tighter alignment with task-specific objectives. In this work, we introduce a novel differentiable approximation to the zero-one loss-long considered the gold standard for classification performance, yet incompatible with gradient-based optimization due to its non-differentiability. Our method constructs a smooth, order-preserving projection onto the n,k-dimensional hypersimplex through a constrained optimization framework, leading to a new operator we term Soft-Binary-Argmax. After deriving its mathematical properties, we show how its Jacobian can be efficiently computed and integrated into binary and multiclass learning systems. Empirically, our approach achieves significant improvements in generalization under large-batch training by imposing geometric consistency constraints on the output logits, thereby narrowing the performance gap traditionally observed in large-batch training.


Source: arXiv:2602.23336v1 - http://arxiv.org/abs/2602.23336v1 PDF: https://arxiv.org/pdf/2602.23336v1 Original Link: http://arxiv.org/abs/2602.23336v1

Submission:2/27/2026
Comments:0 comments
Subjects:Machine Learning; Data Science
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Differentiable Zero-One Loss via Hypersimplex Projections | Researchia | Researchia