TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Label Smoothing

Label Smoothing

GeneralIntroduced 198514332 papers

Description

Label Smoothing is a regularization technique that introduces noise for the labels. This accounts for the fact that datasets may have mistakes in them, so maximizing the likelihood of log⁡p(y∣x)\log{p}\left(y\mid{x}\right)logp(y∣x) directly can be harmful. Assume for a small constant ϵ\epsilonϵ, the training set label yyy is correct with probability 1−ϵ1-\epsilon1−ϵ and incorrect otherwise. Label Smoothing regularizes a model based on a softmax with kkk output values by replacing the hard 000 and 111 classification targets with targets of ϵk\frac{\epsilon}{k}kϵ​ and 1−k−1kϵ1-\frac{k-1}{k}\epsilon1−kk−1​ϵ respectively.

Source: Deep Learning, Goodfellow et al

Image Source: When Does Label Smoothing Help?

Papers Using This Method

DASViT: Differentiable Architecture Search for Vision Transformer2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Langevin Flows for Modeling Neural Latent Dynamics2025-07-15Biological Processing Units: Leveraging an Insect Connectome to Pioneer Biofidelic Neural Architectures2025-07-15KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding2025-07-15Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI2025-07-13Learning from Synthetic Labs: Language Models as Auction Participants2025-07-12Comparative Analysis of Vision Transformers and Traditional Deep Learning Approaches for Automated Pneumonia Detection in Chest X-Rays2025-07-11Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving2025-07-08Geo-Registration of Terrestrial LiDAR Point Clouds with Satellite Images without GNSS2025-07-08Tile-Based ViT Inference with Visual-Cluster Priors for Zero-Shot Multi-Species Plant Identification2025-07-08A Wireless Foundation Model for Multi-Task Prediction2025-07-08Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate2025-07-08SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model2025-07-07Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations2025-07-07Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning2025-07-07AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models2025-07-07Fast and Simplex: 2-Simplicial Attention in Triton2025-07-03