Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Softmax

General · Introduced 2000 · 37448 papers

Description

The Softmax output function transforms a previous layer's output into a vector of probabilities. It is commonly used for multiclass classification. Given an input vector $x$ and a weight vector $w_j$ for each of the $K$ classes, we have:

$$P(y=j \mid x) = \frac{e^{x^{T}w_{j}}}{\sum_{k=1}^{K} e^{x^{T}w_{k}}}$$
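The formula can be illustrated with a minimal NumPy sketch. The function name `softmax_probs`, the layout of `W` (one column per class weight vector), and the sample values are assumptions made for illustration; they do not come from the page. Subtracting the maximum logit before exponentiating is a common numerical-stability trick that leaves the resulting probabilities unchanged.

```python
import numpy as np

def softmax_probs(x, W):
    """Return P(y=j | x) for j = 1..K.

    x: input vector of shape (d,)
    W: weight matrix of shape (d, K); column j is the weight vector w_j
       (this layout is an assumption for the example)
    """
    logits = x @ W                      # x^T w_j for every class j
    shifted = logits - np.max(logits)   # stability shift; cancels in the ratio
    exps = np.exp(shifted)
    return exps / exps.sum()            # normalize so the outputs sum to 1

# Hypothetical example with d = 2 features and K = 3 classes.
x = np.array([1.0, 2.0])
W = np.array([[0.50, -0.50, 0.0],
              [0.25,  0.25, 0.0]])
p = softmax_probs(x, W)
print(p)        # approximately [0.576, 0.212, 0.212]
print(p.sum())  # sums to 1 up to floating-point rounding
```

Here the logits are [1, 0, 0], so the first class receives probability e / (e + 2) and the other two share the remainder equally.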

Papers Using This Method

Making Language Model a Hierarchical Classifier and Generator (2025-07-17)
DASViT: Differentiable Architecture Search for Vision Transformer (2025-07-17)
Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows (2025-07-16)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)
SystolicAttention: Fusing FlashAttention within a Single Systolic Array (2025-07-15)
Langevin Flows for Modeling Neural Latent Dynamics (2025-07-15)
Generative Click-through Rate Prediction with Applications to Search Advertising (2025-07-15)
Biological Processing Units: Leveraging an Insect Connectome to Pioneer Biofidelic Neural Architectures (2025-07-15)
KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding (2025-07-15)
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking (2025-07-15)
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning (2025-07-14)
ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space (2025-07-14)
Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI (2025-07-13)
Learning from Synthetic Labs: Language Models as Auction Participants (2025-07-12)
Comparative Analysis of Vision Transformers and Traditional Deep Learning Approaches for Automated Pneumonia Detection in Chest X-Rays (2025-07-11)
Lizard: An Efficient Linearization Framework for Large Language Models (2025-07-11)
Chat-Ghosting: A Comparative Study of Methods for Auto-Completion in Dialog Systems (2025-07-08)
SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression (2025-07-08)
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving (2025-07-08)
Geo-Registration of Terrestrial LiDAR Point Clouds with Satellite Images without GNSS (2025-07-08)