TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Activation Regularization

Activation Regularization

GeneralIntroduced 200056 papers
Source Paper

Description

Activation Regularization (AR), or L_2L\_{2}L_2 activation regularization, is regularization performed on activations as opposed to weights. It is usually used in conjunction with RNNs. It is defined as:

αL_2(m∘h_t)\alpha{L}\_{2}\left(m\circ{h\_{t}}\right) αL_2(m∘h_t)

where mmm is a dropout mask used by later parts of the model, L_2L\_{2}L_2 is the L_2L\_{2}L_2 norm, and hth_{t}ht​ is the output of an RNN at timestep ttt, and α\alphaα is a scaling coefficient.

When applied to the output of a dense layer, AR penalizes activations that are substantially away from 0, encouraging activations to remain small.

Papers Using This Method

An Adaptive Method Stabilizing Activations for Enhanced Generalization2025-06-10Robust Anti-Backdoor Instruction Tuning in LVLMs2025-06-04A Descriptor Is All You Need: Accurate Machine Learning of Nonadiabatic Coupling Vectors2025-05-29Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications2025-02-27No Argument Left Behind: Overlapping Chunks for Faster Processing of Arbitrarily Long Legal Texts2024-10-24RICo: Reddit ideological communities2024-06-05Exploring Multi-Level Threats in Telegram Data with AI-Human Annotation: A Preliminary Study2023-12-15Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information2023-12-08Explainable and High-Performance Hate and Offensive Speech Detection2022-06-26IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languages2021-11-15Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling2021-08-27Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts2021-08-24Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System2021-07-20WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft Labels2021-04-12L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset2021-03-21indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers2021-02-09Experimental Evaluation of Deep Learning models for Marathi Text Classification2021-01-13LaDiff ULMFiT: A Layer Differentiated training approach for ULMFiT2021-01-13