Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/AWD-LSTM

AWD-LSTM

ASGD Weight-Dropped LSTM

SequentialIntroduced 200052 papers

Description

ASGD Weight-Dropped LSTM, or AWD-LSTM, is a type of recurrent neural network that employs DropConnect for regularization, as well as NT-ASGD for optimization - non-monotonically triggered averaged SGD - which returns an average of last iterations of weights. Additional regularization techniques employed include variable length backpropagation sequences, variational dropout, embedding dropout, weight tying, independent embedding/hidden size, activation regularization and temporal activation regularization.

Papers Using This Method

Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications2025-02-27 No Argument Left Behind: Overlapping Chunks for Faster Processing of Arbitrarily Long Legal Texts2024-10-24 RICo: Reddit ideological communities2024-06-05 Exploring Multi-Level Threats in Telegram Data with AI-Human Annotation: A Preliminary Study2023-12-15 Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information2023-12-08 Explainable and High-Performance Hate and Offensive Speech Detection2022-06-26 IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languages2021-11-15 Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling2021-08-27 Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts2021-08-24 Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System2021-07-20 WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft Labels2021-04-12 L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset2021-03-21 indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14 indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14 Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers2021-02-09 Experimental Evaluation of Deep Learning models for Marathi Text Classification2021-01-13 LaDiff ULMFiT: A Layer Differentiated training approach for ULMFiT2021-01-13 Post-Training Weighted Quantization of Neural Networks for Language Models2021-01-01 HinglishNLP at SemEval-2020 Task 9: Fine-tuned Language Models for Hinglish Sentiment Detection2020-12-01 Smash at SemEval-2020 Task 7: Optimizing the Hyperparameters of ERNIE 2.0 for Humor Ranking and Rating2020-12-01