Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Embedding Dropout

Embedding Dropout

GeneralIntroduced 200064 papers

Description

Embedding Dropout is equivalent to performing dropout on the embedding matrix at a word level, where the dropout is broadcast across all the word vector’s embedding. The remaining non-dropped-out word embeddings are scaled by $\frac{1}{1-p\_{e}}$ where $p\_{e}$ is the probability of embedding dropout. As the dropout occurs on the embedding matrix that is used for a full forward and backward pass, this means that all occurrences of a specific word will disappear within that pass, equivalent to performing variational dropout on the connection between the one-hot embedding and the embedding lookup.

Source: Merity et al, Regularizing and Optimizing LSTM Language Models

Papers Using This Method

Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications2025-02-27 No Argument Left Behind: Overlapping Chunks for Faster Processing of Arbitrarily Long Legal Texts2024-10-24 Leveraging Audio-Only Data for Text-Queried Target Sound Extraction2024-09-20 Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling2024-07-05 RICo: Reddit ideological communities2024-06-05 Exploring Multi-Level Threats in Telegram Data with AI-Human Annotation: A Preliminary Study2023-12-15 Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information2023-12-08 Contrastive Feature Masking Open-Vocabulary Vision Transformer2023-09-02 Learning Large Graph Property Prediction via Graph Segment Training2023-05-21 Explainable and High-Performance Hate and Offensive Speech Detection2022-06-26 IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languages2021-11-15 Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling2021-08-27 Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts2021-08-24 SHAQ: Single Headed Attention with Quasi-Recurrence2021-08-18 Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System2021-07-20 Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning2021-06-04 WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft Labels2021-04-12 L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset2021-03-21 indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14 indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14