TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Embedding Dropout

Embedding Dropout

GeneralIntroduced 200064 papers
Source Paper

Description

Embedding Dropout is equivalent to performing dropout on the embedding matrix at a word level, where the dropout is broadcast across all the word vector’s embedding. The remaining non-dropped-out word embeddings are scaled by 11−p_e\frac{1}{1-p\_{e}}1−p_e1​ where p_ep\_{e}p_e is the probability of embedding dropout. As the dropout occurs on the embedding matrix that is used for a full forward and backward pass, this means that all occurrences of a specific word will disappear within that pass, equivalent to performing variational dropout on the connection between the one-hot embedding and the embedding lookup.

Source: Merity et al, Regularizing and Optimizing LSTM Language Models

Papers Using This Method

Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications2025-02-27No Argument Left Behind: Overlapping Chunks for Faster Processing of Arbitrarily Long Legal Texts2024-10-24Leveraging Audio-Only Data for Text-Queried Target Sound Extraction2024-09-20Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling2024-07-05RICo: Reddit ideological communities2024-06-05Exploring Multi-Level Threats in Telegram Data with AI-Human Annotation: A Preliminary Study2023-12-15Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information2023-12-08Contrastive Feature Masking Open-Vocabulary Vision Transformer2023-09-02Learning Large Graph Property Prediction via Graph Segment Training2023-05-21Explainable and High-Performance Hate and Offensive Speech Detection2022-06-26IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languages2021-11-15Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling2021-08-27Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts2021-08-24SHAQ: Single Headed Attention with Quasi-Recurrence2021-08-18Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System2021-07-20Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning2021-06-04WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft Labels2021-04-12L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset2021-03-21indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages2021-02-14