TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/fastText

fastText

Natural Language ProcessingIntroduced 2000240 papers
Source Paper

Description

fastText embeddings exploit subword information to construct word embeddings. Representations are learnt of character nnn-grams, and words represented as the sum of the nnn-gram vectors. This extends the word2vec type models with subword information. This helps the embeddings understand suffixes and prefixes. Once a word is represented using character nnn-grams, a skipgram model is trained to learn the embeddings.

Papers Using This Method

Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Guarded Query Routing for Large Language Models2025-05-20Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data2025-05-08myNER: Contextualized Burmese Named Entity Recognition with Bidirectional LSTM and fastText Embeddings via Joint Training with POS Tagging2025-04-05A Data-driven Investigation of Euphemistic Language: Comparing the usage of "slave" and "servant" in 19th century US newspapers2025-03-19Text classification using machine learning methods2025-02-27Poster: Long PHP webshell files detection based on sliding window attention2025-02-26A Multi-tiered Solution for Personalized Baggage Item Recommendations using FastText and Association Rule Mining2025-01-16Sentiment Analysis in Twitter Social Network Centered on Cryptocurrencies Using Machine Learning2025-01-16Research on Violent Text Detection System Based on BERT-fasttext Model2024-12-21UnMA-CapSumT: Unified and Multi-Head Attention-driven Caption Summarization Transformer2024-12-16On Importance of Code-Mixed Embeddings for Hate Speech Identification2024-11-27BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings2024-11-26From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models2024-11-06Generic Embedding-Based Lexicons for Transparent and Reproducible Text Scoring2024-11-01LightFusionRec: Lightweight Transformers-Based Cross-Domain Recommendation Model2024-10-21Stress Detection on Code-Mixed Texts in Dravidian Languages using Machine Learning2024-10-08Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary?2024-10-04Individuation in Neural Models with and without Visual Grounding2024-09-27An Evaluation of Sindhi Word Embedding in Semantic Analogies and Downstream Tasks2024-08-28