TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Seq2Seq

Seq2Seq

Sequence to Sequence

Natural Language ProcessingIntroduced 2000700 papers
Source Paper

Description

Seq2Seq, or Sequence To Sequence, is a model used in sequence prediction tasks, such as language modelling and machine translation. The idea is to use one LSTM, the encoder, to read the input sequence one timestep at a time, to obtain a large fixed dimensional vector representation (a context vector), and then to use another LSTM, the decoder, to extract the output sequence from that vector. The second LSTM is essentially a recurrent neural network language model except that it is conditioned on the input sequence.

(Note that this page refers to the original seq2seq not general sequence-to-sequence models)

Papers Using This Method

CoVE: Compressed Vocabulary Expansion Makes Better LLM-based Recommender Systems2025-06-24Exploring Speaker Diarization with Mixture of Experts2025-06-17Transforming Chatbot Text: A Sequence-to-Sequence Approach2025-06-15Improving Bangla Linguistics: Advanced LSTM, Bi-LSTM, and Seq2Seq Models for Translating Sylheti to Modern Bangla2025-05-24Dense Communication between Language Models2025-05-19Diverse In-Context Example Selection After Decomposing Programs and Aligned Utterances Improves Semantic Parsing2025-04-04Non-Monotonic Attention-based Read/Write Policy Learning for Simultaneous Translation2025-03-28Minimal Time Series Transformer2025-03-12ControllableGPT: A Ground-Up Designed Controllable GPT for Molecule Optimization2025-02-15A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport2025-02-03COVE: COntext and VEracity prediction for out-of-context images2025-02-03Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement2025-01-22A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving2025-01-07AfriHG: News headline generation for African Languages2024-12-28Seq2Seq Model-Based Chatbot with LSTM and Attention Mechanism for Enhanced User Interaction2024-12-27DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak2024-12-23Compositional Generalization Across Distributional Shifts with Sparse Tree Operations2024-12-18On the Role of Surrogates in Conformal Inference of Individual Causal Effects2024-12-16FedPAW: Federated Learning with Personalized Aggregation Weights for Urban Vehicle Speed Prediction2024-12-02NushuRescue: Revitalization of the Endangered Nushu Language with AI2024-11-29