TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Zilong Wang, Xiang Zheng, Xiaosen Wang, Bo wang, Xingjun Ma et al.

2025-06-11Large Language ModelRed Teaming
Paper
ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese

Iago Alves Brito, Julia Soares Dollis, Fernanda Bufon Färber, Diogo Fernandes Costa Silva, Arlindo Rodrigues Galvão Filho et al.

2025-06-11Hate Speech DetectionMulti-Label Classification
Paper
Classifying Unreliable Narrators with Large Language Models

Anneliese Brei, Katharine Henry, Abhisheik Sharma, Shashank Srivastava, Snigdha Chaturvedi et al.

2025-06-11
PaperCode
TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games

Prakamya Mishra, Jiang Liu, Jialian Wu, Xiaodong Yu, Zicheng Liu et al.

2025-06-11MathLogical Reasoning
Paper
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval

Shubhashis Roy Dipta, Francis Ferraro

2025-06-11Video RetrievalText to Video RetrievalRetrieval
Paper
Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective

Yi Wang, Max Kreminski

2025-06-11Story Generation
Paper
Measuring Corporate Human Capital Disclosures: Lexicon, Data, Code, and Research Opportunities

Elizabeth Demers, Victor Xiaoqi Wang, Kean Wu

2025-06-11Management
Paper
Analyzing Emotions in Bangla Social Media Comments Using Machine Learning and LIME

Bidyarthi Paul, SM Musfiqur Rahman, Dipta Biswas, Md. Ziaul Hasan, Md. Zahid Hossain et al.

2025-06-11Sentiment AnalysisEmotion Recognition
Paper
Unsupervised Elicitation of Language Models

Jiaxin Wen, Zachary Ankner, Arushi Somani, Peter Hase, Samuel Marks et al.

2025-06-11TruthfulQAGSM8K
PaperCode
ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering

Caijun Jia, Nan Xu, Jingxuan Wei, Qingli Wang, Lei Wang et al.

2025-06-11Question AnsweringChart Question AnsweringImage to text+2
Paper
Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information

Christodoulos Constantinides, Shuxin Lin, Nianjun Zhou, Dhaval Patel

2025-06-11Large Language ModelLanguage Modelling
Paper
A quantum semantic framework for natural language processing

Christopher J. Agostino, Quan Le Thien, Molly Apsel, Denizhan Pak, Elina Lesyk et al.

2025-06-11
PaperCode
TaskCraft: Automated Generation of Agentic Tasks

Dingfeng Shi, Jingyi Cao, Qianben Chen, Weichen Sun, Weizhen Li et al.

2025-06-11
PaperCode
When Meaning Stays the Same, but Models Drift: Evaluating Quality of Service under Token-Level Behavioral Instability in LLMs

Xiao Li, Joel Kreuzwieser, Alan Peters

2025-06-11Diagnostic
PaperCode
When Large Language Models are Reliable for Judging Empathic Communication

Aakriti Kumar, Nalin Poungpeth, Diyi Yang, Erina Farrell, Bruce Lambert et al.

2025-06-11
PaperCode
A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy

Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Chunyu Miao, Dongyuan Li et al.

2025-06-11
PaperCode
CoRT: Code-integrated Reasoning within Thinking

Chengpeng Li, Zhengyang Tang, Ziniu Li, Mingfeng Xue, Keqin Bao et al.

2025-06-11Mathematical Reasoning
PaperCode
Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA

Nikolas Evkarpidi, Elena Tutubalina

2025-06-11Question AnsweringText-To-SQLText-to-Code Generation+4
PaperCode
Data-Driven Modeling of IRCU Patient Flow in the COVID-19 Pandemic

Ana Carmen Navas-Ortega, José Antonio Sánchez-Martínez, Paula García-Flores, Concepción Morales-García, Rene Fabregas et al.

2025-06-11Respiratory Failure
PaperCode
Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs

Beomsik Cho, Jaehyung Kim

2025-06-11Text GenerationVisual GroundingHallucination
PaperCode
PreviousPage 251 of 28782Next