TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Practical design and performance of physical reservoir computing using hysteresis

Yuhei Yamada

2025-07-08
Paper
evortran: a modern Fortran package for genetic algorithms with applications from LHC data fitting to LISA signal reconstruction

Thomas Biekötter

2025-07-08
Paper
Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization

Yuhang Li, Shiqi Chen, Tingyu Gong, Aydogan Ozcan

2025-07-08Image ClassificationImage Generation
Paper
TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data

Aravind Cheruvu, Shravya Kanchi, Sifat Muhammad Abdullah, Nicholas Kong, Daphne Yao et al.

2025-07-08Instruction FollowingSafety AlignmentChatbot
Paper
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Shangzhan Li, Zefan Wang, Ye He, YuXuan Li, Qi Shi et al.

2025-07-08Reinforcement Learningreinforcement-learning
PaperCode
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment

Yucheng Shi, Wenhao Yu, Zaitang Li, Yonglin Wang, Hongming Zhang et al.

2025-07-08
Paper
Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning

Hongbao Li, Ziye Jia, Sijie He, Kun Guo, Qihui Wu et al.

2025-07-08Trajectory PlanningScheduling
Paper
Robust Bandwidth Estimation for Real-Time Communication with Offline Reinforcement Learning

Jian Kai, Tianwei Zhang, Zihan Ling, Yang Cao, Can Shen et al.

2025-07-08Reinforcement LearningOffline RL
Paper
GTA1: GUI Test-time Scaling Agent

Yan Yang, Dongxu Li, Yutong Dai, Yuhao Yang, Ziyang Luo et al.

2025-07-08Visual GroundingReinforcement LearningTask Planning
PaperCode
Stable Acoustic Relay Assignment with High Throughput via Lase Chaos-based Reinforcement Learning

Zengjing Chen, Lu Wang, Chengzhi Xing

2025-07-08Decision Making
Paper
Differentiable Reward Optimization for LLM based TTS system

Changfeng Gao, Zhihao Du, Shiliang Zhang

2025-07-08Text to Speechtext-to-speech
PaperCode
BlueLM-2.5-3B Technical Report

Baojiao Xiong, Boheng Chen, Chengzhi Wang, Daxiong Luo, Dongsheng Xu et al.

2025-07-08Multimodal Large Language ModelLarge Language Model
Paper
FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models

Bo Pang, Yalu Ouyang, Hangfei Xu, Ziqi Jia, Panpan Li et al.

2025-07-08Reinforcement LearningLogical Reasoning
Paper
AI-Based Demand Forecasting and Load Balancing for Optimising Energy use in Healthcare Systems: A real case study

Iman Rahimi, Isha Patel

2025-07-08Time Series ForecastingDemand ForecastingManagement
Paper
Hierarchy or Heterarchy? A Theory of Long-Range Connections for the Sensorimotor Brain

Jeff Hawkins, Niels Leadholm, Viviane Clay

2025-07-08
Paper
AI-Reporter: A Path to a New Genre of Scientific Communication

Gerd Graßhoff

2025-07-08PhilosophySociology
Paper
Universal Embeddings of Tabular Data

Astrid Franz, Frederik Hoppe, Marianne Michaelis, Udo Göbel

2025-07-08Entity EmbeddingsOutlier Detection
Paper
Exploring Gain-Doped-Waveguide-Synapse for Neuromorphic Applications: A Pulsed Pump-Signal Approach

Robert Otupiri, Ripalta Stabile

2025-07-08
Paper
A Wireless Foundation Model for Multi-Task Prediction

Yucheng Sheng, Jiacheng Wang, Xingyu Zhou, Le Liang, Hao Ye et al.

2025-07-08Prediction IntervalsPrediction
Paper
Optimal Placement of Smart Hybrid Transformers in Distribution Networks

Samuel Hayward, Martin Doff-Sotta, Michael Merlin, Matthew Williams, Thomas Morstyn et al.

2025-07-08
Paper
PreviousPage 37 of 28782Next