Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

URL

Umbrella Reinforcement Learning

Reinforcement LearningIntroduced 200035 papers

Description

A computationally efficient approach for solving hard nonlinear problems of reinforcement learning (RL). It combines umbrella sampling, from computational physics/chemistry, with optimal control methods. The approach is realized on the basis of neural networks, with the use of policy gradient. It outperforms, by computational efficiency and implementation universality, the available state-of-the-art algorithms, in application to hard RL problems with sparse reward, state traps and lack of terminal states. The proposed approach uses an ensemble of simultaneously acting agents, with a modified reward which includes the ensemble entropy, yielding an optimal exploration-exploitation balance.

Papers Using This Method

PhishKey: A Novel Centroid-Based Approach for Enhanced Phishing Detection Using Adaptive HTML Component Extraction2025-06-26 WebGuard++:Interpretable Malicious URL Detection via Bidirectional Fusion of HTML Subgraphs and Multi-Scale Convolutional BERT2025-06-24 Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning2025-06-12 Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents2025-05-30 MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection2025-05-26 URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training2025-05-22 Streamline Without Sacrifice -- Squeeze out Computation Redundancy in LMM2025-05-21 Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection2025-05-21 Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation -- a Multilingual Perspective2025-05-09 Detecting Quishing Attacks with Machine Learning Techniques Through QR Code Analysis2025-05-06 Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning2025-05-06 Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System2025-05-02 Phishing URL Detection using Bi-LSTM2025-04-29 A Gradient-Optimized TSK Fuzzy Framework for Explainable Phishing Detection2025-04-25 From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code Repositories2025-04-23 Emerging Cyber Attack Risks of Medical AI Agents2025-04-02 MoonCast: High-Quality Zero-Shot Podcast Generation2025-03-18 VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search2025-03-13 DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms2025-03-05 PhishVQC: Optimizing Phishing URL Detection with Correlation Based Feature Selection and Variational Quantum Classifier2025-03-03