TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/URL

URL

Umbrella Reinforcement Learning

Reinforcement LearningIntroduced 200035 papers
Source Paper

Description

A computationally efficient approach for solving hard nonlinear problems of reinforcement learning (RL). It combines umbrella sampling, from computational physics/chemistry, with optimal control methods. The approach is realized on the basis of neural networks, with the use of policy gradient. It outperforms, by computational efficiency and implementation universality, the available state-of-the-art algorithms, in application to hard RL problems with sparse reward, state traps and lack of terminal states. The proposed approach uses an ensemble of simultaneously acting agents, with a modified reward which includes the ensemble entropy, yielding an optimal exploration-exploitation balance.

Papers Using This Method

PhishKey: A Novel Centroid-Based Approach for Enhanced Phishing Detection Using Adaptive HTML Component Extraction2025-06-26WebGuard++:Interpretable Malicious URL Detection via Bidirectional Fusion of HTML Subgraphs and Multi-Scale Convolutional BERT2025-06-24Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning2025-06-12Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents2025-05-30MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection2025-05-26URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training2025-05-22Streamline Without Sacrifice -- Squeeze out Computation Redundancy in LMM2025-05-21Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection2025-05-21Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation -- a Multilingual Perspective2025-05-09Detecting Quishing Attacks with Machine Learning Techniques Through QR Code Analysis2025-05-06Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning2025-05-06Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System2025-05-02Phishing URL Detection using Bi-LSTM2025-04-29A Gradient-Optimized TSK Fuzzy Framework for Explainable Phishing Detection2025-04-25From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code Repositories2025-04-23Emerging Cyber Attack Risks of Medical AI Agents2025-04-02MoonCast: High-Quality Zero-Shot Podcast Generation2025-03-18VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search2025-03-13DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms2025-03-05PhishVQC: Optimizing Phishing URL Detection with Correlation Based Feature Selection and Variational Quantum Classifier2025-03-03