MIRA Lab
Tackling Heavy-Tailed Q-Value Bias in Offline-to-Online Reinforcement Learning with Laplace-Robust Modeling
April 2026
Ruibo Guo, Rui Yang, Lei Liu, Junjie Shen, Guoping Wu, Jie Wang, Bin Li
Type: Conference paper
Publication: The Fourteenth International Conference on Learning Representations
Related
Learning Robust Representations with Long-Term Information for Generalization in Visual Reinforcement Learning
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
LogicTree: Improving Complex Reasoning of LLMs via Instantiated Multi-step Synthetic Logical Data
MILP-StuDio: MILP Instance Generation via Block Structure Decomposition