Code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL"
reinforcement-learning ensemble ensemble-learning rnd deep-q-network reward-design reward-shaping exploration-exploitation value-based-methods reward-engineering offline-reinforcement-learning dqn-rnd ensemble-rl
Updated Oct 29, 2023 - Python