You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As per Jaksch et. al 2010, the confidence intervals for UCRL2 use t_k := the timestep at the start of episode k. However, in run_finite_tabular_experiment in experiment.py, the episode index is wrongly passed instead of the timestep.
UCFH is also affected by this bug.
The text was updated successfully, but these errors were encountered:
vzhuang
changed the title
UCRL2 confidence intervals are incorrect
UCRL2/UCFH confidence intervals are incorrect
Jan 29, 2020
Right, it's a simple fix. Since the time is inside a log factor, this can't be "fixed" by adjusting the scaling constant. I'm guessing it probably has at least a small impact on your results depending on if you tune the scaling factor.
As per Jaksch et. al 2010, the confidence intervals for UCRL2 use t_k := the timestep at the start of episode k. However, in
run_finite_tabular_experiment
inexperiment.py
, the episode index is wrongly passed instead of the timestep.UCFH is also affected by this bug.
The text was updated successfully, but these errors were encountered: