Less Dependency on First Observation
Currently we only initialize hypotheses on the first step of an episode which make us dependent on the first observation having low noise .
Currently we only initialize hypotheses on the first step of an episode which make us dependent on the first observation having low noise .