Reinforcement Learning Results

Qualitative rollout comparisons across models and environments

Random Policy Baseline

Center Point
Center Point trajectory
Inside Room
Inside Room trajectory
Trapped Corner
Trapped Corner trajectory

Local Attention ResNet-18

Center Point
Center Point trajectory
Inside Room
Inside Room trajectory
Trapped Corner
Trapped Corner trajectory

Sliding Window Transformer (Random Embedding)

Center Point
Center Point trajectory
Inside Room
Inside Room trajectory
Trapped Corner
Trapped Corner trajectory

Sliding Window Transformer (No Curiosity)

Center Point
Center Point trajectory
Inside Room
Inside Room trajectory
Trapped Corner
Trapped Corner trajectory

Sliding Window Transformer (PCA Head)

Center Point
Center Point trajectory
Inside Room
Inside Room trajectory
Trapped Corner
Trapped Corner trajectory

New Environment (PCA Head)

Center Point
Center Point trajectory
Bedroom
Bedroom trajectory
Toilet
Toilet trajectory