MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins

arXiv:2606.26392v1 Announce Type: new

Abstract: Reinforcement learning (RL) for locomotion frequently converges to locally optimal but undeployable behaviors, such as vibrating limbs or scooting on the torso, that maximize return without producing a usable gait. We present MPC-Injection, a low-over