A VLA that Learns from Experience

A method for training our generalist policies with RL to improve success rate and throughput on real-world tasks.