A VLA that Learns from Experience
A method for training our generalist policies with RL to improve success rate and throughput on real-world tasks.