The paper is titled, Vision-Language-Action Models Transfer Web Knowledge to Robotic Control, and reveals new capabilities to transfer web knowledge to real-world robot: "High-capacity models pretrained on broad web-scale datasets provide an effective and powerful platform for a wide range of downstream tasks."
Source