Wednesday, June 17, 2026
Technology

Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it.

CitrixNews Staff

Two weeks ago, OpenAI said it would relaunch the robotics program it shuttered in 2021, the latest signal that the biggest AI labs are racing to teach machines to operate in the physical world. But building capable robots requires something the AI industry doesn’t yet have: training data comparable to what was used for language models.

That gap is creating a new kind of infrastructure business. Unlike LLMs, which were trained on a vast sea of publicly available text, robots need data that captures physical interaction, and that kind of data barely exists. YouTube videos and footage captured by gig workers are low-fidelity and hard to reconcile with the physical world.

Originally reported by TechCrunch. Read the full story at the original source.