We turn raw robot & egocentric footage into training-ready episodes: object tracks, hand poses, action labels.
Equip your own operators or tap the Rimb contributor network. The capture app guides every clip so it clears QA the first time, with consent recorded at the source.
Connect a bucket, paste a link, or shoot in the app. Clips are ingested, deduped, and versioned into a single training set.
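A minimal sketch of what "deduped and versioned" could look like, assuming exact-byte duplicates are detected by content hash and a set version is derived from the hashes it contains. The function names are hypothetical and the page does not describe the actual pipeline; near-duplicate footage would need a perceptual comparison that this sketch does not attempt.

```python
import hashlib


def content_hash(data: bytes) -> str:
    """SHA-256 hex digest of a clip's raw bytes."""
    return hashlib.sha256(data).hexdigest()


def dedupe(clips: dict[str, bytes]) -> dict[str, str]:
    """Map each unique content hash to the first clip name that carried it."""
    unique: dict[str, str] = {}
    for name, data in clips.items():
        unique.setdefault(content_hash(data), name)
    return unique


def dataset_version(unique: dict[str, str]) -> str:
    """Short version id derived from the sorted content hashes (order-independent)."""
    joined = "\n".join(sorted(unique)).encode()
    return hashlib.sha256(joined).hexdigest()[:12]


# Hypothetical clips: a_copy.mp4 is a byte-for-byte duplicate of a.mp4.
clips = {
    "a.mp4": b"frame-data-1",
    "b.mp4": b"frame-data-2",
    "a_copy.mp4": b"frame-data-1",
}
unique = dedupe(clips)
version = dataset_version(unique)
```

Because the version id depends only on which clips are in the set, adding or removing a clip changes it, while re-uploading an identical file does not.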
Drag the scrubber on a real annotated episode below: object tracks, hand poses, and action labels move with every frame. It's the exact format you'd receive.
Each episode arrives as a stack of reviewed layers, not a raw clip: object tracks, hand poses, action and language labels, plus the proprioceptive signals manipulation policies train on.
Every object detected and tracked frame-to-frame with a stable identity.
Both hands as 21-keypoint poses, grasp by grasp.
Temporal segments marking each action and sub-task in the episode.
Action-aligned natural-language descriptions, anchored to the frames.
Per-episode provenance: who appears, what they agreed to, what rights attach.
Per-frame 8-dim proprioceptive state and action vectors, in RLDS / LeRobot.
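The layers listed above can be pictured as one record per frame plus a list of temporal segments. This is an illustrative sketch only: the class and field names are hypothetical, the keypoints are assumed to be 2D image coordinates, the box layout is assumed to be (x, y, w, h), and the 8-dim size is assumed to apply to both the state and the action vector. The delivered schema is whatever RLDS or LeRobot defines.

```python
from dataclasses import dataclass
from typing import Optional

NUM_HAND_KEYPOINTS = 21  # stated in the text
VECTOR_DIM = 8           # stated in the text; assumed to cover state and action


@dataclass
class ActionSegment:
    start_frame: int   # inclusive
    end_frame: int     # inclusive
    label: str         # short action or sub-task label
    description: str   # natural-language description anchored to these frames


@dataclass
class Frame:
    index: int
    object_boxes: dict[int, tuple[float, float, float, float]]  # stable track id -> box
    hands: dict[str, list[tuple[float, float]]]                 # "left"/"right" -> keypoints
    state: list[float]                                          # proprioceptive state
    action: list[float]                                         # action vector


def validate_frame(frame: Frame) -> None:
    """Raise ValueError if a hand pose or vector has the wrong size."""
    for side, keypoints in frame.hands.items():
        if len(keypoints) != NUM_HAND_KEYPOINTS:
            raise ValueError(f"{side} hand has {len(keypoints)} keypoints")
    for name, vector in (("state", frame.state), ("action", frame.action)):
        if len(vector) != VECTOR_DIM:
            raise ValueError(f"{name} vector has {len(vector)} dims")


def segment_at(segments: list[ActionSegment], frame_index: int) -> Optional[ActionSegment]:
    """Return the segment covering frame_index, or None if the frame is unlabeled."""
    for segment in segments:
        if segment.start_frame <= frame_index <= segment.end_frame:
            return segment
    return None


frame = Frame(
    index=0,
    object_boxes={7: (10.0, 20.0, 64.0, 48.0)},
    hands={"left": [(0.0, 0.0)] * 21, "right": [(0.0, 0.0)] * 21},
    state=[0.0] * 8,
    action=[0.0] * 8,
)
validate_frame(frame)
segments = [ActionSegment(0, 29, "grasp", "Right hand grasps the mug handle.")]
```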
Unreviewed labels add noise your model will memorize. We set the reject criteria up front, then check every episode before it ships.
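As a rough illustration of "reject criteria up front, then check every episode", the sketch below declares the criteria as named checks before any data is seen and ships only episodes that trip none of them. The three criteria and the episode fields are hypothetical examples, not the actual review rubric.

```python
from typing import Callable

Criterion = Callable[[dict], bool]  # returns True when the episode violates the rule

# Hypothetical criteria, fixed before review begins.
REJECT_CRITERIA: dict[str, Criterion] = {
    "too_short": lambda ep: ep["num_frames"] < 30,
    "unlabeled_frames": lambda ep: ep["frames_with_action_label"] < ep["num_frames"],
    "no_consent": lambda ep: not ep.get("consent"),
}


def review(episode: dict) -> list[str]:
    """Return the names of every criterion the episode violates."""
    return [name for name, violated in REJECT_CRITERIA.items() if violated(episode)]


def shippable(episodes: list[dict]) -> list[dict]:
    """Keep only episodes with no violations."""
    return [ep for ep in episodes if not review(ep)]


good = {"num_frames": 120, "frames_with_action_label": 120, "consent": {"source": "app"}}
gappy = {"num_frames": 120, "frames_with_action_label": 95, "consent": {"source": "app"}}
passed = shippable([good, gappy])
```

Returning the names of the failed criteria, rather than a bare pass or fail, keeps each rejection traceable to a rule that was agreed on beforehand.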
The same episode, written to the format your training stack already reads, so there's no glue code to write.
dataset/
├─ data/*.parquet
└─ videos/*.mp4

step {
  observation,
  action, reward
}

episode.mcap
ros2 · foxglove ready
Every episode carries a consent packet: who appears, what they agreed to, where the footage came from, and what rights attach. It's stored as fields in the data, not a separate PDF.
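To make "fields in the data, not a separate PDF" concrete, here is a hedged sketch of a consent packet attached to an episode record. The four fields mirror the four items named above; the class name, field names, and example values are hypothetical, and the actual schema may differ.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ConsentPacket:
    participants: list[str]  # who appears (pseudonymous ids assumed)
    agreed_to: list[str]     # what they agreed to
    source: str              # where the footage came from
    rights: list[str]        # what rights attach


def attach_consent(episode: dict, packet: ConsentPacket) -> dict:
    """Return a copy of the episode record with the consent packet embedded."""
    return {**episode, "consent": asdict(packet)}


packet = ConsentPacket(
    participants=["operator-017"],
    agreed_to=["hand and workspace capture", "use for model training"],
    source="capture app",
    rights=["training use"],
)
record = attach_consent({"episode_id": "ep-0001"}, packet)
serialized = json.dumps(record)  # travels with the episode like any other field
```

Keeping the packet inside the record means a filter such as "only episodes whose rights include training use" is an ordinary query over the dataset.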
Egocentric footage is footage of people: hands, homes, workplaces. Illinois BIPA and Texas CUBI are long-standing biometric consent laws, and California's AB 2013 put training-data transparency duties into effect on Jan 1, 2026. In the EU, the AI Act's Article 10 adds data-governance duties for high-risk systems. Buyers increasingly ask where the footage came from before they buy.
We are not lawyers. We do not certify compliance. We structure the documentation so you can.
We capture footage wherever the work happens: kitchens, warehouses, labs.
Wet, occluded, deformable objects handled at speed.
Tidying, laundry, and kitchen chores: long-horizon tasks in cluttered, real homes.
Pick, pack, and tote handling at fulfillment speed, across a changing mix of SKUs.
Contact-rich insertion, fastening, and fixture work that needs frame-accurate labels.
Shelf restocking and front-of-house handling in spaces shared with people.
Tube, tray, and instrument handling under protocol, where provenance matters.
Describe your robot task below. If it's a fit, we'll annotate 5 episodes on that task and send them back free in 7–10 days. No commitment required.
STEP 1 · FREE
Describe your task below. If it's a fit, we annotate 5 episodes and send them back free in 7–10 days. No commitment.
RLDS · LeRobot v2.1 · MCAP
STEP 2 · AFTER SAMPLE
If the sample confirms fit, we move to a fixed-scope pilot on your task, evaluated against a metric you define in writing before we start.
object tracks · hand poses · language · consent