Imitation Learning
Learning from demonstrations โ BC, DAgger, diffusion policies.
Why it matters in robotics
Imitation learning is the dominant paradigm behind today's real-robot manipulation and vision-language-action (VLA) policies, so it shows up constantly in robot-learning and applied-ML interviews. Interviewers probe whether you understand why naive behavior cloning fails (covariate shift and compounding error) and can explain the standard fixes: DAgger and interactive imitation, action chunking with temporal ensembling, and diffusion/flow policies for multimodal action distributions. Expect questions connecting these ideas to systems you should recognize by name (ACT/ALOHA, Diffusion Policy, OpenVLA, pi-0) and to practical concerns like demonstration collection via teleoperation. A common trap is the mode-averaging failure of MSE regression on multimodal data, which tests whether you can reason about the policy's representational capacity rather than just its loss. Strong candidates can sketch the theory (horizon-dependent error growth) and the engineering tradeoffs in the same answer.
Application focus
The same topic, tailored to the robot you're building. Your choice is remembered across the roadmap and every topic.
At a glance
How behavior cloning fails via covariate shift and how DAgger fixes it by relabeling the learner's own states.
What to study
- โBehavior cloning as supervised learning, and why covariate shift causes compounding errors that grow with the task horizon.
- โDAgger and interactive imitation: aggregating expert labels on the learner's own state distribution to break the i.i.d. assumption.
- โAction chunking and temporal ensembling (ACT) to reduce compounding error, and diffusion/flow policies for multimodal action distributions.
- โHow imitation learning underpins modern VLA policies (OpenVLA, pi-0) and the practicalities of demonstration collection via teleoperation.
Study by time budget
Pick the path that fits the time you have before your interview.