Statistical Analysis for Trustworthy Robot Evaluation

Date: April 16, 2025

[Link to Event]

Lecture on confidence bounds for performance estimation and hypothesis tests for policy comparison.

See Also

Relevant papers are listed below:

Is Your Imitation Learning Policy Better than Mine? Policy Comparison with Near-Optimal Stopping
How Generalizable is My Behavior Cloning Policy? A Statistical Approach to Trustworthy Performance Evaluation