Good that you already thought about this list of "claims". The next step would be to focus on a selected functionality and its most interesting effect for the use case (i.e. the claim to be tested). Concerning speed, fluency is a measure that you seem to aim at (a well-know measure in human-robot interaction research).
Good that you already thought about this list of "claims". The next step would be to focus on a selected functionality and its most interesting effect for the use case (i.e. the claim to be tested). Concerning speed, fluency is a measure that you seem to aim at (a well-know measure in human-robot interaction research).