Changes for page 4. Evaluation Methods
Last modified by Demi Breen on 2023/04/09 14:54
To version 8.1
edited by Demi Breen
on 2023/04/09 14:54
on 2023/04/09 14:54
Change comment:
There is no comment for this version
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Author
-
... ... @@ -1,1 +1,1 @@ 1 - xwiki:XWiki.MarkNeerincx1 +XWiki.Demibreen1000 - Content
-
... ... @@ -1,21 +1,22 @@ 1 - 1 +A within-subject designed experiment is when each participant is exposed to more than one experiment under testing. A between-subject design is when participants only do one experiment [1]. With the within-subject design, a risk is the so-called 'demand effect', which entails that they might expect the researchers to want certain results, and will then act as such. Another thing that might happen with within-subject design is that participants might experience a learning effect, i.e. learning from the first experiment. [2] 2 2 3 -Our research topic is: 4 -The effect of goal-based and emotion-based explanations in prompting PwD for physical activity. 5 -\\We will create two systems, both trying to motivate the PwD to go for a walk in the garden. One will use goal-based explanations and the other emotion-based explanations. Maybe we would also need a control group (no explanation), resulting in three systems. 3 +Quite some established questionnaires exist regarding human-robot interaction. However, most are more about the usability of a system where the user has a specific goal. Examples of these questionnaires are SASSI [3], SUS [4], and APA [5]. Questionnaires also concerning the robot's perceived likeability and general interaction are GodSpeed [6] and a questionnaire proposed by Herink et al. [7], where the latter is more elaborate. [8] proposes the Self-Assessment Manikin (SAM), a non-verbal assessment based on pictures used to measure pleasure, arousal, and dominance as a reaction to some form of stimulation. Finally, [9] explains the AffectButton, an interface component that lets users enter the most appropriate expression by moving their mouse to the proper location. 6 6 7 -Independent effect: explanation method 8 8 9 - Dependenteffect: motivation to go to the garden6 +=== References === 10 10 11 -Confounding effects: Personal enjoyment of nature, weather, 8 +[1] Greenwald, A. G. (1976). Within-subjects designs: To use or not to use?. //Psychological Bulletin//, //83//(2), 314. 9 +[2] Seltman, H. J. (2012). Experimental design and analysis (pp. 340) 12 12 11 +[3] Hone, K. S., & Graham, R. (2000). Towards a tool for the subjective assessment of speech system interfaces (SASSI). //Natural Language Engineering//, //6//(3-4), 287-303. 13 13 14 -The between-subject study design fits with the limited time that we have. It also makes sure there's no learning effect like what could occur with a within-subject study. We do have to take into account the potential differences between the groups meaning we cannot take the results as a direct conclusion. 13 +[4] Lewis, J. R. (2018). The system usability scale: past, present, and future. //International Journal of Human–Computer Interaction//, //34//(7), 577-590. 14 +[5] Fitrianie, S., Bruijnes, M., Li, F., Abdulrahman, A., & Brinkman, W. P. (2022, September). The artificial-social-agent questionnaire: establishing the long and short questionnaire versions. In //Proceedings of the 22nd ACM International Conference on Intelligent Virtual Agents// (pp. 1-8). 15 15 16 - Aftereach evaluationsession,theparticipantwill be askedto fill in a questionnaire.There's quiteomeexisting forhuman-robot interaction. However, they are more about the usability of the system. While we see our system justas a conversationalmotivator for going outside.So we don'tsee these questionnaires as fit:16 +[6] Bartneck, C. (2023). Godspeed Questionnaire Series: Translations and Usage. 17 17 18 - SASSI,SUS(SystemUsabilityscale),Godspeedquestionnaire,ASA questionnaire,AttrakDiff,SUISQ18 +[7] Heerink, M., Krose, B., Evers, V., & Wielinga, B. (2009, September). Measuring acceptance of an assistive social robot: a suggested toolkit. In RO-MAN 2009-The 18th IEEE International Symposium on Robot and Human Interactive Communication (pp. 528-533). IEEE. 19 19 20 +[8] Bradley, M. M., & Lang, P. J. (1994). Measuring emotion: the self-assessment manikin and the semantic differential. //Journal of behavior therapy and experimental psychiatry//, //25//(1), 49-59. 20 20 21 - Allparticipantsoftheevaluationwillbepartfthe course.So they willallbefamiliarwith the robotinquestion. They willallbestudents at theTU Delftaged20-25.22 +[9] Broekens, J., & Brinkman, W. P. (2013). AffectButton: A method for reliable and valid affective self-report. //International Journal of Human-Computer Studies//, //71//(6), 641-667.