Changes for page 4. Evaluation Methods
Last modified by Manali Shah on 2023/04/10 12:28
From version 8.1
edited by Manali Shah
on 2023/04/10 12:28
on 2023/04/10 12:28
Change comment:
There is no comment for this version
To version 4.2
edited by Manali Shah
on 2023/03/29 12:16
on 2023/03/29 12:16
Change comment:
There is no comment for this version
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1,28 +1,24 @@ 1 -The following steps we reused to design and evaluate the prototype proposed against the corresponding control condition:1 +The following steps will be used to design and evaluate the prototype proposed against the corresponding control condition: 2 2 3 -~1. Confirm the prototype: For the pilotstudy,the scenario to be tested, and the control situation weresetupat theInsyght Lab at TU Delft, and preliminary testing wasdone by the team members. This includes the robots with and without interactive storytelling whichwereconfirmed and working.The voice input and touch input to the robot were verified.3 +~1. Confirm the prototype: The prototype for the scenario to be tested, and the control situation will first be setup, and preliminary testing will be done by the team members. This includes the robots with and without interactive storytelling which should be confirmed and working. 4 4 5 -2. Develop Questions: We now develop the metrics on which the robot must be evaluated. We decided to use a modified version of the Godspeed questionnaire, which each participant was made to fill after interacting with the robot. This questionnaire has been elaborated below.5 +2. Develop Questions: 6 6 7 -3. Invite participants:Dueto limited time and resources, patientswith dementia (the actual users) couldnot be used for the study. We instead use TU Delft students to test the prototype.7 +3. Design Methods 8 8 9 +4. Implement and adapt: 9 9 11 +5. Make decisions: 12 + 10 10 **Research Question** 11 11 12 -" Can personalized,interactive storytellinghave apositiveeffecton thewell-beingofpeoplewithdementia?"15 +"Is interactive storytelling more engaging and beneficial than storytelling in the third person for persons suffering from dementia?" 13 13 14 -Sub RQ1: Does it improve the patient’s mood? 15 -Sub RQ2: Does it spark interactions with other people? 16 -Sub RQ3: Does it motivate them to complete their daily activities? 17 -Sub RQ4: Does it promote memory retention? 18 -Sub RQ5: Does it improve the storytelling experience? 19 - 20 20 Thus, our control situation is the scenario of a robot narrating a story without any involvement of the patient, and the scenario we want to evaluate is the one where the robot narrates the same story while trying to engage and take inputs from the patient. With this, we aim to find whether it is beneficial and engaging for patients with dementia. 21 21 22 - 23 23 **The Within-Subject Design** 24 24 25 -As part of the experiment design, we chose the within subject design over between subject. This means that each participant will interact with the robot twice. This was done due to the limited number of participants, and to avoid any biases of participant preferences. However, we ensure that the order of talking to each robot changes with the participant, i.e, half the participants talk to the robot in the control situation first and then the robot in the experimental situation. For the other half this order is reversed. This was done to avoid carry over effects.21 +As part of the experiment design, we chose the within subject design over between subject. This means that each participant will interact with the robot twice. This was done due to the limited number of participants, and to avoid any biases of participant preferences. 26 26 27 27 28 28 **Summative Evaluation** ... ... @@ -32,21 +32,11 @@ 32 32 33 33 **Questionnaire** 34 34 35 - We used amodifiedversion of the Godspeed questionnaire forourevaluation [1]. It measures the **anthropomorphism, animacy, likeability, intelligence, and safety**ofthe robot. This uses a Likert scale where the user must rate questions as a number between 1 and 5; both numbers being at opposite poles. We decided to go ahead with the Godspeed questionnaire, because in dealing with patients with dementia, it seemed relevant to measure the above mentioned 5 characteristics of the robot, as they play an important role in making the patient feel more comfortable and at ease.31 +-modified godspeed questionnaire for robot 36 36 37 - To measure whether patientswith dementiacompletedtheactivitytheywere meant to do, and to evaluatewhether storytelling made a difference totheirmeal, weaddedthe following questions:33 +-statistical test (p value) for evaluation 38 38 39 -1. Please rate the question according to the following attributes. - Mood of the patient after the activity. (Scale of 1 to 5) 40 40 41 -2. Please rate the question according to the following attributes. - Patient's feedback about the story experience (Scale of 1 to 5) 42 - 43 -3. Please rate the question according to the following attributes. - Patient's enjoyment (Scale of 1 to 5) 44 - 45 -4. Did the patient complete the activity? (Yes/No) 46 - 47 -5. How many minutes did the patient take to complete the activity? (<10 minutes, 10-25 minutes, 25-40 minutes, >40 minutes) 48 - 49 - 50 50 **Prototype** 51 51 52 52 We present a low fidelity prototype of the robot, which means a simple demonstration of the initial stages of the robot, meant for formative feedback. We wizard-of-oz the approach, and for now just present one story (in interactive and non interactive modes) for purposes of the experiment. The final robot is expected to have various templates of stories. ... ... @@ -54,10 +54,7 @@ 54 54 For prototyping, we will use incremental prototyping, which means adding features one by one and testing for each. We start with the most basic feature, complete a cycle of testing, and then add on new features to create new versions of the prototype. For the robot, we will first build the non interactive storytelling robot, then add music to it, and then add gestures. With each stage, we test the working of it, and if working as expected, we will move on to adding the next feature. 55 55 56 56 57 -**Evaluation of Results** 58 58 59 - We decided to use the**paired sampled t test** sincetheexperimentwasa**withinsubject** experiment.The **one tailedt test**was usedsincewe wantto find if onecondition isbetter thanthe other. Though the one tailed t tests more powerful, itcould be debatablewhether it isbetterthanthe two tailedt testin this scenario, sincewith theonetailed t test, weassume alreadythatthe experimentalscenariowill performbetter thanthe controlscenario.44 +**Since we don't have many participants, should we skip the statistical test? Can we just report average values of responses for both scenarios?** 60 60 61 - 62 - 63 -[1]C. Bartneck, D. Kuli´c, E. Croft, and S. Zoghbi, “Measurement instruments for the anthropomorphism, animacy, likeability, perceived intelligence, and perceived safety of robots,” International Journal of Social Robotics, vol. 1, no. 1, p. 71–81, 2008. 46 +**Questionnaire should be a formal one, or should we ask 4-5 questions through Pepper? Or both?**