Changes for page Test
                  Last modified by Andrei Stefan on 2022/04/04 13:38
              
      
      From version  94.1
 94.1  
    
     94.1
 94.1  
    
              edited by Xinqi Li
        
on 2022/04/02 01:33
     on 2022/04/02 01:33
      Change comment:
              There is no comment for this version
          
         
      To version  33.1
 33.1  
    
     33.1
 33.1  
    
              edited by Andrei Stefan
        
on 2022/03/14 22:48
     on 2022/03/14 22:48
      Change comment:
              There is no comment for this version
          
         Summary
Details
- Page properties
- 
      - Author
-   ... ... @@ -1,1 +1,1 @@ 1 -XWiki. mona981 +XWiki.AndreiStefan 
- Content
-   ... ... @@ -1,131 +3,128 @@ 1 -Our robot aims to help delay the stage of dementia or slow down the deterioration of memory. The best situation is that we can test the robot with real PwD and in a relatively long time period to see if this robot really works, which is impossible for our project. So our evaluation performs in a group control way. Participants are divided into two groups, group A with the intelligent one, and group B with the dumb one. 2 - 3 3 = Problem statement and research questions = 4 4 5 -The main use cases that the evaluation focuses on are UC001: Daily todo list and UC005: Quiz. Based on the claims corresponding to those use cases, we derive the following research questions: 6 - 7 -1. Are PwD willing to play the quiz? 8 -1. Are PwD happy to listen to music? 9 -1. Are PwD happy if they get the correct answer? 10 -1. Does PwD enhance their memory of the association between music and activities? 11 - 12 12 = Method = 13 13 14 - Thecontrolgroup evaluationisused. Onegroupofparticipantsinteracts withadumbrobotandanothergroupinteractswith the intelligentrobot.The onlydifferencebetween thesetwogroups istheindependentvariable-dumbor intelligentrobot, whichmakes ourresultmore reasonable.5 +We are doing a mixed-method approach. We are producing quantitative data from questionnaires after the interaction with the robot, by asking them how they felt about the whole experience. By measuring this data, we will assess if we successfully achieved our claims and determine the answers to the research questions. 15 15 16 -Besides, Our group decided to use a mixed-method approach for the evaluation. 17 - 18 -* Quantitative data will be derived during the experiment such as the number of mistakes the participant makes during the quiz. The participants were also asked to provide a score based on the given system usability scale^^1^^. 19 -* Qualitative data expected to be gathered through questionnaires, such as to what extent participants are satisfied with using the robot, is also adopted for evaluation. 20 - 21 -By measuring these two types of data, we will manage to assess if our claims are achieved and the research questions are answered. 22 - 23 23 == Participants == 24 24 25 - We invited19participants.Tovalidate our researchquestion thatthequizwill help people bettermemorizemusic-activity links,participants willbedivided intotwo groups,GroupAwiththe intelligent robot(9participants)andGroupB(10participants)with the dumbrobot.9 +Due to covid restrictions, we have to “simulate” our robot-human interaction with elderly people affected by dementia. For that reason, we will ask roughly eighteen people how will pretend to be people with different dementia types, the loved ones or the caregiver. We will create three groups / six people in each. In each group, one person will “play” the role of a caregiver, one plays the loved one, and the remaining four people will do the role of the person with dementia. In each group, the “onset of dementia played” should be different. In Group A, people act like they just have early-onset symptoms, but they are almost fully functioning. They sometimes get confused, but these moments swiftly pass and they are back to their true selves. In Group B, people should have a harder time keeping track of things, forgetting about tasks, people and memories should be common. These people’s life is constantly affected by the disease, but they sometimes have clear moments when they are back to their original selves before dementia. In Group C people have late-onset of dementia. Constant confusion is more common than moments of clarity which usually doesn't last long. People should have a hard time keeping track of anything, and remembering is not something that’s even possible anymore. 26 26 27 27 == Experimental design == 28 28 29 -The experiment will be conducted to simulate the reinforcement learning process of musical memory related to daily activities and to investigate if the quiz is indeed able to help with the learning. 30 -All participants would sign a consent form that informed them of the usage of the collected data and our goal of evaluations. In our prototype, users can personalize the association between music and activities based on their existing intrinsic knowledge. But due to the limited time and requiring a comparable result between groups, in evaluation, we forced 6 pieces of music and activities. Participants listened to the music and were asked the remember the associated activities. 31 -In the end, the participants would take a quiz to see how much they remembered. They are also asked to fill in a questionnaire including the feeling of the robot and possible feedback. 13 +The experiment will be conducted by simulating the interaction between the patient, robot and other actors. The group of participants will be divided into 3 subgroups that simulate patients in different stages of dementia. The participants will be given artificial memory loss by forcing them to remember a large number of songs they have to associate with certain activities. From each group one of the participants will be asked to play the caregiver and another participant will take the roll of loved one. With these participants the interactions between the actors will be tested. The interactions are described in the design patterns. 32 32 33 -1. How many questions did you answer correctly? (Points from 0-6) 34 -1. You feel the robot can help you remember the task. (Agree, Neutral, Disagree) 35 -1. You feel the robot is annoying. (Agree, Neutral, Disagree) 36 -1. Based on the given system usability scale, please give our robot a score. (0-100) 15 +When the participant playing the role of patient has learned the association between the music pieces and activities the robot will start playing certain pieces of music. The participant has to recall the correct activity associated with the music piece. When this is wrong the loved one can step in and call to remind the 37 37 38 -Except for the previous questions, we also collect feedback from participants 39 - 40 -1. What did you like most about the robot? 41 -1. What did you dislike most about the robot? 42 -1. Do you have any further suggestions? (*optional) 43 - 44 44 == Tasks == 45 45 46 -The participants are asked to memorize the association between the given music and activities as best as they can during the play with the robot. 47 -The robot would play the music and ask the participant to answer the correct activity. 48 -In the end, the participant would do the final test and we count the number of correct answers. 19 +**Event: Activity** 49 49 50 -== Measures == 21 +{{html}} 22 +<table> 23 + <tr> 24 + <td>No.</td> 25 + <td>Group A</td> 26 + <td>Group B</td> 27 + <td>Group C</td> 28 + </tr> 29 + <tr> 30 + <td>1</td> 31 + <td>Memorize five pieces of music corresponding with different activities within three minutes;</td> 32 + <td>Memorize seven pieces of music corresponding with different activities within three minutes;</td> 33 + <td>Memorize ten pieces of music corresponding with different activities within three minutes;</td> 34 + </tr> 35 + <tr> 36 + <td>2</td> 37 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 38 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 39 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 40 + </tr> 41 + <tr> 42 + <td>3</td> 43 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 44 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 45 + <td>Say “I will do [activity_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 46 + </tr> 47 + <tr> 48 + <td>4</td> 49 + <td>Ignore the music when hearing the robot playing music3;</td> 50 + <td>Ignore the music when hearing the robot playing music3;</td> 51 + <td>Ignore the music when hearing the robot playing music3;</td> 52 + </tr> 53 + <tr> 54 + <td>5</td> 55 + <td>Say “I will do [task_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music3 again.</td> 56 + <td>Say “I will do [task_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music3 again.</td> 57 + <td>Say “I will do [task_name] now.”/ ”I don’t know.” to the robot after hearing the robot play music3 again.</td> 58 + </tr> 59 +<table> 60 +{{/html}} 51 51 52 -Count the correct answer in the final test. 53 -After the experiment, ask the user to fill in the system usability scale and the questionnaire regarding mood and satisfaction. 62 +**Event: Quiz** 54 54 55 -== Procedure == 56 - 57 -**Event: Quiz** 58 - 59 59 {{html}} 60 60 <table> 61 61 <tr> 62 62 <td>No.</td> 63 - <td>Group A with the intelligent robot</td> 64 - <td>Group B with the dumb robot</td> 68 + <td>Group A</td> 69 + <td>Group B</td> 70 + <td>Group C</td> 65 65 </tr> 66 66 <tr> 67 67 <td>1</td> 68 - <td>Participants sign the consent form and read the instruction for the evaluation;</td> 69 - <td>Participants sign the consent form and read the instruction for the evaluation;</td> 74 + <td>Say "I would like to do a quiz now." to the robot;</td> 75 + <td>Say "I would like to do a quiz now." to the robot;</td> 76 + <td>Say "I would like to do a quiz now." to the robot;</td> 70 70 </tr> 71 71 <tr> 72 72 <td>2</td> 73 - <td>Participants memorize six pieces of music corresponding with different activities;</td> 74 - <td>Participants memorize six pieces of music corresponding with different activities;</td> 80 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 81 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 82 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music1;</td> 75 75 </tr> 76 76 <tr> 77 77 <td>3</td> 78 - <td>Participants play quiz with the smart robot for three minutes, which will correct the participant when wrong answers are given;</td> 79 - <td>Participants play quiz with the dumb robot for three minutes, which will not correct the participant when wrong answers are given;</td> 86 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 87 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 88 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music2;</td> 80 80 </tr> 81 81 <tr> 82 82 <td>4</td> 83 - <td>Test how well participants remember the music-activity pairs by counting the mistakes made;</td> 84 - <td>Test how well participants remember the music-activity pairs by counting the mistakes made;</td> 92 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music3;</td> 93 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music3;</td> 94 + <td>Say “The activity is [activity_name].”/ ”I don’t know.” to the robot after hearing the robot play music3;</td> 85 85 </tr> 86 86 <tr> 87 87 <td>5</td> 88 - <td>Participants fill in the questionnaire and give the feedback;</td> 89 - <td>Participants fill in the questionnaire and give the feedback;</td> 98 + <td>Say “I want to end the quiz now.” to the robot.</td> 99 + <td>Say “I want to end the quiz now.” to the robot.</td> 100 + <td>Say “I want to end the quiz now.” to the robot.</td> 90 90 </tr> 91 91 <table> 92 92 {{/html}} 93 93 94 -== M aterial==105 +== Measures == 95 95 96 -Robot(NAO) with setting music, consent form, laptop 107 +During the experiment, count how many times the user answers with the wrong task. 108 +After the experiment, ask the user to fill in the system usability scale and the questionnaire regarding mood and satisfaction. 97 97 98 -= Results=110 +== Procedure == 99 99 100 -[[image:result2.png||height="400px"]] 101 -From the left figure, we can see the distribution of the number of correct answers. The average score of all participants is 3.6 among 6 questions. For group A, the average score is 3.3 and for group B the average score is 3.8. This bias can be explained because our group size is not large enough to eliminate the various memory ability. but we can also find that all participants in group A can learn something because they have no 0 scores but several participants in group B got 0 scores. In this degree, we can show that our robot does help in memory. 112 +1. Sign the consent form; 113 +2. Complete the given tasks as instructed; 114 +3. Complete a questionnaire 102 102 103 - Fromthe middle figure, we can findthat people in group A tend to think our robot can helpimprove the memory task and only a few of them thought our robot is annoying, as shown in the right figure.116 +== Material == 104 104 105 -[[image:result4.png||height="400px"]] 106 -As shown in the above figure, group A with our intelligent robot gave our robot an average score of 66.7, and group B with the dumb robot gave 58.2. In this scale, we can see that participants are more willing to play with our intelligent robot. 118 +Robot 107 107 108 - Also, wecollect some feedback fromtheparticipants.Most of them liked theappearanceof the robot which is consistent with thereasons we choosetheNAO. Peopleare more engaged and willingtointeractwith a humanoidrobot.Someof them complained about the speechrecognition of this robot.120 +The robot plays an important part in our experiment. 109 109 110 - = Discussion=122 +Consent form 111 111 112 - Weassumethat our intelligent robot can help peoplestrengthen the association between music and activities. The resultof average correct answersdidn't approve this. Several reasons existed. First, our participants were not real PwD and their memory abilities vary. Our group size(about 10 for each group) was not large enough. Also, Participants were only given a limited time. The short duration of the quiz and not using personalised music also accounted for this biased result. However, the overall usability score between the two groups and some quantitative results above also shows that our claim PwD are more willing to play with our intelligent robot and PwD are happy to use the robot could still hold.124 += Results = 113 113 114 - Besides, our robot waslimited byseveral key factors,126 += Discussion = 115 115 116 -* Due to the limited time and resources, we could not evaluate all the claims that were made in the use cases. This limited the broadness of our conclusion about the effectiveness of the system. 117 -* As mentioned before, the small sample size made the accuracy of the result doubtable. Having a larger and more diverse sample group would allow us to more accurately predict real-world usage. 118 -* The accuracy of the speech recognition system in the NAO and the availability of test subjects and robots also limited the evaluation. 119 - 120 -In the future, we could improve in the following aspects, 121 - 122 -* Test a full implementation of the system in a real setting with PwD. 123 -* Research should also be done to look if the robot is actually necessary, or if the advantage of the system could be achieved by a cheaper alternative, such as a virtual robot on a tablet. (Also inspired by the feedback we got. One participant asked why we didn't create an APP.) 124 - 125 - 126 - 127 127 = Conclusion = 128 - 129 -= Reference = 130 - 131 -Bangor, A., Kortum, P. T., & Miller, J. T. (2008). An empirical evaluation of the system usability scale. Intl. Journal of Human–Computer Interaction, 24(6), 574-594. 
 
- result1.png
-   - Author
-   ... ... @@ -1,1 +1,0 @@ 1 -XWiki.mona98 
- Size
-   ... ... @@ -1,1 +1,0 @@ 1 -107.3 KB 
- Content
 
- result2.png
-   - Author
-   ... ... @@ -1,1 +1,0 @@ 1 -XWiki.mona98 
- Size
-   ... ... @@ -1,1 +1,0 @@ 1 -169.4 KB 
- Content
 
- result3.png
-   - Author
-   ... ... @@ -1,1 +1,0 @@ 1 -XWiki.mona98 
- Size
-   ... ... @@ -1,1 +1,0 @@ 1 -217.7 KB 
- Content
 
- result4.png
-   - Author
-   ... ... @@ -1,1 +1,0 @@ 1 -XWiki.mona98 
- Size
-   ... ... @@ -1,1 +1,0 @@ 1 -52.8 KB 
- Content
 
- XWiki.XWikiComments[0]
-   - Author
-   ... ... @@ -1,1 +1,0 @@ 1 -Anonymous 
- Comment
-   ... ... @@ -1,1 +1,0 @@ 1 -Refer to the claims in the problem statement and research questions. 
- Date
-   ... ... @@ -1,1 +1,0 @@ 1 -2022-03-21 19:53:54.868 
 

 
             
             
            