Changes for page Test
Last modified by Clara Stiller on 2022/04/05 13:44
From version
63.2


edited by Vishruty Mittal
on 2022/04/02 12:56
on 2022/04/02 12:56
Change comment:
There is no comment for this version
To version
52.2


edited by Clara Stiller
on 2022/03/27 16:23
on 2022/03/27 16:23
Change comment:
There is no comment for this version
Summary
Details
- Page properties
-
- Author
-
... ... @@ -1,1 +1,1 @@ 1 -XWiki. Vishruty1 +XWiki.ClaraStiller - Content
-
... ... @@ -1,4 +1,5 @@ 1 -Evaluation is an iterative process where the initial iterations focus on examining if the proposed idea is working as intended. Therefore, we want to first understand how realistic and convincing the provided dialogues and suggested activities are, and would they be able to prevent people from wandering. To examine this, we conduct a small pilot study with students, who role-play having dementia. We then observe their interaction with Pepper to examine the effectiveness of our dialog flow in preventing people from wandering. 1 +The best way to test our prototype would be a study with persons with dementia. Still, testing the robot in a real environment would be very time-consuming, because it is not predictable if and when people with dementia start wandering. That is out of scoop for our project. 2 +However, we want to get a first impression of how realistic and convincing the provided dialogues and suggested activities are. In a small study with students, who play the role of having dementia, we are observing the interaction with the robot and want to find out how effective it is in preventing people from wandering. 2 2 3 3 4 4 = Problem statement and research questions = ... ... @@ -23,37 +23,23 @@ 23 23 24 24 = Method = 25 25 26 -A between-subject study with students who play the role of having dementia. Data will be collected with a questionnaire that participants fill outbefore and after interactingwithPepper. The questionnairecaptures differentaspectsoftheconversationalong with theirmood beforeandafter theinteractionwithPepper.27 +A between-subject study with students who play the role of having dementia. Data will be collected with a questionnaire (before and after participation), observing the participant's body language and the way that they're responding to Pepper. 27 27 28 -For our between-subject study, our independent variable is Pepper trying to distract the users by mentioning different activities along with the corresponding music. Through this, we want to measure the effectiveness of music and activities in preventing people from leaving the care home, which is thereby our dependent variable. So we developed 2 different prototype designs- 29 - 30 -Design X - It is the full interaction flow where Pepper suggests activities and uses music to distract people from leaving. 31 -Design Y - It is the control condition where pepper simply tries to stop people from leaving by physically keeping its hand on the door. 32 - 33 33 == Participants == 34 34 35 -1 7students who play the role of having dementia. They will be divided into two groups. One group (11 participants) will be interacting withdesignX(group 1) robot while the other group (6students) will interact with thedesignY(group 2).31 +18 students who play the role of having dementia. They will be divided into two groups. One group (11 participants) will be interacting with the intelligent (group 1) robot while the other group (7 students) will interact with the unintelligent robot (group 2). 36 36 It is assumed that all participants are living at the same care center. 33 +Before they start, they can choose how stubborn they want to be and where they want to go. 37 37 38 38 == Experimental design == 39 39 40 - ***BeforeExperiment:**37 +All questions collect quantitative data, using a 5 point Likert scale wherever applicable. 41 41 42 -We will explain to the participants the goal of this experiment and what do they need to do to prevent ambiguity. Therefore, as our participants are students and only playing the role of having dementia, we will give them a level of stubbornness/ willpower with which they are trying to leave the care home. 43 -Participants will also be given a reason to leave, from the below list: 39 +1. Observe the participant's mood and see how the conversation goes. Observe the level of aggression (tone, volume, pace) 40 +1. Observe whether the mood is improved and the decision has been changed. 41 +1. Observe how natural the conversation is. (conversation makes sense) 42 +1. Participants fill out questionnaires. 44 44 45 -going to the supermarket 46 -going to the office 47 -going for a walk 48 - 49 -After this preparation, the participant fills a part of the questionnaire. 50 - 51 -Experiment: 52 -The participant begins interacting with Pepper who is standing near the exit door. The participant and robot have an interaction where the robot is trying to convince him/her to stay inside. 53 - 54 -After Experiment: 55 -After the participant finishes interacting with Pepper, he/she will be asked to fill out the remaining questionnaire. Almost all the questions in the questionnaire collect quantitative data, using a 5 point Likert scale. The questionnaire also used images from Self Assessment Manikin (SAM) so that user can self attest to their mood before and after their interaction with Pepper. 56 - 57 57 == Tasks == 58 58 59 59 Because our participants only play the role of having dementia, we will give them a level of stubbornness/ willpower with they are trying to leave. We try to detect this level with the robot. ... ... @@ -112,12 +112,11 @@ 112 112 113 113 Pepper, laptop, door, and music. 114 114 102 + 115 115 = Results = 104 +=== Comparison between intelligent (cond. 1) and less intelligent (cond. 2) prototype === 116 116 117 117 {{html}} 118 -<!--=== Comparison between intelligent (cond. 1) and less intelligent (cond. 2) prototype === 119 - 120 -{{html}} 121 121 <img src="/xwiki/wiki/sce2022group05/download/Test/WebHome/Stay_inside.svg" width="500" height="270" /> 122 122 {{/html}} 123 123 ... ... @@ -152,8 +152,9 @@ 152 152 153 153 ==== Condition 2 - less intelligent prototype: ==== 154 154 155 -Participants assigned to condition 2 weren't convinced to leave. We saw, that most of them tried to continue talking to pepper when it raises its arm to block the door, even though it didn't listen. They were surprised by peppers reaction and asked for a reason why they are not allowed to leave. In order to have a natural conversation flow, the robot should provide an explanation for each scenario that tells why the person is not allowed to leave. This confirms that our approach, to give reason to stay inside, might be helpful to convince PwD to stay inside. 141 +Participants assigned to condition 2 weren't convinced to leave. We saw, that most of them tried to continue talking to pepper when it raises its arm to block the door, even though it didn't listen. They were surprised by peppers reaction and asked for a reason why they are not allowed to leave. In order to have a natural conversation flow, the robot should provide an explanation for each scenario that tells why the person is not allowed to leave. This confirms that our approach, to give reason to stay inside, might be helpful to convince PwD to stay inside. 156 156 143 + 157 157 === Problems that occurred during the evaluation === 158 158 159 159 1. lots of difficulties with speech recognition: ... ... @@ -167,136 +167,28 @@ 167 167 One of the most frequent and noticeable reactions from participants was **confusion**. This feeling was caused by two main factors: 168 168 misunderstandings from speech recognition which leads to unsuitable answers from pepper, as well as the unsuitable environment and setting of our evaluation. 169 169 The reasons for failure in speech recognition are listed above. An unsuitable answer can e.g. be an argument to stay inside, that doesn't fit the participant's reason to leave. Also, some people told in a long sentence that they don't like the provided activity and still want to leave. If the speech recognition fails in this case and pepper understood you would like to do the activity, it seems like it encourages you to leave, instead of doing the activity. This leads to the total opposite of our intention. 170 -Furthermore, we found out, that our prototype doesn't fit in the environment of the lab. We encourage the participant to do some activities, that they can't do in the lab environment (go to the living room, have a coffee or do a puzzle). If the robot tells asks you if you want to do the activity, most people don't know how to react and are insecure about how to answer. Participants "freeze" in front of the robot or just left the room. --> 171 -{{/html}} 157 +Furthermore, we found out, that our prototype doesn't fit in the environment of the lab. We encourage the participant to do some activities, that they can't do in the lab environment (go to the living room, have a coffee or do a puzzle). If the robot tells asks you if you want to do the activity, most people don't know how to react and are insecure about how to answer. Participants "freeze" in front of the robot or just left the room. 172 172 173 -=== RQ1: Are people convinced not to go out unsupervised? === 174 174 175 -{{html}} 176 -<table style="width: 100%"> 177 -<tr> 178 -<td style="width: 50%"> 179 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ1.jpg?height=250&rev=1.1" /> 180 -</td> 181 -<td> 182 -Comment on the graph 183 -</td> 184 -</tr> 185 -</table> 186 -{{/html}} 187 187 188 -= ==RQ2: How does theinteractionchange the participant's mood?===161 += Discussion = 189 189 190 -{{html}} 191 -<table style="width: 100%"> 192 -<tr> 193 -<td style="width: 50%"> 194 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ2.jpg?height=250&rev=1.1" /> 195 -</td> 196 -<td> 197 -Comment on the graph 198 -</td> 199 -</tr> 200 -</table> 201 -{{/html}} 202 202 203 -=== RQ3: Can the robot respond appropriately to the participant's intention? === 204 204 205 -{{html}} 206 -<table style="width: 100%"> 207 -<tr> 208 -<td style="width: 50%"> 209 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ3.jpg?height=250&rev=1.1" /> 210 -</td> 211 -<td> 212 -Comment on the graph 213 -</td> 214 -</tr> 215 -</table> 216 -{{/html}} 217 217 218 -=== RQ4: How do the participants react to the music? === 219 219 220 -{{html}} 221 -<table style="width: 100%"> 222 -<tr> 223 -<td style="width: 50%"> 224 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ4.jpg?height=250&rev=1.1" /> 225 -</td> 226 -<td> 227 -Comment on the graph 228 -</td> 229 -</tr> 230 -</table> 231 -{{/html}} 167 += Limitation= 232 232 233 -=== RQ5: Does the activity that the robot suggests prevent people from wandering/ leaving? === 169 +* participants don't have dementia, only playing the role of having it. --> it's hard to play someone starting to wander 170 +* issues with speech recognition leads to "wrong" dialogues and unintended behavior of the robot 171 +* unsuitable environment confused participants in how they should continue, they might have done something else, if they would know where to do the provided activities 234 234 235 -{{html}} 236 -<table style="width: 100%"> 237 -<tr> 238 -<td style="width: 50%"> 239 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ5.jpg?height=250&rev=1.1" /> 240 -</td> 241 -<td> 242 -Comment on the graph 243 -</td> 244 -</tr> 245 -</table> 246 -{{/html}} 173 + 247 247 248 -=== RQ6: Can pepper identify and catch the attention of the PwD? === 249 - 250 -{{html}} 251 -<table style="width: 100%"> 252 -<tr> 253 -<td style="width: 50%"> 254 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ6.jpg?height=250&rev=1.1" /> 255 -</td> 256 -<td> 257 -Comment on the graph 258 -</td> 259 -</tr> 260 -</table> 261 -{{/html}} 262 - 263 -=== Reliabity Scores === 264 - 265 -{{html}} 266 -<table style="width: 100%"> 267 -<tr> 268 -<td style="width: 50%"> 269 -<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RelScores.jpg?height=250&rev=1.1" /> 270 -</td> 271 -<td> 272 -Comment on the graph 273 -</td> 274 -</tr> 275 -</table> 276 -{{/html}} 277 - 278 -= Limitation = 279 - 280 -* **Lab Environment**: The lab environment is different from a care home, the participants found it difficult to process the suggestions made by Pepper. For example, if Pepper asked someone to visit the living room, it created confusion among the participants regarding their next action. 281 - 282 -* **Role-Playing**: Participants for the experiment are not actual patients suffering from dementia. Hence it is naturally difficult for them to enact the situations and replicate the mental state of an actual person suffering from dementia. 283 - 284 -* **Speech Recognition**: The speech recognition module inside Pepper is not perfect. Therefore, in certain cases, Pepper misinterpreted words spoken by the participants and triggered an erroneous dialogue flow. The problems commonly occurred with words that sound similar such as "work" and "walk". Moreover, there are some additional hardware limitations that hampered the efficiency of the speech recognition system. One prominent issue is that the microphone within Pepper is only active when the speaker is turned off. A blue light in the eye of Pepper indicated the operation of the microphone. Since most of the participants are not used to interacting with Pepper found it difficult to keep this limitation in mind while trying to have a natural conversation. 285 - 286 -* **Face Detection**: The face recognition module within Pepper is also rudimentary in nature. It can not detect half faces are when participants approach from the side. Adding to the problem, the lighting condition in the lab was not sufficient for the reliable functioning of the face recognition module. Hence Pepper failed to notice the participant in some cases and did not start the dialogue flow. 287 - 288 288 = Conclusions = 289 289 290 -* People who liked the activity tend to stay in 291 -* People who knew the music found it more fitting 292 -* People are more convinced to stay in with the intelligent prototype 293 -* Cannot conclude whether moods were improved 294 -* Need to experiment with the actual target user group to derive on concrete conclusion 295 -* Experiment with personalization 177 +just some notes... 296 296 297 -= Future Work = 298 - 299 -* **Personalisation**: Personalize music, and activity preferences according to the person interacting with Pepper. 300 -* **Robot Collaboration**: Collaborate with other robots such as Miro to assist a person with dementia while going for a walk instead of the caretaker. 301 -* **Recognise Person**: For a personalised experience, it is essential that Pepper is able to identify each person based on an internal database. 302 -* **Fine Tune Speech Recognition**: Improvements are necessary for the speech recognition module before the actual deployment of the project in a care home. Additionally, support for multiple languages can be considered to engage with non-English speaking people. 179 +* Evaluation is not significant enough to answer our main question/ goal, due to many issues with speech reco and misunderstandings 180 +* difference in effectiveness between cond 1 and cond 2, but not in all cases 181 +* Confusion + waiting for the robot to listen -> bad for PwD, because they are already confused and might not remember to wait for the eyes to turn blue
- XWiki.XWikiComments[1]
-
- Comment
-
... ... @@ -1,1 +1,1 @@ 1 - Addclaimnumber withRQ6onceclaimsare fixed back1 +Modified Research Questions so that they align with claims - Date
-
... ... @@ -1,1 +1,1 @@ 1 -2022-0 4-02 11:44:44.3391 +2022-03-26 13:50:45.49
- XWiki.XWikiComments[2]
-
- Author
-
... ... @@ -1,1 +1,0 @@ 1 -XWiki.Vishruty - Comment
-
... ... @@ -1,1 +1,0 @@ 1 -Check if RQ and claims align with each other(CL numbers) once claims are reverted. - Date
-
... ... @@ -1,1 +1,0 @@ 1 -2022-04-02 11:45:38.960