Changes for page Test

Last modified by Clara Stiller on 2022/04/05 13:44

From version Icon 52.1 Icon
edited by Clara Stiller
on 2022/03/27 15:48
Change comment: There is no comment for this version
To version Icon 52.3 Icon
edited by Sayak Mukherjee
on 2022/04/02 02:40
Change comment: There is no comment for this version

Summary

Details

Icon Page properties
Author
... ... @@ -1,1 +1,1 @@
1 -XWiki.ClaraStiller
1 +XWiki.SayakMukherjee
Content
... ... @@ -1,5 +1,5 @@
1 -The best way to test our prototype would be a study with persons with dementia. Still, testing the robot in a real environment would be very time-consuming, because it is not predictable if and when people with dementia start wandering. That is out of scoop for our project.
2 -However, we want to get a first impression of how realistic and convincing the provided dialogues and suggested activities are. In a small study with students, who play the role of having dementia, we are observing the interaction with the robot and want to find out how effective it is in preventing people from wandering.
1 +The best way to test our prototype would be a study with persons with dementia. However, testing the robot in a real environment would be very time-consuming, because it is not predictable if and when people with dementia start wandering. Hence it is out of the scope of this project.
2 +Nevertheless, we wanted to get a first impression of how realistic and convincing the provided dialogues and suggested activities are. In a small study with students, who play the role of having dementia, we are observing the interaction with the robot with an aim to find out how effective it is in preventing people from wandering.
3 3  
4 4  
5 5  = Problem statement and research questions =
... ... @@ -101,7 +101,8 @@
101 101  
102 102  
103 103  = Results =
104 -=== Comparison between intelligent (cond. 1) and less intelligent (cond. 2) prototype ===
104 +{{html}}
105 +<!--=== Comparison between intelligent (cond. 1) and less intelligent (cond. 2) prototype ===
105 105  
106 106  {{html}}
107 107  <img src="/xwiki/wiki/sce2022group05/download/Test/WebHome/Stay_inside.svg" width="500" height="270" />
... ... @@ -138,9 +138,8 @@
138 138  
139 139  ==== Condition 2 - less intelligent prototype: ====
140 140  
141 -Participants assigned to condition 2 weren't convinced to leave. We saw, that most of them tried to continue talking to pepper when it raises its arm to block the door, even though it didn't listen. They were surprised by peppers reaction and asked for a reason why they are not allowed to leave. In order to have a natural conversation flow, the robot should provide an explanation for each scenario that tells why the person is not allowed to leave. This confirms that our approach, to give reason to stay inside, might be helpful to convince PwD to stay inside.
142 +Participants assigned to condition 2 weren't convinced to leave. We saw, that most of them tried to continue talking to pepper when it raises its arm to block the door, even though it didn't listen. They were surprised by peppers reaction and asked for a reason why they are not allowed to leave. In order to have a natural conversation flow, the robot should provide an explanation for each scenario that tells why the person is not allowed to leave. This confirms that our approach, to give reason to stay inside, might be helpful to convince PwD to stay inside.
142 142  
143 -
144 144  === Problems that occurred during the evaluation ===
145 145  
146 146  1. lots of difficulties with speech recognition:
... ... @@ -154,21 +154,135 @@
154 154  One of the most frequent and noticeable reactions from participants was **confusion**. This feeling was caused by two main factors:
155 155  misunderstandings from speech recognition which leads to unsuitable answers from pepper, as well as the unsuitable environment and setting of our evaluation.
156 156  The reasons for failure in speech recognition are listed above. An unsuitable answer can e.g. be an argument to stay inside, that doesn't fit the participant's reason to leave. Also, some people told in a long sentence that they don't like the provided activity and still want to leave. If the speech recognition fails in this case and pepper understood you would like to do the activity, it seems like it encourages you to leave, instead of doing the activity. This leads to the total opposite of our intention.
157 -Furthermore, we found out, that our prototype doesn't fit in the environment of the lab. We encourage the participant to do some activities, that they can't do in the lab environment (go to the living room, have a coffee or do a puzzle). If the robot tells asks you if you want to do the activity, most people don't know how to react and are insecure about how to answer. Participants "freeze" in front of the robot or just left the room.
157 +Furthermore, we found out, that our prototype doesn't fit in the environment of the lab. We encourage the participant to do some activities, that they can't do in the lab environment (go to the living room, have a coffee or do a puzzle). If the robot tells asks you if you want to do the activity, most people don't know how to react and are insecure about how to answer. Participants "freeze" in front of the robot or just left the room. -->
158 +{{/html}}
158 158  
160 +1. What percentage of people are prevented from going out unsupervised? (Quantitative) (CL01, CL05)
161 +1. How does the interaction change the participant's mood? (CL02)
162 +1. Can the robot respond appropriately to the participant's intention? (CL03)
163 +1. How do the participants react to the music? (CL04)
164 +1. Does the activity that the robot suggests prevent people from wandering/ leaving? (CL06)
165 +1. Can pepper identify and catch the attention of the PwD?
159 159  
167 +=== RQ1: Are people convinced not to go out unsupervised? ===
168 +{{html}}
169 +<table style="width: 100%">
170 +<tr>
171 +<td style="width: 50%">
172 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
173 +</td>
174 +<td>
175 +Comment on the graph
176 +</td>
177 +</tr>
178 +</table>
179 +{{/html}}
160 160  
161 -= Discussion =
181 +=== RQ2: How does the interaction change the participant's mood? ===
182 +{{html}}
183 +<table style="width: 100%">
184 +<tr>
185 +<td style="width: 50%">
186 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
187 +</td>
188 +<td>
189 +Comment on the graph
190 +</td>
191 +</tr>
192 +</table>
193 +{{/html}}
162 162  
195 +=== RQ3: Can the robot respond appropriately to the participant's intention? ===
196 +{{html}}
197 +<table style="width: 100%">
198 +<tr>
199 +<td style="width: 50%">
200 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
201 +</td>
202 +<td>
203 +Comment on the graph
204 +</td>
205 +</tr>
206 +</table>
207 +{{/html}}
163 163  
209 +=== RQ4: How do the participants react to the music? ===
210 +{{html}}
211 +<table style="width: 100%">
212 +<tr>
213 +<td style="width: 50%">
214 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
215 +</td>
216 +<td>
217 +Comment on the graph
218 +</td>
219 +</tr>
220 +</table>
221 +{{/html}}
164 164  
223 +=== RQ5: Does the activity that the robot suggests prevent people from wandering/ leaving? ===
224 +{{html}}
225 +<table style="width: 100%">
226 +<tr>
227 +<td style="width: 50%">
228 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
229 +</td>
230 +<td>
231 +Comment on the graph
232 +</td>
233 +</tr>
234 +</table>
235 +{{/html}}
165 165  
237 +=== RQ6: Can pepper identify and catch the attention of the PwD? ===
238 +{{html}}
239 +<table style="width: 100%">
240 +<tr>
241 +<td style="width: 50%">
242 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
243 +</td>
244 +<td>
245 +Comment on the graph
246 +</td>
247 +</tr>
248 +</table>
249 +{{/html}}
166 166  
251 +=== Reliabity Scores ===
252 +{{html}}
253 +<table style="width: 100%">
254 +<tr>
255 +<td style="width: 50%">
256 +<img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/Storyboard_1.png?height=750&rev=1.1" />
257 +</td>
258 +<td>
259 +Comment on the graph
260 +</td>
261 +</tr>
262 +</table>
263 +{{/html}}
167 167  
265 += Limitation=
266 +
267 +* **Lab Environment**: The lab environment is different from a care home, the participants found it difficult to process the suggestions made by Pepper. For example, if Pepper asked someone to visit the living room, it created confusion among the participants regarding their next action.
268 +
269 +* **Role-Playing**: Participants for the experiment are not actual patients suffering from dementia. Hence it is naturally difficult for them to enact the situations and replicate the mental state of an actual person suffering from dementia.
270 +
271 +* **Speech Recognition**: The speech recognition module inside Pepper is not perfect. Therefore, in certain cases, Pepper misinterpreted words spoken by the participants and triggered an erroneous dialogue flow. The problems commonly occurred with words that sound similar such as "work" and "walk". Moreover, there are some additional hardware limitations that hampered the efficiency of the speech recognition system. One prominent issue is that the microphone within Pepper is only active when the speaker is turned off. A blue light in the eye of Pepper indicated the operation of the microphone. Since most of the participants are not used to interacting with Pepper found it difficult to keep this limitation in mind while trying to have a natural conversation.
272 +
273 +* **Face Detection**: The face recognition module within Pepper is also rudimentary in nature. It can not detect half faces are when participants approach from the side. Adding to the problem, the lighting condition in the lab was not sufficient for the reliable functioning of the face recognition module. Hence Pepper failed to notice the participant in some cases and did not start the dialogue flow.
274 +
168 168  = Conclusions =
276 +* People who liked the activity tend to stay in
277 +* People who knew the music found it more fitting
278 +* People are more convinced to stay in with the intelligent prototype
279 +* Cannot conclude whether moods were improved
280 +* Need to experiment with the actual target user group to derive on concrete conclusion
281 +* Experiment with personalization
169 169  
170 -just some notes...
283 += Future Work =
284 +* **Personalisation**: Personalize music, and activity preferences according to the person interacting with Pepper.
285 +* **Robot Collaboration**: Collaborate with other robots such as Miro to assist a person with dementia while going for a walk instead of the caretaker.
286 +* **Recognise Person**: For a personalised experience, it is essential that Pepper is able to identify each person based on an internal database.
287 +* **Fine Tune Speech Recognition**: Improvements are necessary for the speech recognition module before the actual deployment of the project in a care home. Additionally, support for multiple languages can be considered to engage with non-English speaking people.
171 171  
172 -* Evaluation is not significant enough to answer our main question/ goal, due to many issues with speech reco and misunderstandings
173 -* difference in effectiveness between cond 1 and cond 2, but not in all cases
174 -* Confusion + waiting for the robot to listen -> bad for PwD, because they are already confused and might not remember to wait for the eyes to turn blue