Changes for page Test

Last modified by Clara Stiller on 2022/04/05 13:44

Manage
- Copy
Actions
- Export
- Print Preview
Viewers
- Source
- Children
- Content
- Comments (3)
- Annotations
- Attachments (7)
- History
- Information

From 68.1 to 69.1 From 85.1 to 86.1

From version

69.1

edited by Vishruty Mittal
on 2022/04/02 13:01

Change comment: There is no comment for this version

To version

85.1

edited by Vishruty Mittal
on 2022/04/02 15:20

Change comment: There is no comment for this version

Raw
Rendered

Summary

Page properties (1 modified, 0 added, 0 removed)

Details

Page properties

Content

@@ -1,6 +1,5 @@
  Evaluation is an iterative process where the initial iterations focus on examining if the proposed idea is working as intended. Therefore, we want to first understand how realistic and convincing the provided dialogues and suggested activities are, and would they be able to prevent people from wandering. To examine this, we conduct a small pilot study with students, who role-play having dementia. We then observe their interaction with Pepper to examine the effectiveness of our dialog flow in preventing people from wandering.
--
  = Problem statement and research questions =
  **Goal**: How effective is music and dialogue in preventing people with dementia from wandering?
@@ -32,13 +32,12 @@
  == Participants ==
--17 students who play the role of having dementia. They will be divided into two groups. One group (11 participants) will be interacting with design X (group 1) robot while the other group (6 students) will interact with the design Y (group 2).
--It is assumed that all participants are living at the same care center.
++The ideal participants for our user study would have been people suffering from dementia. As the people in this section fall under vulnerable groups, testing with them would have been very difficult due to the current pandemic situation. Therefore we planned to conduct our experiments with students instead.
++Our experiment involves 17 students who play the role of having dementia. They will be divided into two groups. One group (11 participants) will be interacting with design X while the other group (6 students) will interact with design Y.
  == Experimental design ==
  **Before Experiment:**
--
  We will explain to the participants the goal of this experiment and what do they need to do to prevent ambiguity. Therefore, as our participants are students and only playing the role of having dementia, we will give them a level of stubbornness/ willpower with which they are trying to leave the care home.
  Participants will also be given a reason to leave, from the below list:
@@ -56,8 +56,12 @@
  == Material ==
--Pepper, laptop, door, and music.
++The items required for this evaluation are the following:
++* Pepper
++* Door
++* Caretaker in a nearby room in case of emergency
++
  = Results =
  {{html}}
@@ -125,7 +125,8 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ1.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We used a Likert scale for this question, 1 being the lowest and 5 being the highest. Participants who interacted with design Y tend to agree less to stay inside compared to the people who interacted with design X.
++
  </td>
  </tr>
  </table>
@@ -140,7 +140,9 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ2.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We notice a positive change in valence with the full flow i.e design X (although negligible). This can be because of the music. The valence does not decrease for the baseline which might be due to the novelty effect of seeing Pepper for the first time. The change in arousal in both scenarios is nearly negligible. This might be due to the fact that the interaction with Pepper was very short.
++Additionally, in the case of the full flow i.e design X, these values might have not changed significantly as per the expectation (valence higher, arousal lower) because the music was not personalized for participants.
++
  </td>
  </tr>
  </table>
@@ -155,7 +155,9 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ3.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We notice a very minute difference between the full flow i.e design X, and control condition, design Y. There might be many reasons behind this. The speech recognition module in Pepper was not very efficient to understand different accents and thereby misunderstood words in some cases. <br>
++The null hypothesis is perceived message understanding for both the conditions is equal. Given the p value, the null hypothesis can not be rejected. High variance in data and also restrictive sample size could be the reasons behind the insignificant result.
++
  </td>
  </tr>
  </table>
@@ -170,7 +170,7 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ4.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We found that participants who knew the songs, enjoyed the music and thought it fit the situation more than the ones who did not know the songs.
  </td>
  </tr>
  </table>
@@ -185,7 +185,7 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ5.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++As per these results, we can say that if participants have a predilection toward the suggested activity, there is a higher chance of them staying in. Therefore there is a direct correlation between people staying in and their interest in the activity. After personalization, we expect the score to be further increased.
  </td>
  </tr>
  </table>
@@ -200,7 +200,11 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RQ6.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We find that the values for co-presence for both conditions are very similar. This may be attributed to the novelty effect and also to the fact that the face recognition module remains unchanged.
++The values for attention allocation are similar, but the controlled flow (design Y) has a higher value. We suspect that the potential reason might be, that people start to lose focus with the elongated conversations.
++
++Besides the co-presence, all the observations are not statistically significant because of the high variance in the limited responses.
++
  </td>
  </tr>
  </table>
@@ -215,7 +215,7 @@
  <img src="/xwiki/wiki/sce2022group05/download/Foundation/Operational%20Demands/Personas/WebHome/RelScores.jpg?height=250&rev=1.1" />
  </td>
  <td>
--Comment on the graph
++We achieved a high Cronbatch alpha score (>60%) for almost all the sections of our analysis. Thereby providing reliability to our evaluation.
  </td>
  </tr>
  </table>