Wiki source code of b. Test

Version 25.1 by Jean-Paul Smit on 2024/04/07 00:14

Hide last authors
Rixt Hellinga 5.1 1 === 1. Introduction ===
Rixt Hellinga 4.1 2
Jean-Paul Smit 11.1 3 This study aims to evaluate the effectiveness of the Nao robot in aiding memory retrieval through the use of a between-subjects design with a test group and a control group. We will simulate conditions of early-stage dementia in student participants to test whether interactions with Nao enhance the retrieval of memories fabricated created by us, mimicking lost memories typical of individuals with dementia.
Ruud de Jong 1.1 4
Rixt Hellinga 16.1 5 Recall the image from the [[3. Evaluation Methods>>Main.b\. Human Factors.Measuring Instruments.WebHome]] as in the figure below. The first two concepts, that of Autonomy and Relatedness and the memory self-efficacy, will be tested by using validated surveys. The memory recall will be tested by a custom survey that was catered to be used with the robot in the domain of interest, that is of using the NAO as an encyclopedia for recalling familiar people.
Rixt Hellinga 5.2 6
Jean-Paul Smit 15.1 7 [[image:Socio-Cognitive Engineering - Frame 1.jpg]]
Jean-Paul Smit 11.1 8
9
Kris van Melis 2.1 10 === 2. Method ===
Ruud de Jong 1.1 11
Jean-Paul Smit 24.1 12 [[Figure 2: procedure of the controlled experiment>>image:Procedure.png]]
Jean-Paul Smit 23.2 13
Kris van Melis 2.1 14 ==== 2.1 Participants ====
Ruud de Jong 1.1 15
Rixt Hellinga 5.1 16 As actual PwDs are not available for this research, the participants will consist of several student volunteers. These will be divided into two groups: those using the Nao robot for memory retrieval (the test group) and those using text-based prompts (the control group).
Ruud de Jong 1.1 17
18
Rixt Hellinga 5.1 19 ==== 2.2 Materials ====
Ruud de Jong 1.1 20
Jean-Paul Smit 24.1 21 Materials for the test group include the NAO robot, an OpenAI chatbot running on a mobile phone, and a laptop to control the human-like movements of NAO, as described in the [[prototype>>doc:3\. Evaluation.a\. Prototype.WebHome]]. The control group requires written-out text prompts instead.
Rixt Hellinga 5.1 22 Both groups require a set of fabricated memories, memory quizzes, writing materials, and evaluation forms for feedback. ** **
Ruud de Jong 1.1 23
24
Rixt Hellinga 5.1 25 ==== 2.3 Measures ====
Ruud de Jong 1.1 26
Jean-Paul Smit 24.1 27 Memory retrieval success will be measured using a [[quiz>>doc:.Quiz.WebHome]] based on the fabricated memories. Participant feedback and observational notes will also be collected to assess the user experience and interaction effectiveness. We will evaluate the recall and memory efficacy of both groups and try to relate the given answers to values** **such as autonomy and relatedness.
Ruud de Jong 1.1 28
29
Rixt Hellinga 5.1 30 ==== 2.4 Procedure ====
Ruud de Jong 1.1 31
Rixt Hellinga 5.1 32 Participants will receive an overview of their role as a PwD and undergo a session with either Nao or text prompts to retrieve memories. Following the session, they will complete the memory quiz. Feedback will be gathered post-interaction.
33
34 //**Phase 1: Preparation**//
35
36 1. **NAO Robot Setup**: Confirm that Nao is fully operational and our team is equipped to facilitate the interactions by doing a pre-test/trial run with our prototype.
37 1. **Recruitment and Consent**: Recruit a diverse group of student volunteers, ensuring informed consent for participation.
38 1. **Training**: Brief participants on their roles and the concept of simulated memory loss without revealing specific details of the fabricated memories.
39
Jean-Paul Smit 11.1 40 //**Phase 2: Baseline Assessment**//
Rixt Hellinga 5.1 41
42 1. **Pre-test**: Optionally, assess the students’ ability to recall general information unrelated to the fabricated memories to establish a baseline.
43 1. **Documentation**: Record responses to analyze as a baseline for later comparison.
44
Jean-Paul Smit 11.1 45 //**Phase 3: Memory Recall**//
Rixt Hellinga 5.1 46
47 1. **Introduction**: Introduce participants in the test group to the NAO robot, explaining its purpose, and provide the control group with their first text-based memory prompt.
48 1. **Simulation**: Conduct sessions where the test group interacts with Nao aiming to recall a set of fabricated memories.
49 1. **Analog control group simulation: **Conduct sessions with an analog encyclopedia with equally complex information on relatives.
50 1. **Memory Quiz**: Following each session, provide a written quiz to both groups based on the memories they were asked to retrieve.
51 1. **Documentation**: Collect the completed quizzes, which will serve as the primary documentation of memory retrieval.
52
53 //**Phase 4: Evaluation**//
54
55 1. **Scoring**: Assess the memory quizzes from both groups to evaluate the quantity and accuracy of retrieved memories.
56 1. **Analysis**: Compare the effectiveness of memory retrieval between the NAO-assisted group and the text-prompt group.
57 1. **Participant Feedback**: Gather feedback from all participants regarding their experience with the memory retrieval process through a questionnaire.
58 1. **Observer Notes**: Collect observations from staff regarding the interaction between participants and the Nao robot or text prompts.
59
60 //**Phase 5: Feedback and Improvement**//
61
62 1. **Participant Reflections**: Review participant feedback for insights into the memory retrieval process and suggestions for improvement.
63 1. **Staff and Observer Insights**: Analyze observations from staff and observers for additional perspectives on the effectiveness and emotional impact of the interactions.
64 1. **Improvement Strategies**: Based on the feedback and results, develop a plan to enhance the memory retrieval capabilities of the Nao robot and the overall test design.
65
William OGrady 12.1 66 === 3. Results ===
67
Jean-Paul Smit 23.3 68 Twenty University students participated in the controlled experiment (M= 45%, F=55%). After gaining consent through the consent form, all participants were briefed to act as if experiencing the interaction from the viewpoint of a PwD. From the sample, half were asked to interact with the NAO and half were only using a voice-based version of the encyclopedia prototype.
William OGrady 12.1 69
Jean-Paul Smit 25.1 70 The main finding of the experiment is on Relatedness as can be seen in Table 1: the increase of affect in both Arousal and Valence dimensions during the experiment with the test group was higher than the experiment with the control group. The control group mean values in arousal and valence show no significant change at all.
Jean-Paul Smit 23.2 71
72
Jean-Paul Smit 24.1 73 [[Figure 3: Pie chart showing the gender distribution of our sample. ~| Figure 4: Boxplot chart showing the familiarity with robots and the attitude towards technology.>>image:gender & familiarity.png]]
Jean-Paul Smit 23.2 74
75
Jean-Paul Smit 25.1 76
77 Competence. No significant effect on using the NAO (p > 0.05)
78
79 Covariates. had no significant influence on recall score (p > 0.05)
80
81
Jean-Paul Smit 24.1 82 For other analyses, most resulting p values had an alpha above 0.05 and thus were not significant enough.
83
84 [[Table 1: showing the Affect variables>>image:Table of means.png||height="214" width="737"]]
85
William OGrady 12.1 86 === 4. Discussion ===
87
Jean-Paul Smit 25.1 88 The NAO may positively affects participant’ s affective state. Yet the other results were not significant enough to make a valid decision about. There are a few possible reasons why that is that will be elaborated upon in this section.
89
90
Jean-Paul Smit 23.1 91 **Sample Group and Size impact validity of the study**
92
Jean-Paul Smit 24.1 93 The ecological validity of the study is impacted by the fact that there were no PwD in our sample. The scope of the experiment was limited to TU Delft University students. That means that future research may benefit from a closer approach to an experiment which is closer to the experience of PwD. Moreover, the controlled experiment was restricted to a cohort of 20 participants, underscoring the potential for enhancing result validity through the utilization of a larger sample size
Jean-Paul Smit 23.1 94
95
William OGrady 13.1 96 **//Participants don't know about points system so they didn't answer with "getting the most points" in mind//**
97
Jean-Paul Smit 24.1 98 //Ambiguities in the evaluation briefing has led to several aspects in the results that might misrepresent the participants' gathered knowledge. Points were awarded to the participant for certain key descriptors, each family member's role, occupation, likes, dislikes and so on. Of course, this wasn't known to the participants, so they might have omitted descriptors they deemed less important or trivial and therefore scored worse, even though they'd heard and remembered them.//
William OGrady 12.1 99
William OGrady 13.1 100
101 //**Participants don't know which people or facts are important, so they can get stuck in spots that are unrewarded**//
102
Jean-Paul Smit 24.1 103 //The choice to create a sprawling, multi-faceted database also had the side-effect of participants finding out a lot of information that was not rewarded by the grading system in any way. For example, the user can ask the robot to elaborate on certain memories or character traits of family members. There are also people in the database that act as ancillary characters and to create a sense of realism to the database, but participants can likely get stuck on learning about them as there is no implied hierarchy of importance to the participant. //
William OGrady 12.1 104
105
Jean-Paul Smit 24.1 106 **//GPT Assistant can elaborate on any question, and therefore the user does not know what belongs to the database and don't know where to focus//**
William OGrady 13.1 107
Jean-Paul Smit 24.1 108 //Another limitation of the evaluation is with the GPT Assistant's ability to consistently elaborate on any question posed by the participant.//
William OGrady 13.1 109
110
111
William OGrady 12.1 112 === 5. Conclusions ===
113
Rixt Hellinga 19.1 114 ===== //**Ethical Considerations**// =====
Rixt Hellinga 5.1 115
Rixt Hellinga 21.1 116 We commit to high ethical standards, respecting the sensitive nature of simulating dementia conditions, and ensuring the well-being and dignity of all participants throughout the study. To ensure this we present a form containing the ethical considerations to each participant.