Changes for page b. Test
Last modified by Jean-Paul Smit on 2024/04/09 15:23
From version 28.1
edited by Pravesha Ramsundersingh
on 2024/04/08 10:56
on 2024/04/08 10:56
Change comment:
There is no comment for this version
To version 23.2
edited by Jean-Paul Smit
on 2024/04/07 00:03
on 2024/04/07 00:03
Change comment:
There is no comment for this version
Summary
-
Page properties (2 modified, 0 added, 0 removed)
-
Attachments (0 modified, 0 added, 1 removed)
Details
- Page properties
-
- Author
-
... ... @@ -1,1 +1,1 @@ 1 -xwiki:XWiki. PraveshaRamsundersingh1 +xwiki:XWiki.jeanpaulsmit - Content
-
... ... @@ -9,7 +9,7 @@ 9 9 10 10 === 2. Method === 11 11 12 -[[Figure 2: procedure of the controlled experiment>>image:Procedure.png]]12 +[[Figure: procedure of the controlled experiment>>image:Procedure.png]] 13 13 14 14 ==== 2.1 Participants ==== 15 15 ... ... @@ -18,13 +18,13 @@ 18 18 19 19 ==== 2.2 Materials ==== 20 20 21 -Materials for the test group include the NAO robot, an OpenAI chatbot running on a mobile phone, and a laptop to control the human-like movements of NAO, as described in the [[prototype>>doc:3\. Evaluation.a\. Prototype.WebHome]]. The control group requires written -out text prompts instead.21 +Materials for the test group include the NAO robot, an OpenAI chatbot running on a mobile phone, and a laptop to control the human-like movements of NAO, as described in the [[prototype>>doc:3\. Evaluation.a\. Prototype.WebHome]]. The control group requires written out text prompts instead. 22 22 Both groups require a set of fabricated memories, memory quizzes, writing materials, and evaluation forms for feedback. ** ** 23 23 24 24 25 25 ==== 2.3 Measures ==== 26 26 27 -Memory retrieval success will be measured using a [[quiz>>doc:.Quiz.WebHome]] based on the fabricated memories. Participant feedback and observational notes will also be collected to assess the user experience and interaction effectiveness. We will evaluate the recall and memory efficacy of both groups and try to relate the given answers to values** **such as autonomy and relatedness.27 +Memory retrieval success will be measured using a [[quiz>>doc:.Quiz.WebHome]] based on the fabricated memories. Participant feedback and observational notes will also be collected to assess the user experience and interaction effectiveness. We will evaluate the recall and memory-efficacy of both groups and try to relate the given answers to values** **such as autonomy and relatedness. 28 28 29 29 30 30 ==== 2.4 Procedure ==== ... ... @@ -65,62 +65,42 @@ 65 65 66 66 === 3. Results === 67 67 68 -Twenty University students participated in the controlled experiment (M= 45%, F=55%). After gaining consent through the consent form, all participants were briefed to act as if experiencing the interaction from the viewpoint of a PwD. From the sample, half wereasked to interact with theNAO and half were only using a voice-basedversion of the encyclopedia prototype.68 +Twenty University students participated in the controlled experiment (M= 45%, F=55%). After gaining consent through the consent form, all participants were briefed to act as if experiencing the interaction from the viewpoint of a PwD. From the sample, half experienced 69 69 70 - Themainfindingofthe experimentison Relatedness as can be seenin Table 1:theincreaseofaffect in both ArousalandValencedimensions during theexperiment withthe test group washigher than theexperimentwith thecontrol group.Thecontrolgroupmean values inarousal andvalenceshowno significantchange at all.70 +[[Pie chart showing the gender distribution of our sample. / Boxplot chart showing the familiarity with robots and the attitude towards technology.>>image:gender & familiarity.png]] 71 71 72 72 73 - [[Figure 3: Pie chart showingthegenderdistributionofour sample. ~| Figure4: Boxplot chartshowingthe familiaritywith robotsand the attitudetowards technology.>>image:gender &familiarity.png]]73 +Most resulting p values had an alpha above 0.05 and thus were not significant enough. 74 74 75 +[[Table showing the Affect variables>>image:Table of means.png||height="214" width="737"]] 75 75 76 - 77 -Competence. No significant effect on using the NAO (p > 0.05) 78 - 79 -Covariates. had no significant influence on recall score (p > 0.05) 80 - 81 - 82 -For other analyses, most resulting p values had an alpha above 0.05 and thus were not significant enough. 83 - 84 -[[Table 1: showing the Affect variables>>image:Table of means.png||height="214" width="737"]] 85 - 86 86 === 4. Discussion === 87 87 88 -The NAO may positively affects participant’ s affective state. Yet the other results were not significant enough to make a valid decision about. There are a few possible reasons why that is that will be elaborated upon in this section. 89 - 90 - 91 91 **Sample Group and Size impact validity of the study** 92 92 93 -The ecological validity of the study is impacted by the fact that there were no PwD in our sample. The scope of the experiment was limited to TU Delft University students. That means that future research may benefit from a closer approach to an experiment which is closer to the experience of PwD. Moreover, the controlled experiment was restricted to a cohort of 20 participants, underscoring the potential for enhancing result validity through the utilization of a larger sample size81 +The ecological validity of the study is impacted by the fact that there were no PwD in our sample.The scope of the experiment was limited to TU Delft University students. That means that future research may benefit from a closer approach to an experiment which is closer to the experience of PwD. Moreover, the controlled experiment was restricted to a cohort of 20 participants, underscoring the potential for enhancing result validity through the utilization of a larger sample size 94 94 95 95 96 96 **//Participants don't know about points system so they didn't answer with "getting the most points" in mind//** 97 97 98 -//Ambiguities in the evaluation briefing has led to several aspects in the results that might misrepresent the participants' gathered knowledge. Points were awarded to the participant for certain key descriptors, each family member's role, occupation, likes, dislikes and so on. Of course, this wasn't known to the participant s, so they might have omitted descriptors they deemed less important or trivial and therefore scored worse, even though they'd heard and remembered them.//86 +//Ambiguities in the evaluation briefing has led to several aspects in the results that might misrepresent the participants' gathered knowledge. Points were awarded to the participant for certain key descriptors, each family member's role, occupation, likes, dislikes and so on. Of course, this wasn't known to the participant, so they might have omitted descriptors they deemed less important or trivial and therefore scored worse, even though they'd heard and remembered them.// 99 99 100 100 101 101 //**Participants don't know which people or facts are important, so they can get stuck in spots that are unrewarded**// 102 102 103 -//The choice to create a sprawling, multi-faceted database also had the side-effect of participants finding out a lot of information that was not rewarded by the grading system in any way. For example, the user can ask the robot to elaborate o ncertain memories or character traits of family members. There are also people in the database that act as ancillary characters and to create a sense of realism to the database, but participants can likely get stuck on learning about them as there is no implied hierarchy of importance to the participant. //91 +//The choice to create a sprawling, multi-faceted database also had the side-effect of participants finding out a lot of information that was not rewarded by the grading system in any way. For example, the user can ask the robot to elaborate about certain memories or character traits of family members. There are also people in the database that act as ancillary characters and to create a sense of realism to the database, but participants can likely get stuck on learning about them as there is no implied hiearchy of importance to the participant. // 104 104 105 105 106 -**//GPT Assistant can elaborate on any question, and therefore the user does not know what belongs to the database and don't know where to focus//** 94 +**//GPT Assistant can elaborate on any question, and therefore the user does not know what belongs to the database, and don't know where to focus//** 107 107 108 -//Another limi tation of the evaluation is with the GPT Assistant's ability to consistently elaborate on any question posed by the participant.//96 +//Another limiation of the evaluation is with the GPT Assistant's ability to consistently elaborate on any question posed by the participant.// 109 109 110 110 111 -===== ===== 112 112 113 -(% class="wikigeneratedid" id="HEthicalConsiderations" %) 114 -//**Ethical Considerations**// 115 - 116 -We commit to high ethical standards, respecting the sensitive nature of simulating dementia conditions, and ensuring the well-being and dignity of all participants throughout the study. To ensure this we present a form containing the ethical considerations to each participant. 117 - 118 - 119 119 === 5. Conclusions === 120 120 121 -===== //** FinalRemarks**// =====102 +===== //**Ethical Considerations**// ===== 122 122 123 -W ith regardstoour final insights, there were some areas of improvement. For example, relatingtothe validityofthe testingprocedure, wehada relatively smallsample size andnone of the participants haddementia. Thismeansthat thetargetgroup forthis NAO systemwasnot tested. It was also only aspecific group of people,namely students with the agerange of20-25. There wasalso a level of ambiguityinthepoint systemof theevaluationandwith regards tothe LLM model focus,we founddeviationsfromtheGPT regardingthe mainobjective.However,althoughthese wereafew ofthe limitations throughout theprocess,wewerestillabletoevaluatetheNAO systemandcouldrecognisethattheNAO maypositivelyaffectsparticipant’ s affective state.104 +We commit to high ethical standards, respecting the sensitive nature of simulating dementia conditions, and ensuring the well-being and dignity of all participants throughout the study. To ensure this we present a form containing the ethical considerations to each participant. 124 124 125 - 126 -===== **//Potential Future Work//** ===== 106 +
- Test group data.jpg
-
- Author
-
... ... @@ -1,1 +1,0 @@ 1 -xwiki:XWiki.jeanpaulsmit - Size
-
... ... @@ -1,1 +1,0 @@ 1 -157.5 KB - Content