b. Test

Version 14.1 by Karthik Prakash on 2023/04/07 20:27

1. Introduction

In section a. Prototype two versions of the robot were presented, one with voice functionality and one without.

The main claims we are looking to test with this testing procedure are related to the functionality and usability of the robot.

The participants will be other students taking this course. The participants will be placed in the shoes of a PwD and be tasked with completing several basic actions with the robot while impaired in several known ways to simulate the difficulties of a PwD.

After the experiment, the participants will fill out a survey and be asked some more open-ended questions with the purpose of understanding how the interaction with the robot went, and whether they have anything that they find concerning regarding the possible use of the system and its functions in a real-life setting.

On top of this, a short questionnaire will be sent to several care homes throughout the Netherlands in hopes to get a general idea of whether the caretakers at the facilities think that the system would be a good fit for the proposed use case.

2. Method

The prototypes are evaluated in a simulated manner and conducted in-person experiments. Participants will be given a persona to play act.

2.1 Participants

All students in CS4235 Socio-Cognitive Engineering (2022-2023) in TU Delft are invited to test the robot. In the end, 14 students are presented.

2.2 Experimental design

Prior to the experiment, participants were asked to simulate the experience of a person with dementia. Participants who wore glasses were asked to remove them, while those who did not were provided with short-sighted or far-sighted glasses to replicate the blurred vision and degraded perception that is common in PwDs. Additionally, their index and middle fingers were taped together to simulate the difficulty in controlling movements that many PwDs experience.

For this experiment, we used a between-subject design. The control group interacted solely with the stand-alone application, which represented the robot without voice functionality, and received guidance through a single task paper. In contrast, the experimental group engaged with the fully-functional robot, which provided audio instructions, guidance, and encouragement.

2.3 Tasks

In the user test, the following tasks were asked of the participants:

Reminders for activities

Add a reminder that a relative will pay a visit on Sunday with the format as "<relative name> will visit you on Sunday at 3 pm for some tea". Set the reminder to remind you 10 min before that.
Check the remainder for this week and verify the new remainder added for the event.

Personal profile

Browse through relative profiles and read the info.
Add relatives as a contact in the "profile" section.

Memory games

Go to the Games section and play the game.

Medicine reminders

(for professional caregivers - not part of the evaluation)

In the section “My Health”, add a medicine reminder to take the medicine Donepezil, 1 time per day at 9 PM before going to bed.
Check medicines that have been added.
Delete medicines that have been added.

2.4 Measures

Two quantitative measures were employed in the user evaluation. The first measure aimed to test attributes including accessibility, trustworthiness, and comprehensibility. The second measure used was the System Usability Scale (SUS), which is a widely-used scale for evaluating the usability of the software.

In addition to the quantitative measures, a structured interview with open-ended questions was conducted with randomly selected participants. The aim of this interview was to gain a deeper understanding of participants' experiences with the robot, including any concerns they may have regarding the potential use of the system and its functions in a real-life setting.

By using both quantitative and qualitative measures, the user evaluation can provide a more comprehensive understanding of participants' experiences and perceptions of the system. This approach allows for a more nuanced analysis of the data and can provide valuable insights into the strengths and weaknesses of the system, as well as areas for improvement.

2.5 Procedure

The procedure was conducted as follows:

Welcome participants and give an introduction.
Get them to sign a consent form.
Prepare them to pretend to be a person with dementia.
Have interaction with the robot and complete the tasks.
Complete a questionnaire.
Have a short interview with randomly selected participants.

2.6 Material

Consent form. To protect the privacy of participants and ensure the evaluation process goes smoothly, we will ask participants to sign a consent form, indicating they are willing to take part in the evaluation and the data gathered from the experiment will be analyzed by researchers.
Pepper robot.
Questionnaire. Validated questions to test the functionality and usability.

3. Results

Results of interaction:

Tasks	Succeded by Themselves	Succeded with Some Guidance	Succeded with Detailed Explicit Instructions	Average Time to Complete Task (s)
Add a reminder	14.29%	28.57%	57.14%	75
Check weekly remainders on the Calendar page	100%	0%	0%	10
Create a personal profile	7.14%	50%	42.86%	43
Verify current profiles	85.71%	14.29%	0%	10
Play memory game	0%	42.86%	57.14%	200

if possible, note down some Parts Where Users Struggled in each task.

Results of survey:

Survey Results

Significance test:

4. Discussion

Limitations:

We ran into some hiccups while creating the application and performing the experiment:

Implementation:

We could not adapt the robot to the PwD due to time constraints. This means that we did not take into account the severity of the PwD's visual, acoustic and kinesthetic limitations while setting up Pepper.
We could not test the full capabilities of the robot due to privacy constraints. Since we fabricated the information about relatives to protect the privacy of participants, we were not able to perform the scenarios in a realistic manner.
Since the version of the Google Chrome browser on the Pepper tablet was outdated, we were not able to load our Flutter application onto it and simulate actual scenarios.

Evaluation:

It is important to note that the participants who took part in our experiment were not affected by dementia.
The experiment was not conducted in a real-world setting.
We were not able to test the improvement in memory recall since it involves sustained interactions with a particular user over time and our experiment involved brief, one-time interactions.

Future Improvements:

We can make our system more realistic/adapted to PwD by incorporating human-like responses, gestures and movements to Pepper.
We can make our system fully gesture/voice controlled to enable the PwD to use the system without assistance from a caregiver, increasing their autonomy.
We can incorporate privacy protocols like voice authentication and gaze detection to ensure that all personal information about the PwD, relatives and caregivers are kept safe and confidential.

5. Conclusions

After performing the experiment and running various statistical tests on the results obtained, we have made the following conclusions that hopefully answer some of our research questions:

We believe that an information support application DOES IMPROVE a PwD's well-being, since it can provide them with access to important information and support, improving their overall quality of life.
We believe that a robot assistant DOES IMPROVE the experience of a PwD using it. The robot can provide companionship and assistance, making them feel independent and less isolated.

While our experiment had its limitations, we believe that it provides a foundation for future research in developing personalized memory robots for people with dementia. We also believe that our research is applicable to mobile agents which increases the accessibility of the solution.