Measuring Instruments

Version 4.4 by Sofia Kostakonti on 2022/04/03 15:15

For the evaluation of a prototype, there are several frameworks that can be followed, starting with DECIDE[1]. Decide stands for:

Determine the goals
Explore the questions
Choose evaluation approach and methods
Identify practical issues
Decide about ethical issues
Evaluate, analyze, interpret, present data

First, we would have to determine the high-level goals for the study and the motivation behind them, since they can influence how we approach it. Then, we choose the evaluation approach, the methods that will be used, whether these are based on quantitative or qualitative data, and the process of data collecting, analysis, and presentation. At the same time, any practical issues, such as participants, budget, or schedule, are identified and a pilot study is performed if needed. It is important to adhere to any ethical procedures that are in place, to ensure the participant knows their rights and is protected. Finally, the evaluation of the data takes place, where it is determined whether the results are reliable, valid, without bias, unrelated to the environment and can generalize well.
Another framework used is IMPACT[2]:

IMPACT

Intention: Present the objectives and claims
Measures and metrics: "What, how, and why"
People: Define the participants
Activities: Use cases in activities
Context: Social, ethical, physical, etc. environment definition
Technologies: Hardware and software

Evaluation methods

 

Formative evaluation

- Open-ended evaluation of the design
- E.g. How will the users respond to the new design?

Summative evaluation

- Focus on the overall effect
- Summarizes if the objective is reached
- E.g. Are the participants happier when working with design X in comparison to design Y?

Data

- Qualitative: Explore, discover, instruct

- Quantitative: Describe, explain, predict

  - Subjective quantitative

Statistics

- Descriptive: Describe the dataset, e.g. mean time on task
- Inferential: Using a sample to infer about a population, e.g. predicted mean time on task based on user characteristics.

Experiment Design: Conditions

Within Subjects (Repeated Measures)

Each participant is subjected to all the test conditions.
Fewer subjects needed and reduces variance in the Can be difficult to setup due to subjects fatiguing, learning about the setup or simply not having enough time.

Between Subjects (Between Groups)

One subject undergoes only one test. Simple to execute, but results in significant variance due to inter-subject differences in characteristics.

Lenses

- Lense means to take different perspectives looking at your system
- E.g. perspective of the stakeholders, other groups, or technical/legal

[1] Kurniawan, S. (2004). Interaction design: Beyond human–computer interaction by Preece, Sharp and Rogers (2001), ISBN 0471492787.