Publication

Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable

Jorritsma, W., Haga, P-J., Cnossen, F., Dierckx, R., Oudkerk, M. & van Ooijen, P., 2015, 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences. Elsevier

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

APA

Jorritsma, W., Haga, P-J., Cnossen, F., Dierckx, R., Oudkerk, M., & van Ooijen, P. (2015). Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable. In 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences Elsevier.

Author

Jorritsma, Wiard ; Haga, Peter-Jan ; Cnossen, Fokie ; Dierckx, Rudi ; Oudkerk, Matthijs ; van Ooijen, Peter. / Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable. 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences. Elsevier, 2015.

Harvard

Jorritsma, W, Haga, P-J, Cnossen, F, Dierckx, R, Oudkerk, M & van Ooijen, P 2015, Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable. in 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences. Elsevier.

Standard

Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable. / Jorritsma, Wiard; Haga, Peter-Jan; Cnossen, Fokie; Dierckx, Rudi; Oudkerk, Matthijs; van Ooijen, Peter.

6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences. Elsevier, 2015.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Vancouver

Jorritsma W, Haga P-J, Cnossen F, Dierckx R, Oudkerk M, van Ooijen P. Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable. In 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences. Elsevier. 2015


BibTeX

@inproceedings{8ac435055a9747948dcadd5010d4eab6,
title = "Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable",
abstract = "Cognitive modeling tools, such as KLM, GOMS and CogTool, can be used to predict human performance on interface designs before they are implemented and without the need for user testing. The model predictions can inform interface design, because they allow designers to quantitatively compare multiple interface alternatives. However, little research has been done to determine how accurately cognitive modeling tools can predict human performance differences on interface alternatives. It is also unclear whether different modeling tools produce practically significantly different results. The goal of this study was to evaluate the accuracy of KLM, GOMS and CogTool for predicting human performance differences on multiple interface alternatives.Three tasks on three interface alternatives were modeled using KLM, GOMS and CogTool. The model predictions of each tool were compared to performance data of 20 expert users performing the tasks on the interfaces. For all tasks and all modeling tools, the model-predicted trend did not correspond to the trend in the human performance data.For the six statistically significant differences between the interfaces, all tools predicted the direction of difference correctly in four cases, and incorrectly in two cases. The average difference between the predicted and the observed magnitude of difference between the interfaces was 5.49 s for KLM (range: 0.8 – 13.35), 3.98 s for GOMS (range: 0.8 – 9.75) and 3.49 s for CogTool (range: 0.13 – 10.65). These differences between the tools were not statistically significant.In conclusion,KLM, GOMS and CogTool cannot reliably predict human performance differences on multiple interface alternatives. Our results indicate that if the models predict faster performance on interface A than on interface B, humans actually perform faster on interface B than on interface A in one third of the cases. This raises questions about the validity of these cognitive modeling tools in interface design practice.",
author = "Wiard Jorritsma and Peter-Jan Haga and Fokie Cnossen and Rudi Dierckx and Matthijs Oudkerk and {van Ooijen}, Peter",
year = "2015",
language = "English",
booktitle = "6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences",
publisher = "Elsevier",

}

RIS

TY - GEN

T1 - Predicting human performance differences on multiple interface alternatives: KLM, GOMS and CogTool are unreliable

AU - Jorritsma, Wiard

AU - Haga, Peter-Jan

AU - Cnossen, Fokie

AU - Dierckx, Rudi

AU - Oudkerk, Matthijs

AU - van Ooijen, Peter

PY - 2015

Y1 - 2015

N2 - Cognitive modeling tools, such as KLM, GOMS and CogTool, can be used to predict human performance on interface designs before they are implemented and without the need for user testing. The model predictions can inform interface design, because they allow designers to quantitatively compare multiple interface alternatives. However, little research has been done to determine how accurately cognitive modeling tools can predict human performance differences on interface alternatives. It is also unclear whether different modeling tools produce practically significantly different results. The goal of this study was to evaluate the accuracy of KLM, GOMS and CogTool for predicting human performance differences on multiple interface alternatives.Three tasks on three interface alternatives were modeled using KLM, GOMS and CogTool. The model predictions of each tool were compared to performance data of 20 expert users performing the tasks on the interfaces. For all tasks and all modeling tools, the model-predicted trend did not correspond to the trend in the human performance data.For the six statistically significant differences between the interfaces, all tools predicted the direction of difference correctly in four cases, and incorrectly in two cases. The average difference between the predicted and the observed magnitude of difference between the interfaces was 5.49 s for KLM (range: 0.8 – 13.35), 3.98 s for GOMS (range: 0.8 – 9.75) and 3.49 s for CogTool (range: 0.13 – 10.65). These differences between the tools were not statistically significant.In conclusion,KLM, GOMS and CogTool cannot reliably predict human performance differences on multiple interface alternatives. Our results indicate that if the models predict faster performance on interface A than on interface B, humans actually perform faster on interface B than on interface A in one third of the cases. This raises questions about the validity of these cognitive modeling tools in interface design practice.

AB - Cognitive modeling tools, such as KLM, GOMS and CogTool, can be used to predict human performance on interface designs before they are implemented and without the need for user testing. The model predictions can inform interface design, because they allow designers to quantitatively compare multiple interface alternatives. However, little research has been done to determine how accurately cognitive modeling tools can predict human performance differences on interface alternatives. It is also unclear whether different modeling tools produce practically significantly different results. The goal of this study was to evaluate the accuracy of KLM, GOMS and CogTool for predicting human performance differences on multiple interface alternatives.Three tasks on three interface alternatives were modeled using KLM, GOMS and CogTool. The model predictions of each tool were compared to performance data of 20 expert users performing the tasks on the interfaces. For all tasks and all modeling tools, the model-predicted trend did not correspond to the trend in the human performance data.For the six statistically significant differences between the interfaces, all tools predicted the direction of difference correctly in four cases, and incorrectly in two cases. The average difference between the predicted and the observed magnitude of difference between the interfaces was 5.49 s for KLM (range: 0.8 – 13.35), 3.98 s for GOMS (range: 0.8 – 9.75) and 3.49 s for CogTool (range: 0.13 – 10.65). These differences between the tools were not statistically significant.In conclusion,KLM, GOMS and CogTool cannot reliably predict human performance differences on multiple interface alternatives. Our results indicate that if the models predict faster performance on interface A than on interface B, humans actually perform faster on interface B than on interface A in one third of the cases. This raises questions about the validity of these cognitive modeling tools in interface design practice.

UR - http://www.ahfe2015.org/program1.html

M3 - Conference contribution

BT - 6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the Affiliated Conferences

PB - Elsevier

ER -

ID: 23616195