Designing assessments in classroom contexts or having them generated automatically requires - among other things - knowledge about the difficulty of what is assessed. Estimates of difficulty can be derived empirically, usually by piloting items, or theoretically from models. Empirical results, in turn, can inform theory and refine models. In this article, we compare four methods of estimating the item difficulty for a typical topic of introductory programming courses: control flow. For a given set of items that have been tested empirically, we also collected expert ratings and additionally applied measures of code complexity both from software engineering and from computer science education research The results show that there is some overlap between empirical results and theoretical predictions. However, for the simple item format that we have been using, the models all fall short in offering enough explanatory power regarding the observed variance in difficulty. Empirical difficulty in turn can serve as the basis for rules that can be used for item generation in the future.
TitelKoli Calling 2022 : 22nd Koli Calling International Conference on Computing Education Research
Redakteure/-innenIlkka Jormanainen, Andrew Petersen
ErscheinungsortNew York, NY, USA
Herausgeber (Verlag)Association for Computing Machinery
ISBN (Print)9781450396165
PublikationsstatusVeröffentlicht - 17.11.2022
No renderer: handleNetPortal,dk.atira.pure.api.shared.model.researchoutput.ContributionToBookAnthology


  • Eigene Veröffentlichung

ID: 5841532