Literature

1

Allan Birnbaum. Some Latent Trait Models and Their Use in Inferring an Examinee's Ability, pages 395–479. Addison-Wesley, 1968.

2

R. Darrell Bock. Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1):29–51, 03 1972. doi:10.1007/bf02291411.

3

R. Darrell Bock and Murray Aitkin. Marginal maximum likelihood estimation of item parameters: application of an em algorithm. Psychometrika, 46(4):443–459, 1981. doi:10.1007/bf02293801.

4

Hua-Hua Chang, Chun Wang, and Zhiliang Ying. Information theory and its application to testing. In Wim J. Van der Linden, editor, Handbook of Item Response Theory, Volume Two: Statistical Tools, volume 2, chapter 7, pages 105–122. CRC press, Boca Raton, 2016.

5

Ying Cheng, Ke-Hai Yuan, and Cheng Liu. Comparison of reliability measures under factor analysis and item response theory. Educational and Psychological Measurement, 72(1):52–67, 2012. doi:10.1177/0013164411407315.

6

B.W. Domingue, K. Kanopka, and R. et al. Kapoor. The intermodel vigorish as a lens for understanding (and quantifying) the value of item response models for dichotomously coded items. Psychometrika, 89:1034–1054, 2024. doi:10.1007/s11336-024-09977-2.

7

Conor Durkan, Artur Bekasov, Iain Murray, and George Papamakarios. Neural spline flows. arXiv preprint arXiv:1906.04032, 2019. URL: https://arxiv.org/abs/1906.04032.

8

Carl F. Falk and Li Cai. Maximum marginal likelihood estimation of a monotonic polynomial generalized partial credit model with applications to multiple group analysis. Psychometrika, 81(2):434–460, 2016. doi:10.1007/s11336-014-9428-7.

9

Doyoung Kim, Rafael Jaime De Ayala, Abdullah A Ferdous, and Michael L Nering. The comparative performance of conditional independence indices. Applied Psychological Measurement, 35(6):447–471, 2011. doi:10.1177/0146621611407909.

10

Eiji Muraki. A generalized partial credit model: Application of an EM algorithm. ETS Research Report Series, 1992(1):i–30, 1992. doi:10.1177/014662169201600206.

11

Georg Rasch. Studies in mathematical psychology: I. Probabilistic models for some intelligence and attainment tests. Nielsen & Lydiche, Oxford, England, 1960.

12

Davor Runje and Sharath M. Shankaranarayana. Constrained monotonic neural networks. arXiv preprint arXiv.2205.11775, 2023. URL: https://arxiv.org/abs/2205.11775.

13

Fumiko Samejima. Estimation of latent ability using a response pattern of graded scores. Psychometrika, 34(S1):1–97, 1969. doi:10.1007/bf03372160.

14

Youngsuk Suh and Daniel M. Bolt. Nested Logit Models for Multiple-Choice Item Response Data. Psychometrika, 75(3):454–473, 2010. doi:10.1007/s11336-010-9163-7.

15

Christopher J. Urban and Daniel J. Bauer. A deep learning algorithm for high-dimensional exploratory item factor analysis. Psychometrika, 86(1):1–29, 03 2021. doi:10.1007/s11336-021-09748-3.

16

Joakim Wallmark and Marie Wiberg. The bit scale: a metric score scale for unidimensional item response theory models. Psychometrika, pages 1–17, 2025. doi:10.1017/psy.2025.10071.

17

Wim J. Van der Linden. Handbook of Item Response Theory, Volume Two: Statistical Tools. CRC Press, Boca Raton, 2016.

18

Joakim Wallmark, Maria Josefsson, and Marie Wiberg. Introducing Flexible Monotone Multiple Choice Item Response Theory Models and Bit Scales. arXiv preprint arXiv.2410.01480, 2024. URL: https://arxiv.org/abs/2410.01480.