Literature

1: Allan Birnbaum. Some latent trait models and their use in inferring an examinee's ability. In Frederic M. Lord and Melvin R. Novick, editors, Statistical Theories of Mental Test Scores, pages 395–479. Addison-Wesley, 1968.
2: R. Darrell Bock. Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1):29–51, 1972. doi:10.1007/bf02291411.
3: R. Darrell Bock and Murray Aitkin. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4):443–459, 1981. doi:10.1007/bf02293801.
4: Hua-Hua Chang, Chun Wang, and Zhiliang Ying. Information theory and its application to testing. In Wim J. Van der Linden, editor, Handbook of Item Response Theory, Volume Two: Statistical Tools, volume 2, chapter 7, pages 105–122. CRC Press, Boca Raton, 2016.
5: Ying Cheng, Ke-Hai Yuan, and Cheng Liu. Comparison of reliability measures under factor analysis and item response theory. Educational and Psychological Measurement, 72(1):52–67, 2012. doi:10.1177/0013164411407315.
6: B. W. Domingue, K. Kanopka, R. Kapoor, et al. The intermodel vigorish as a lens for understanding (and quantifying) the value of item response models for dichotomously coded items. Psychometrika, 89:1034–1054, 2024. doi:10.1007/s11336-024-09977-2.
7: Conor Durkan, Artur Bekasov, Iain Murray, and George Papamakarios. Neural spline flows. arXiv preprint arXiv:1906.04032, 2019. URL: https://arxiv.org/abs/1906.04032.
8: Carl F. Falk and Li Cai. Maximum marginal likelihood estimation of a monotonic polynomial generalized partial credit model with applications to multiple group analysis. Psychometrika, 81(2):434–460, 2016. doi:10.1007/s11336-014-9428-7.
9: Doyoung Kim, Rafael Jaime De Ayala, Abdullah A. Ferdous, and Michael L. Nering. The comparative performance of conditional independence indices. Applied Psychological Measurement, 35(6):447–471, 2011. doi:10.1177/0146621611407909.
10: Eiji Muraki. A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16(2):159–176, 1992. doi:10.1177/014662169201600206.
11: Georg Rasch. Studies in mathematical psychology: I. Probabilistic models for some intelligence and attainment tests. Nielsen & Lydiche, Oxford, England, 1960.
12: Davor Runje and Sharath M. Shankaranarayana. Constrained monotonic neural networks. arXiv preprint arXiv:2205.11775, 2023. URL: https://arxiv.org/abs/2205.11775.
13: Fumiko Samejima. Estimation of latent ability using a response pattern of graded scores. Psychometrika, 34(S1):1–97, 1969. doi:10.1007/bf03372160.
14: Youngsuk Suh and Daniel M. Bolt. Nested logit models for multiple-choice item response data. Psychometrika, 75(3):454–473, 2010. doi:10.1007/s11336-010-9163-7.
15: Christopher J. Urban and Daniel J. Bauer. A deep learning algorithm for high-dimensional exploratory item factor analysis. Psychometrika, 86(1):1–29, 2021. doi:10.1007/s11336-021-09748-3.
16: Joakim Wallmark and Marie Wiberg. The bit scale: a metric score scale for unidimensional item response theory models. Psychometrika, pages 1–17, 2025. doi:10.1017/psy.2025.10071.
17: Wim J. Van der Linden. Handbook of Item Response Theory, Volume Two: Statistical Tools. CRC Press, Boca Raton, 2016.
18: Joakim Wallmark, Maria Josefsson, and Marie Wiberg. Introducing flexible monotone multiple choice item response theory models and bit scales. arXiv preprint arXiv:2410.01480, 2024. URL: https://arxiv.org/abs/2410.01480.