Book chapter
Assessing AI capabilities with education tests
AI and the Future of Skills, Volume 2, pp.40-64
Educational Research and Innovation, OECD Publishing
11/16/2023
DOI: 10.1787/bbdeb1e0-en
Abstract
This chapter introduces three exploratory studies that assessed the capabilities of artificial intelligence (AI) through standardised education tests designed for humans. The first two studies, conducted in 2016 and 2021/22, asked experts to evaluate AI’s performance on the literacy and numeracy tests of the OECD’s Survey of Adult Skills (PIAAC). The third study collected expert judgements of whether AI can solve science questions from the OECD's Programme for International Student Assessment (PISA). The studies aimed to refine the assessment framework for eliciting expert knowledge on AI using established educational assessments. They explored different test formats, response methodologies and rating instructions, along with two distinct assessment approaches. A “behavioural approach” used in the PIAAC studies emphasised smaller expert groups engaging in discussions, and a "mathematical approach" adopted in the PISA study relied more heavily on quantitative data from a larger expert pool. This chapter presents the results of the studies and discusses the advantages and disadvantages of their methodological approaches.
Details
- Title: Subtitle
- Assessing AI capabilities with education tests
- Creators
- Mila Staneva - Organisation de Coopération et de Développement EconomiquesAbel Baret - Organisation de Coopération et de Développement EconomiquesÁngel Aso-Mollar - Universitat Politècnica de ValènciaJoseph BlassSalvador Carrión PonzVincent Conitzer - Carnegie Mellon UniversityUlises Cortes - Universitat Politècnica de CatalunyaPradeep Dasigi - Allen InstituteAngel de Paula - Universitat Politècnica de ValènciaCarlos Galindo - Universitat Politècnica de ValènciaJanice Gobert - Rutgers Sexual and Reproductive Health and RightsJordi Gonzàlez - Universitat Autònoma de BarcelonaFredrik Heintz - Linköping UniversityJim HendlerDaniel HendrycksLawrence Hunter - University of Colorado Anschutz Medical CampusJuan Izquierdo-Domenech - Universitat Politècnica de ValènciaMaria JuarezAina Juraco FriasAviv KerenRik Koncel-KedziorskiDavid Leake - Indiana UniversityBao Sheng Loe - University of CambridgeFernando Martinez-Plumed - Universitat Politècnica de ValènciaAqueasha Martin-Hammond - Indiana UniversityCynthia Matuszek - University of Maryland, College ParkAntoni Mestre GascónJose Andres Moreno - Universitat Politècnica de ValènciaConstantine NakosTaylor OlsonCarolyn Rose - Carnegie Mellon UniversityAreg Mikael Sarvazyan - Universitat Politècnica de ValènciaBrian Scassellati - Yale UniversityWout Schellaert - Universitat Politècnica de ValènciaClaes Strannegård - Chalmers University of TechnologyNeset Tan - University of AucklandTadahiro Taniguchi - Panasonic (Poland)Karina Vold - University of TorontoMichael Wooldridge - University of Oxford
- Resource Type
- Book chapter
- Publication Details
- AI and the Future of Skills, Volume 2, pp.40-64
- Series
- Educational Research and Innovation
- DOI
- 10.1787/bbdeb1e0-en
- eISSN
- 2076-9679
- ISSN
- 2076-9660
- Publisher
- OECD Publishing; Paris
- Number of pages
- 25
- Language
- English
- Date published
- 11/16/2023
- Academic Unit
- Computer Science
- Record Identifier
- 9984958640502771
Metrics
3 Record Views