Learner corpora around the world

This list is still a work in progress, and we would like it to be as comprehensive as possible. If you have a learner corpus, or know of one that is not listed on this webpage, please send a message to Magali Paquot and we will add it to the list. We hope you will find the list useful for your research!

The list below only contains learner corpora, i.e. electronic collections of continuous written or spoken data produced by foreign or second language learners.

For a list of learner corpus-based datasets (treebanks, error lists, etc.), click here.

To refer to this list:

Centre for English Corpus Linguistics (date of access): Learner Corpora around the World. Louvain-la-Neuve: Université catholique de Louvain. https://uclouvain.be/en/research-institutes/ilc/cecl/learner-corpora-around-the-world.html


© 2019, Université catholique de Louvain

Learner corpora 

Last updated 18 February 2025


Learner corpus-based datasets