International Corpus of Learner English v2 (Handbook + CD-Rom)
Sylviane Granger, Estelle Dagneaux, Fanny Meunier & Magali Paquot
Presses universitaires de Louvain, Louvain-la-Neuve, 2009
The International Corpus of Learner English (Version 2) is a corpus of writing by higher intermediate to advanced learners of English. It contains 3.7 million words of EFL writing from learners representing 16 mother tongue backgrounds (Bulgarian, Chinese, Czech, Dutch, Finnish, French, German, Italian, Japanese, Norwegian, Polish, Russian, Spanish, Swedish, Turkish and Tswana).
It differs from the first version published in 2002 not only by its increased size and range of learner populations, but also by its interface, which contains two new functionalities:
1. built-in concordancer allowing users to search for word forms, lemmas and/or part-of-speech tags;
Note, however, that the unannotated ICLE texts can be downloaded in txt format and processed with any other software tools (concordancer, POS-tagger, parser, etc.).
2. breakdown of the query results according to the learner profile information:
- Distribution of the occurrences of a linguistic query according to the whole set of ICLE-recorded variables (e.g. native language, gender, age, type of task)
- 'Range' analysis: distribution of ICLE variables for the texts that match a corpus collection or a linguistic query
The interface was developed by the Centre de Traitement Automatique du Langage (CENTAL) of the University of Louvain.