This is not the latest version of this item. The latest version can be found here.
KoKo German L1 Learner Corpus v1
Please use the following text to cite this item:
Abel, Andrea; Glaznieks, Aivars and Culy, Chris, 2012, KoKo German L1 Learner Corpus v1, CLARIN DSpace, http://hdl.handle.net/20.500.12124/10
Date issued
2012-12
Size
1503 texts, 950,000 tokens
Description
The KoKo Corpus is an error-annotated learner corpus of texts written by L1
speakers of German. It was created to investigate and describe the writing
skills of German-speaking secondary-school pupils at the end of their school
career by analysing authentic texts produced in classrooms.
The corpus-building process was guided by two goals:
1. describe writing skills at the transition from secondary school to
university,
2. determine external factors that may influence the distribution of writing
skills, such as region, sociolinguistic factors (gender, age), socio-economic
factors, and language-related biographical factors (L1, preferred variety of
German, reading and writing habits, etc.).
The pupils were selected from three different German-speaking areas: North
Tyrol (Austria), South Tyrol (Italy), and Thuringia (Germany).
Classes were sampled randomly, stratified by the size of the city in which the
school was located (small vs. medium vs. big) and by the type of school
(providing general education vs. education specific to a particular
profession). Since data were collected during regular courses, the corpus as a
whole reflects the typical composition of secondary-school classes in the three
regions. Most of the participants are native speakers of German (n=1319,
82.7%).
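The stratified random sampling of classes described above can be illustrated with a minimal Python sketch. It is not the procedure the corpus authors used: the function name `sample_classes_by_stratum`, the dictionary keys `city_size` and `school_type`, the label "vocational" for profession-specific schools, and the `per_stratum` quota are all assumptions made for illustration. The description does not say how many classes were drawn per stratum.

```python
import random
from collections import defaultdict


def sample_classes_by_stratum(classes, per_stratum, seed=0):
    """Draw up to `per_stratum` classes at random from each stratum.

    A stratum is a (city_size, school_type) pair. `classes` is a list of
    dicts with the hypothetical keys "city_size" and "school_type".
    """
    rng = random.Random(seed)  # seeded so the draw is reproducible

    # Group the candidate classes by stratum.
    strata = defaultdict(list)
    for cls in classes:
        strata[(cls["city_size"], cls["school_type"])].append(cls)

    # Sample without replacement inside each stratum, in a fixed order.
    sample = []
    for key in sorted(strata):
        members = strata[key]
        k = min(per_stratum, len(members))
        sample.extend(rng.sample(members, k))
    return sample
```

Called with a list of candidate classes and, say, `per_stratum=2`, the function returns at most two classes for each combination of city size and school type, so that no combination is left out of the sample by chance.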
Person-related metadata provides information about:
- writer's L1
- writer's gender
- type of school the essay comes from
- location of the school the essay comes from
- grade attended at data collection
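As a rough illustration of how the person-related metadata listed above might be handled in analysis code, the following Python sketch models one record per writer and counts records by a chosen field. The class name `WriterMetadata`, the field names, and the value encodings are hypothetical; the corpus's actual file format and attribute names are not described here.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class WriterMetadata:
    """One writer's metadata; field names are illustrative assumptions."""
    l1: str               # writer's L1
    gender: str           # writer's gender
    school_type: str      # type of school the essay comes from
    school_location: str  # location of the school the essay comes from
    grade: int            # grade attended at data collection


def count_by(records, field):
    """Count how many records share each value of the given field."""
    return Counter(getattr(record, field) for record in records)
```

For example, `count_by(records, "l1")` would give the number of writers per first language, which is the kind of breakdown behind the proportion of native speakers reported in the description.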