LEKO v1.0

Abel, Andrea

Please use the following text to cite this item or export to a predefined format:

Share

dc.contributor.author	Abel, Andrea
dc.contributor.author	Zanasi, Lorenzo
dc.contributor.author	Nicolas, Lionel
dc.contributor.author	Konecny, Christine
dc.contributor.author	Autelli, Erica
dc.date.accessioned	2023-02-22T09:50:42Z
dc.date.available	2023-02-22T09:50:42Z
dc.date.issued	2021
dc.description	The LEKO corpora LEKO_Kolipsi and LEKO_Merlin provide lexical annotations for phraseological elements in Italian L2 writing on the basis of a subset of the texts of the Kolipsi-1 corpus and the Merlin corpus respectively. The annotations were jointly created by the University of Innsbruck (Austria) and Eurac Research Bolzano (Italy) within the project LEKO, whose aim was to describe the use of phrasemes in these texts. There are manual annotations for phraseme category, lexical errors, morpho-syntactic features and error explanations. LEKO_Kolipsi contains about 55 000 tokens in 282 texts from 141 pupils of the final year of upper secondary school, representing two different text types (email and letter, narrative and argumentative genre) as described in the Kolipsi-1 documentation. LEKO_Merlin contains about 9 000 tokens in 50 texts from 50 examinees, who took part in an official language test (TELC) for Italian. The documents have been transcribed according to the Kolipsi-1 and Merlin Transcription guidelines. Annotation guidelines for the lexical annotations can be found here. Note: The LEKO corpora do not contain manual annotations for non-lexical errors, foreign word insertions, target language transcriptions, ambiguous writings or other annotations available in the base corpora Kolipsi-1 and Merlin. In order to retrieve any of those annotations and/or full target versions of the student writings please consult the base corpora directly.
dc.identifier.uri	http://hdl.handle.net/20.500.12124/33
dc.language.iso	ita
dc.publisher	Institute for Applied Linguistics, Eurac Research
dc.relation.isreferencedby	http://hdl.handle.net/10863/7683
dc.rights	CLARIN ACADEMIC END-USER LICENCE (ACA-BY-NC-NORED 1.0)
dc.rights.label	ACA
dc.rights.uri	https://gitlab.inf.unibz.it/commul/var/eurac-licenses/-/raw/v1.0/EULA-CLARIN-ACA-BY-NC-NORED.md
dc.subject	Phraseology
dc.subject	Phrasemes
dc.subject	Lexical combinations
dc.subject	learner language
dc.subject	student writing
dc.subject	non-standard language
dc.title	LEKO v1.0
dc.type	corpus
local.branding	Eurac Research
local.contact.person	Aivars Glaznieks porta@eurac.edu Eurac Research
local.contact.person	Jennifer-Carmen Frey porta@eurac.edu Eurac Research
local.demo.uri	https://commul.eurac.edu/annis/leko
local.files.count	7
local.files.size	8685842
local.has.files	yes
local.hasCMDI	false
local.hidden	false
local.language.name	Italian
local.size.info	64 000 tokens
local.size.info	332 texts
local.sponsor	nationalFunds 02/40.3 Autonomous Province of Bozen/Bolzano LeKo - Lexemkombinationen und typisierte Rede im mehrsprachigen Kontext. Authentische Sprachdaten für die Erarbeitung didaktischer Materialien zur italienischen Wortkombinatorik für deutschsprachige L2-Lerner
metashare.ResourceInfo#ContentInfo.mediaType	text

Collections

Eurac Research: Learner Language
PORTA

This item isAcademic Use

and licensed under:

CLARIN ACADEMIC END-USER LICENCE (ACA-BY-NC-NORED 1.0)

Files in this item

This item contains no files.

Show simple item record