Please use the following text to cite this item or export to a predefined format:
Frey, Jennifer-Carmen; Glaznieks, Aivars and Stemle, Egon W., 2019, DIDI - The DiDi Corpus of South Tyrolean CMC 1.0.0, CLARIN DSpace, http://hdl.handle.net/20.500.12124/7
dc.contributor.authorFrey, Jennifer-Carmen
dc.contributor.authorGlaznieks, Aivars
dc.contributor.authorStemle, Egon W.
dc.date.accessioned2019-03-07T17:58:26Z
dc.date.available2019-03-07T17:58:26Z
dc.date.issued2019-03-07
dc.descriptionThe DiDi corpus has an overall size of around 600.000 Tokens gathered from 136 South Tyrolean Facebook users who participated in the DiDi project. It consists of 11.102 Facebook wall posts, 6.507 wall comments and 22.218 private messages. All messages were written by the participants throughout the year 2013. Please read the fulldescription of the corpus for further details. Please consider also the description of the method of data collection and the full description of the DiDi project and its research questions. As every participant could offer either his/her private messages, his/her texts on the wall or both, the corpus comprises wall posts and wall comments from 130 profiles and private messages of 56 profiles 50 participants granted access to both types of data. Free access to the corpus is given to the wall posts and comments. Due to privacy issues the access to the private messages is restricted. Access to the private messages can be given for scientific research only, after signing a non-disclosure agreement. In case you are interested in the data for scientific reasons, please contact the research team. All texts were anonymised in order to guarantee that the participants' identity cannnot be infered from the texts. The anonymisation included person names, group names, geographical names and adjectival references, institution names, hyperlinks, mail addresses, phone numbers, numbers of bank accounts, servers, postal codes and other private information. Please, read the anonymisation document for the anonymisation keys. The corpus offers a vast range of research opportunities for linguists that are interested in CMC in general, and more specific in multilingual language use, the use of regional varieties, code switching, code shifting and code mixing phenomena, etc. Access to the DiDi corpus: https://commul.eurac.edu/annis/didi
dc.identifier.urihttp://hdl.handle.net/20.500.12124/7
dc.language.isodeu
dc.language.isoita
dc.language.isoeng
dc.language.isolad
dc.publisherInstitute for Applied Linguistics, Eurac Research
dc.relation.isbasedonhttps://gitlab.inf.unibz.it/commul/didi/data-bundle/-/tags/v1.0.0
dc.relation.isreferencedbyhttp://www.eurac.edu/en/research/autonomies/commul/Documents/DiDi/NLP4CMC-2015_DiDi_paper.pdf
dc.relation.isreferencedbyhttp://www.eurac.edu/en/research/autonomies/commul/Documents/DiDi/didi_clic-it2016_FINAL.pdf
dc.rightsCLARIN ACADEMIC END-USER LICENCE (ACA-BY-NC-NORED 1.0)
dc.rights.labelACA
dc.rights.urihttps://gitlab.inf.unibz.it/commul/var/eurac-licenses/-/raw/v1.0/EULA-CLARIN-ACA-BY-NC-NORED.md
dc.source.urihttp://www.eurac.edu/didi
dc.subjectFacebook
dc.subjectSocial Media
dc.subjectComputer-mediated Communication
dc.subjectChat
dc.subjectStatus Updates
dc.subjectComment
dc.subjectSocial Networking Sites
dc.subjectMultilingualism
dc.subjectDialect
dc.subjectSouth Tyrol
dc.subjectInstant Messaging
dc.subjectCMC
dc.titleDIDI - The DiDi Corpus of South Tyrolean CMC 1.0.0
dc.typecorpus
local.brandingCMC & WaC
local.contact.personCorpus Manager clarin@eurac.edu Eurac Research CLARIN Centre (ERCC)
local.demo.urihttps://commul.eurac.edu/annis/didi
local.files.count6
local.files.size70312349
local.has.filesyes
local.hasCMDIfalse
local.hiddenfalse
local.language.nameGerman
local.language.nameItalian
local.language.nameEnglish
local.language.nameLadino
local.size.info23000 texts
local.size.info370000 tokens
local.sponsornationalFunds x Autonome Provinz Bozen - Südtirol, Abteilung Bildungsförderung, Universität und Forschung, Landesgesetz vom 13. Dezember 2006, Nr. 14 ''Forschung und Innovation" / Provincia autonoma di Bolzano - Alto Adige, Ripartizione Diritto allo studio, università e ricerca scientifica, Legge provinciale 13 dicembre 2006, n. 14 ''Ricerca e innovazione'' Digital Natives - Digital Immigrants. Schreiben auf Social Network Sites: Eine korpusunterstützte Sprachbeobachtung des aktuellen Sprachgebrauchs in Südtirol unter besonderer Berücksichtigung des Alters
metashare.ResourceInfo#ContentInfo.mediaTypetext
This item isAcademic Use
and licensed under:
 Files in this item
Loading files... This may take a few seconds as file previews are being generated. If the process takes too long, please contact the system administrator