Hamburg Map Task Corpus

[Overview] [Previous] [Next]

Full name Hamburg Map Task Corpus
Type spoken/audio/exmaralda
Project Z2 "Computer Assisted Methods for the creation and analysis of multilingual data"
Data owner Hamburger Zentrum für Sprachkorpora (corpora@uni-hamburg.de)
Short description Audio recordings of map tasks with adult L2 users of German. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available.
Keywords adult L2 acquisition, learner corpus, task-oriented communication, successive bilingualism, L2 data, adult bilingualism, simultaneous bilingualism, map task,
Language(s) German, Standard (deu),
Reference Hedeland, Hanna & Schmidt, Thomas (2012): Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German. Submitted to: Schmidt, Thomas & Wörner, Kai (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism (14). Amsterdam: John Benjamins.
Transcription orthographic transcription according to HIAT (simplified HIAT)
Annotations pos [=Part of Speech (TreeTagger, STTS tagset)], pos-sup [=Superordinate part of Speech (manual, STTS tagset)], pho [=Phonetic annotation], c [=Indicates that the automatic pos-annotation is incorrect], lemma [=Lemma], disfluency [=Disfluency],
Size 26 speakers
24 communications
24 transcriptions
24 recordings (total duration: 3:17:48.79 hours)
21433 transcribed words
Access Password protected access. Password will be given upon demand. Please request a password by email to .
Version(s) 0.1 [2010-09-16] - First version, transcriptions of all recordings, no annotations yet
0.2 [2011-09-30] - Disfluency annotation, POS tagging, lemmatization
Corpus homepage: http://vs.corpora.uni-hamburg.de/corpora/z2-hamatac/public/index.html