| Hamburg Map Task Corpus | |
|---|---|
| Full name | Hamburg Map Task Corpus |
| Type | spoken/audio/exmaralda |
| Project | Z2 "Computer Assisted Methods for the creation and analysis of multilingual data" |
| Data owner | Hamburger Zentrum für Sprachkorpora (corpora@uni-hamburg.de) |
| Short description | Audio recordings of map tasks with adult L2 users of German. The speakers' L1s and L2 proficiency levels vary. The maps used for the tasks are available. |
| Keywords | adult L2 acquisition, learner corpus, task-oriented communication, successive bilingualism, L2 data, adult bilingualism, simultaneous bilingualism, map task |
| Language(s) | German, Standard (deu) |
| Reference | Hedeland, Hanna & Schmidt, Thomas (2012): Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German. Submitted to: Schmidt, Thomas & Wörner, Kai (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism (14). Amsterdam: John Benjamins. |
| Transcription | Orthographic transcription according to a simplified version of HIAT |
| Annotations | pos [= part of speech (TreeTagger, STTS tagset)], pos-sup [= superordinate part of speech (manual, STTS tagset)], pho [= phonetic annotation], c [= indicates that the automatic pos annotation is incorrect], lemma [= lemma], disfluency [= disfluency] |
| Size | 26 speakers, 24 communications, 24 transcriptions, 24 recordings (total duration: 3:17:48.79, h:mm:ss), 21433 transcribed words |
| Access | Password-protected access. Passwords are issued on request; please email corpora@uni-hamburg.de. |
| Version(s) | 0.1 [2010-09-16] - First version, transcriptions of all recordings, no annotations yet; 0.2 [2011-09-30] - Disfluency annotation, POS tagging, lemmatization |
| Corpus homepage | http://vs.corpora.uni-hamburg.de/corpora/z2-hamatac/public/index.html |
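
The annotation layers listed above could be read alongside the transcription roughly as follows. This is a minimal sketch, not the corpus's documented tooling. It assumes the transcriptions are EXMARaLDA basic transcriptions in which each layer is a `tier` whose `category` attribute matches the annotation name (`pos`, `lemma`, and so on) and whose events share start and end timeline points with the transcribed words. The sample XML, the `v` category for the transcription tier, and the function names are hypothetical and written for illustration only.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, hand-written sample in the style of an EXMARaLDA basic
# transcription; it is NOT taken from the corpus.
SAMPLE_EXB = """<basic-transcription>
  <basic-body>
    <common-timeline>
      <tli id="T0" time="0.0"/>
      <tli id="T1" time="0.4"/>
      <tli id="T2" time="0.9"/>
    </common-timeline>
    <tier id="TIE0" speaker="SPK0" category="v" type="t">
      <event start="T0" end="T1">links </event>
      <event start="T1" end="T2">abbiegen </event>
    </tier>
    <tier id="TIE1" speaker="SPK0" category="pos" type="a">
      <event start="T0" end="T1">ADV</event>
      <event start="T1" end="T2">VVINF</event>
    </tier>
    <tier id="TIE2" speaker="SPK0" category="lemma" type="a">
      <event start="T0" end="T1">links</event>
      <event start="T1" end="T2">abbiegen</event>
    </tier>
  </basic-body>
</basic-transcription>"""


def events_by_category(exb_xml):
    """Group (start, end, text) events by the category of their tier."""
    root = ET.fromstring(exb_xml)
    tiers = defaultdict(list)
    for tier in root.iter("tier"):
        category = tier.get("category")
        for event in tier.iter("event"):
            text = (event.text or "").strip()
            tiers[category].append((event.get("start"), event.get("end"), text))
    return dict(tiers)


def align(tiers, base="v", annotation="pos"):
    """Pair each base-tier event with the annotation spanning the same interval.

    Assumes annotations share exact start/end points with the base events;
    events without a matching annotation are paired with None.
    """
    labels = {(start, end): text for start, end, text in tiers.get(annotation, [])}
    return [(text, labels.get((start, end))) for start, end, text in tiers.get(base, [])]


tiers = events_by_category(SAMPLE_EXB)
for word, tag in align(tiers, annotation="pos"):
    print(word, tag)
```

Matching on identical start and end points keeps the sketch short. Annotations that span several words, or tiers belonging to different speakers, would need interval-based matching and a filter on the `speaker` attribute.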