The HCRC Map Task Corpus Annotations Version 1.0 LICENCE: The copyright holder grants to the downloader of these files unrestricted licence to use all the corpus materials (transcription, annotation, tools, documentation) included herein, subject only to the following restriction: the contribution of HCRC is acknowledged in any public presentation or publication of any work based on the corpus. Initial collection, transcription, annotation and publication supported by Economic and Social Research Council (ESRC), UK -- subsequent work supported by the ESRC and the Engineering and Physical Sciences Research Council (EPSRC), UK under various grants, the University of Edinburgh and the HCRC Language Technology Group. The HCRC Map Task Corpus Annotations Version 1.0 carries no warranty of any kind. Since HCRC continues to use the Corpus in our own research, we welcome contact with colleagues engaged in similar projects. For this reason we ask users to notify us at maptask@cogsci.ed.ac.uk as a matter of courtesy of the topic of their intended work with these materials. |
This page contains HCRC's first public release of its annotation of the Map Task Corpus. These annotations include dialogue structure at three levels (moves, games, and transactions), part of speech tags, syntax, gaze, landmark references, and when the participants were using their pens. The annotations are represented in XML using a technique called ``stand-off annotation'' (see Isard, A. (2001) "An XML architecture for the HCRC Map Task Corpus", Proceedings of Bi-Dialog 2001, June 2001, Bielefeld, Germany [PS format] [PDF format]). The annotation release includes updated transcription of the dialogues. Many of the annotations provide pointers to times in the original sound files which allow the speech material to be located easily.
Before using these annotations, we recommend that you consider whether you would prefer to work with the same annotations translated into the format for the NITE XML Toolkit, since they are very similar but come with some graphical user interfaces and a good search facility.