D. Information and Further Reading


Updated printable documentation: March 07

Download v.0.2

Notes on version 0.2. In October 2006, we decided to move the NXT documentation from being completely web-based to being written in DocBook, so that we can generate HTML, JavaDoc, and PDF at will. We are rewriting much of the documentation at the same time. Versions of the documentation numbered below v1.0 are incomplete, although the outline gives some idea of our intentions for it. In this version, version 0.2, not all of the information has yet been checked for accuracy. The most likely difficulties concern the following areas: corpus resources, ontologies, and object sets; validation; and the description of data set concepts, which is incomplete. In addition, not all the formatting works, and the query reference manual has not been fully converted to DocBook, so it is incomplete and hard to read in this version.

NXT's history and funding

The NITE XML Toolkit is software that arose out of a European Commission-funded collaboration between the University of Edinburgh's Language Technology Group, the University of Stuttgart's Institut für Maschinelle Sprachverarbeitung (IMS), and the Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI). Although the NITE project itself finished in 2003, the software is now being maintained and further developed via SourceForge; the University of Twente has been a particularly active contributor. NXT is in use on a number of large distributed projects, including JAST and TALK, and on a wide range of corpora, representing everything from Biblical text structure to the relationship between deictic expressions and gestures in multimodal referring expressions. Its users range from individual PhD students up to large multi-site projects, many of whom contribute to development in some way. The AMI consortium is its biggest user and also the largest current contributor to its development. Other past and current funders are the Engineering and Physical Sciences Research Council (UK), the Economic and Social Research Council (UK), and Scottish Enterprise, via the Edinburgh-Stanford Link.

Bibliography

Papers about the NITE XML Toolkit or development concerns

Carletta, J., Evert, S., Heid, U., Kilgour, J. (in press) The NITE XML Toolkit: data model and query. Language Resources and Evaluation Journal.

Carletta, J., Evert, S., Heid, U., Kilgour, J., Robertson, J., Voormann, H. (2003) The NITE XML Toolkit: flexible annotation for multi-modal language data. Behavior Research Methods, Instruments, and Computers, special issue on Measuring Behavior, 35(3), 353-363.

Mayo, N., Kilgour, J., Carletta, J. (2006) Towards an alternative implementation of NXT's query language via XQuery. EACL Workshop on Multi-dimensional Markup in Natural Language Processing, Trento, Italy, April 4th.

Reidsma, D., Jovanovic, H., Hofs, D. (2005) Designing annotation tools based on properties of annotation problems. Measuring Behavior 2005, 5th International Conference on Methods and Techniques in Behavioral Research, 30 August - 2 September 2005, Wageningen, The Netherlands.

Research papers that mention NXT in use (more than in passing)

Blaylock, N., Swain, B., Allen, J. (2009) TESLA: A Tool for Annotating Geospatial Language Corpora. In Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009, Toronto, Canada, May 2009.

Calhoun, S., Nissim, M., Steedman, M., Brenier, J. (2005) A framework for annotating information structure in discourse. In Frontiers in Corpus Annotation II: Pie in the Sky, ACL2005 Conference Workshop, Ann Arbor, Michigan, June 2005.

Carletta, J. C., Kilgour, J. (2005) The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing. In MLMI'04: Proceedings of the Workshop on Machine Learning for Multimodal Interaction, Samy Bengio and Herve Bourlard, eds. Springer-Verlag Lecture Notes in Computer Science Volume 3361. ISBN: 3-540-24509-X. This is an updated version of a workshop paper.

Carletta, J., Dingare, S., Nissim, M., Nikitina, T. (2004) Using the NITE XML Toolkit on the Switchboard Corpus to study syntactic choice: a case study. In Fourth Language Resources and Evaluation Conference, Lisbon, Portugal, May.

Gut, U., Milde, J-T., Voormann, H., Heid, U. (2004) Querying annotated speech corpora. In Speech Prosody (International Conference), Nara, Japan, March 23-26, ed. by Bernard Bel and Isabelle Marlien, ISCA, 569-572.

Heid, U., Voormann, H., Milde, J-T., Gut, U., Erk, K., Pado, S. (2004) Querying both time-aligned and hierarchical corpora with NXT Search. In Fourth Language Resources and Evaluation Conference, Lisbon, Portugal, May.

Isard, A., Brockmann, C., Oberlander, J. (2005) Re-Creating Dialogues from a Corpus. In Proceedings of the Corpus Linguistics 2005 Workshop on Using Corpora for Natural Language Generation, July 2005, Birmingham, U.K.

Pre-NITE paper motivating the concept

Carletta, J., McKelvie, D., Isard, A., Mengel, A., Klein, M. (2005) A generic approach to software support for linguistic annotation using XML. In G. Sampson and D. McCarthy (eds.), Corpus Linguistics: Readings in a Widening Discipline. Continuum International, London and NY. ISBN: 082648803X.

Paper about the NITE project in general

Soria, C., Bernsen, N. O., Cadee, N., Carletta, J., Dybkjaer, L., Evert, S., Heid, U., Isard, A., Kolodnytsky, M., Lauer, C., Lezius, W., Noldus, L., Pirrelli, V., Reithinger, N. (2002) Advanced tools for the study of natural interactivity. Third International Conference on Language Resources and Evaluation (LREC 2002), Las Palmas, Spain, May.