AMI Corpus Participant IDs Explained

(Source http://www.idiap.ch/amicorpus/documentations/ami-corpus-participant-ids-explained)

Describing the information held in participant IDs

Participant IDs in the AMI Meeting corpus take the form: [MF][IET][EDO][0-9][0-9][0-9]

A limited number of participant IDs also have a further two or three letters at the end. Examples of participant IDs are MIO016, FEE088 and MTD012ME.

First letter: gender. Must be either M or F.

Second letter: location. Must be I, E or T, for the location in which the participant was recorded: IDIAP, Switzerland; University of Edinburgh, UK; or TNO, Holland.

Third letter: native language. E, D or O, for English, Dutch or Other.

Numbers: three digits chosen to make the identifier unique.

For TNO meetings, the participant's role in the scenario meeting is normally appended to the participant ID: PM, ID, ME and UID stand for Project Manager, Industrial Designer, Marketing Expert and User Interface Designer respectively.

Further information about the participants, regarding language skills, was collected in a questionnaire. It is available when you download the NXT-format information, as corpusResources/participants.xml. Information mapping channel and camera numbers to these participants (and their roles for scenario meetings) is in meetings.xml.
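
The participant ID format can be checked and decoded mechanically. The following is a minimal sketch in Python, not part of the corpus tooling: the function name parse_participant_id and the dictionary names are hypothetical, and restricting the optional suffix to exactly PM, ID, ME or UID is an assumption based on the role list given for TNO meetings.

```python
import re

# Pattern taken from the ID description, plus an optional role suffix.
# Limiting the suffix to PM, ID, ME or UID is an assumption of this sketch.
PARTICIPANT_ID = re.compile(r"([MF])([IET])([EDO])([0-9]{3})(PM|ID|ME|UID)?")

SITES = {"I": "IDIAP", "E": "Edinburgh", "T": "TNO"}
LANGUAGES = {"E": "English", "D": "Dutch", "O": "Other"}
ROLES = {
    "PM": "Project Manager",
    "ID": "Industrial Designer",
    "ME": "Marketing Expert",
    "UID": "User Interface Designer",
}


def parse_participant_id(pid):
    """Split a participant ID into its fields (hypothetical helper)."""
    match = PARTICIPANT_ID.fullmatch(pid)
    if match is None:
        raise ValueError(f"not a valid participant ID: {pid!r}")
    gender, site, language, number, role = match.groups()
    return {
        "gender": gender,
        "site": SITES[site],
        "native_language": LANGUAGES[language],
        "number": number,  # kept as a string so leading zeros survive
        "role": ROLES.get(role),  # None when no role suffix is present
    }


print(parse_participant_id("MTD012ME"))
```

The number is kept as a string because the leading zeros are part of the identifier.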

AMI Corpus Meeting IDs Explained 

(Source http://www.idiap.ch/amicorpus/documentations/ami-corpus-meeting-ids-explained)

How to read AMI meeting IDs, plus a list of the IDs used in the corpus.

IDs take the form [IETB][SNB][1-5][0-9][0-9][0-9][a-z].

First character: I for IDIAP, E for Edinburgh, T for TNO, B for Brno

Second character: S for scenario-based using our remote control design scenario, N for naturally occurring, B for other scenario-based elicitations (e.g., using the ISSCO office move scenario or the movie club scenario).

Four digits: 1000 series for IDIAP, 2000 for Edinburgh, 3000 for TNO, 4000 for ISSCO (although these were recorded in the IDIAP room), and 5000 for Brno.

For S (remote control scenario meetings), the suffix a/b/c/d marks the first, second, third or fourth meeting in the trial.

For N (natural meetings), the final a-z is optional and used for meetings in the same series (i.e., of the same group, which may or may not be exactly the same participants). If there is only one meeting in the series, the letter may be omitted.
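
The rules above can be sketched as a small decoder. This is an illustrative Python example only: the function name parse_meeting_id and the dictionary names are hypothetical, and treating the final letter as optional for every meeting type (not just N) is an assumption made so that IDs without a letter are accepted.

```python
import re

# [IETB][SNB][1-5][0-9][0-9][0-9] followed by an optional lowercase letter.
MEETING_ID = re.compile(r"([IETB])([SNB])([1-5])([0-9]{3})([a-z])?")

RECORDING_SITES = {"I": "IDIAP", "E": "Edinburgh", "T": "TNO", "B": "Brno"}
MEETING_TYPES = {
    "S": "remote control scenario",
    "N": "naturally occurring",
    "B": "other scenario-based elicitation",
}
# The first digit of the four-digit number selects the series.
NUMBER_SERIES = {"1": "IDIAP", "2": "Edinburgh", "3": "TNO", "4": "ISSCO", "5": "Brno"}


def parse_meeting_id(meeting_id):
    """Split a meeting ID into its fields (hypothetical helper)."""
    match = MEETING_ID.fullmatch(meeting_id)
    if match is None:
        raise ValueError(f"not a valid meeting ID: {meeting_id!r}")
    site, kind, series, rest, suffix = match.groups()
    return {
        "site": RECORDING_SITES[site],
        "type": MEETING_TYPES[kind],
        "series": NUMBER_SERIES[series],
        "number": series + rest,
        "suffix": suffix,  # None when the ID has no trailing letter
    }


print(parse_meeting_id("ES2002a"))
print(parse_meeting_id("IB4001"))
```

The site letter and the number series are reported separately because they can differ: the 4000 series belongs to ISSCO even though those meetings carry the IDIAP site letter.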

The complete set of meeting IDs used is below:

ES2002-ES2016 a-d;
EN2001a-e; EN2002a-d; EN2003a; EN2004a; EN2005a; EN2006a-b; EN2009b-d;
IB4001; IB4002; IB4003; IB4004; IB4005; IB4010; IB4011;
IS1001-IS1009 a-d EXCEPT IS1002a, IS1005d;
IN1001; IN1002; IN1005; IN1007; IN1008; IN1009; IN1012; IN1013; IN1014; IN1016;
TS3003-TS3012 a-d.
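
For scripting, the range notation in this list can be expanded into individual IDs. The sketch below is one possible reading, not an official listing: the helper name expand is hypothetical, and it assumes that "a-d" means every letter from a through d for each number in the range.

```python
def expand(prefix, first, last, letters):
    """Return prefix + number + letter for each number in first..last and each letter."""
    return [
        f"{prefix}{number}{letter}"
        for number in range(first, last + 1)
        for letter in letters
    ]


# The two scenario meetings listed as exceptions in the IS1000 series.
EXCLUDED = {"IS1002a", "IS1005d"}

meeting_ids = (
    expand("ES", 2002, 2016, "abcd")
    + expand("EN", 2001, 2001, "abcde")
    + expand("EN", 2002, 2002, "abcd")
    + ["EN2003a", "EN2004a", "EN2005a", "EN2006a", "EN2006b"]
    + expand("EN", 2009, 2009, "bcd")
    + [f"IB{n}" for n in (4001, 4002, 4003, 4004, 4005, 4010, 4011)]
    + [m for m in expand("IS", 1001, 1009, "abcd") if m not in EXCLUDED]
    + [f"IN{n}" for n in (1001, 1002, 1005, 1007, 1008, 1009, 1012, 1013, 1014, 1016)]
    + expand("TS", 3003, 3012, "abcd")
)

print(len(meeting_ids))
```

The resulting list can be compared against the meetings actually present in a download to spot missing or unexpected recordings.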


Copyright © 2006 by AMI project, All Rights Reserved.