AMI Corpus Pen, whiteboard and slide data
Details about the collection, pre-processing, and file formatting of auxiliary data, i.e. output from handwritten notes, whiteboard, and PowerPoint slides
Collection
To capture auxiliary data, a separate capture PC is used, which reads
the MIDI Time Code generated by the MTP-AV to accurately time-stamp the
data and ensure it is synchronized with the audio and video streams.
Handwritten notes
Each participant has access to a Logitech I/O digital pen throughout the
scenario. The pen stores the time stamped x-y co-ordinates of any pen
strokes made on special paper that contains an embedded 2D bar code. The
pen strokes are then downloaded to the auxiliary capture PC as xml
files for subsequent processing. The pens are synchronized to the
auxiliary capture PC at the beginning and end of each meeting. Since
they are not connected to the synchronization equipment during the
meeting, precise calibration cannot be guaranteed. In practice, the
pens' internal clocks do not drift by more than a few seconds during
each meeting, resulting in sufficiently calibrated data.
Whiteboard
An eBeam System 2 digital whiteboard is used to capture any pen strokes
the participants make. These are stored in XML format as time stamped
x-y coordinates of the pen.
Beamer
Any slides presented on the beamer are captured via a VisionRGB-Pro VGA
capture card, installed in the auxiliary capture PC and stored as JPEG
images. Each image is time-stamped using the MTC for accurate
integration with other data streams.
Pre-processing
The XML files generated by the E-Beam whiteboard capture system and the
Logitech IO pens are converted into JPEG images showing what the
participants wrote. A DIV-X movie of these files, synchronized with the
audio and video recordings is also produced. This shows when the strokes
were made, and whether it was done on the whiteboard or in the
participant's notebook. A transcription of written data is also
generated using an optical character recognition technique based on Liwicki and Bunke (2005). Optical character recognition based on the technique described in Chen et al. (2004)
is used to produce transcriptions of the captured slides. This, along
with the captured JPEG images and the HTML pages to which they are
attached forms the pen and whiteboard data component of the database.
File Formats
For an explanation of AMI meeting IDs, see AMI Corpus Meeting IDs Explained.
Pen data
When available, data from participants' handwritten notes are stored alongside the meeting's audio and video files in the /pens
subdirectory. For each page of written notes, there should be three
files with AVI, JPEG, and PEN extensions. File IDs take the form [meetingID].pen[1-4]-Page[0-9][0-9][0-9][0-9][0-9][0-9][0-9].200[0-9].
The four numbers after ‘Page’ correspond to the notebook page number.
This is followed by an underscore and the date the meeting took place,
e.g. ‘03.10.2005’. AVI files are video clips that show a re-drawing of
all pen stroke sequences for a given page (although not in the same time
as the original event). Still images showing a page of notes are stored
in JPEG files. Raw pen data may be found in XML files with the PEN
extension.
Whiteboard data
Whiteboard data can be found in a meeting's /whiteboard subdirectory, and also consist of a set of AVI, JPEG, and XML files. File IDs are very straightforward, taking the form [meetingID].strokes.
As described above for pen data, AVI files encode short video clips in
which pen stroke sequences are redrawn. Still images of whiteboard
content are stored in JPEG files, and raw whiteboard data are in XML
files.
Slide data
The "slides" directory contains JPEG files that are screenshots of the
automatically captured projection. A software was integrated into the
system to automatically detect the slide changes. Associated plain text
files are the automatic optical character recognition outputs. A set of
HTML files allows to quickly browse the slides on a web browser; two XML
files are for use in JFerret browser.