Issues concerning the collection and pre-processing of audio, video, and auxiliary data, and resolution status
As indicated in the table below, some of the recordings are affected by signal discontinuities. In most cases, this was due to a sudden failure of the audio recording equipment. To generate transcriptions and annotated data for these files, the signal sections were concatenated, with zero padding (performed on the 48 kHz files) inserted to match the length of each discontinuity. Frame dropping also occurred in some of the video signals collected in Edinburgh and at IDIAP when they were encoded from DV tape to DivX. These videos have undergone additional processing and are now synchronized with the audio signals.
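A minimal sketch of the zero-padding step described above, assuming the two recorded sections are available as sequences of integer samples and the length of the dropout is known in seconds. The function name `bridge_dropout` and its parameters are hypothetical; the tooling actually used on the corpus is not described here.

```python
SAMPLE_RATE_HZ = 48_000  # the text states that padding was performed on the 48 kHz files


def bridge_dropout(before, after, gap_seconds, sample_rate=SAMPLE_RATE_HZ):
    """Join two recorded sections, filling the dropout between them with zeros.

    `before` and `after` are sequences of integer samples (an assumption made
    for this illustration); `gap_seconds` is the duration of the discontinuity.
    """
    if gap_seconds < 0:
        raise ValueError("gap_seconds must be non-negative")
    # Number of silent samples that corresponds to the length of the dropout.
    gap_samples = round(gap_seconds * sample_rate)
    return list(before) + [0] * gap_samples + list(after)


# Example: a 0.5 s dropout between two very short sections.
joined = bridge_dropout([1, 2, 3], [4, 5], gap_seconds=0.5)
print(len(joined))  # 24005
```

Because the silence has the same duration as the dropout, everything after the gap keeps its original position on the meeting timeline, which is what allows transcriptions and annotations to stay aligned with the other signals.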
Other data collection issues relate to seat swapping by meeting participants. In a few cases, this has resulted in the mislabeling of participant IDs and thumbnails. In a very small number of meetings, headset microphones are positioned incorrectly, and in one meeting a participant removes his headset altogether. These and other data collection and pre-processing issues are presented in the following table, along with details of whether and, where relevant, how each issue was resolved.
MEETING ID | PROBLEM | RESOLVED (Y/N) | NOTES |
* all meetings | Logitech I/O digital pen output not synchronized with rest of data | N | pens' internal clocks do not drift by more than a few seconds during each meeting, providing sufficiently accurate calibration |
E and I meetings | some frame drops in video signals when encoding from DV tape to DivX | Y | video now synchronized with audio signals |
E* meetings | RM RealAudio gain level is low and probably needs amplifying | N | |
| JPEG captured slides are numerous | N | slide change detection mechanism probably too sensitive |
| shared files in other/ directories copied every four meetings | | |
| pen stroke files recorded at end of each trial instead of after every meeting (scenario meetings only) | N | |
ES2002a,b,c | participant 1 not wearing headset mic properly | N/A | lapel mic files okay for this participant |
ES2002b | audio dropout; ME (cam3) and UI (cam2) switch places at 00:11:00; ID (cam1) takes seat off camera (at projector) for remainder of meeting at 00:22:37 | Y | single concatenated file generated, zero-padded from 01:04:04 to 01:40:16 |
ES2002c | ME (cam2) and UI (cam3) switch places at 00:10:20; ID (cam1) and UI (cam2) switch places at 00:14:52 | N/A | closeup videos inappropriately labeled for remaining 25 mins |
ES2004d | 2 audio dropouts; different audio file sizes | Y | section 1, channel 24 padded at end by 512 samples; section 2, channel 19 padded at end by 32768 samples, channels 20-24 padded at end by 512 samples; section 3, channel 19 padded at end by 32768 samples, channels 20-24 padded at end by 512 samples; dropout1 zero-padded from 21:52:03 to 22:50:19; dropout2 zero-padded from 31:02:23 to 31:34:02 |
ES2005a | start of meeting not recorded; captured material lasts around 8 minutes | N | |
ES2006b | cam2 delayed | N | |
ES2006d | different audio file sizes | Y | channels 22, 23 and 24 zero-padded with 512 blank audio samples |
ES2008a | participant loses lapel mic | N/A | headset okay |
ES2008b | all video files end at about 34min45sec while audio continues until the end (37min11sec) | N/A | audio and video in sync |
ES2008c | audio dropout; different audio file sizes | Y | single concatenated file generated, zero-padded from 24:14:01 to 24:27:01 |
ES2009a | encoding problem with cam4 | N | |
ES2010d | audio dropout; different audio file sizes | Y | dropout1 zero-padded from 10:56:17 to 11:05:14; second section, channel 22 zero-padded at end by 32768 samples, and channels 23 and 24 zero-padded at end by 512 samples; dropout2 zero-padded from 13:50:02 to 14:04:00 |
ES2012* | audio quality is very weak | N | |
ES2012b | audio dropout | Y | single concatenated file generated, zero-padded from 23:25:15 to 23:34:19 |
ES2012c | 2 audio dropouts | Y | single concatenated file generated, zero-padded from 12:16:14 to 13:48:10, and 16:32:23 to 16:44:15 |
ES2013b | participants move microphone array B at 00:03:53 | N | it remains mispositioned until the end of the meeting |
ES2016a | audio dropout; different audio file sizes | Y | first section, channel 23 padded at end by 32768 samples, channel 24 padded at end by 512 samples; dropout zero-padded from 22:18:18 to 22:28:17 |
EN2001a | audio dropout; 5th person off-camera | Y | audio zero-padded from 00:03:17:24 to 00:03:30:07; 5th person sitting next to camera 2 |
EN2001d,e | 5th person off-camera | N/A | 5th person sitting next to camera 2 |
EN2005a | audio dropout | Y | audio zero-padded from 01:09:11:24 to 01:09:25:01 |
EN2006b | 2 audio dropouts; participants not properly seated | Y/N | audio zero-padded from 00:26:21:18 to 00:26:38:10 and from 00:26:52:18 to 00:27:07:17; all participants clustered around whiteboard side of table |
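
Several rows in the table resolve "different audio file sizes" by zero-padding individual channels at the end. The following is a minimal sketch of one way to equalize channel lengths, assuming that every channel is padded up to the length of the longest one. The table does not state how the padding amounts were chosen, and the function name `pad_to_longest` and the channel names are hypothetical.

```python
def pad_to_longest(channels):
    """Zero-pad each channel at the end so that all channels have equal length.

    `channels` maps a channel name to a sequence of integer samples (an
    assumption made for this illustration). Returns the padded channels and
    the number of samples appended to each one.
    """
    target = max((len(samples) for samples in channels.values()), default=0)
    padded, added = {}, {}
    for name, samples in channels.items():
        missing = target - len(samples)
        padded[name] = list(samples) + [0] * missing  # append silence at the end
        added[name] = missing
    return padded, added


# Example: three channels of unequal length.
channels = {"ch22": [5, 5], "ch23": [7, 7, 7, 7], "ch24": [9]}
padded, added = pad_to_longest(channels)
print(added)  # {'ch22': 2, 'ch23': 0, 'ch24': 3}
```

Recording the number of samples appended per channel mirrors the way the notes column reports padding amounts such as 512 or 32768 samples.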