7.1 Evaluation levels

In addition to formal validation and ergonomic testing (Gibbon et al., 1997), which will take place at a later stage, the CoGesT annotation system ideally needs to be evaluated on two levels:

As for inter-annotator consistency, the annotations of different annotators can be compared automatically. Due to the limited exactness of segmentation a threshold defining a certain granularity has to be specified.

The two variables involved are:

Manual annotations show a tendency of having inconsistent deviations from a systematic annotation. To evaluate intra-annotator consistency (consistency of the annotations of the same set of gestures by a single annotator), the following experiment needs to be conducted:

Only a basic inter-annotator evaluation of the CoGesT system has been carried out so far. A complete evaluation as described above is in preparation and will be published in a separate document at some later stage.

Thorsten Trippel 2003-08-12