As discussed in chapter 4, annotators were intentionally given only the instruction to ``mark wherever the topic changes'' within documents. As a result, different annotators marked documents in slightly different ways: At one extreme, annotators marked only the start or end of large, clear, coherent topics. At the other extreme, some annotators made a mark every time a topic drifted in or out of immediate focus. For instance, in figure 5.13, all `disjoint' discussion is partitioned individually, such as the non-sequitur utterance ``I can go home now'', seen in figure 5.2.5. We can see here that the topic boundaries are used to constrain a topic amid a sea of noise--an assertion that the change from topic to no-topic is a topic change.
Figure 5.13:
NIST dialogue,
 |
Figure 5.14:
NIST dialogue 2,
 |
Figure 5.15:
A snippet from figure 5.13's source dialogue, showing unexpected topic-boundary marking style
 |
James Ballantine
2005-02-19