Other approaches

TextTiling, while an important algorithm in topic segmentation as has been shown above, is not the only approach available for discerning or segmenting topics in data. Other statistical topic cohesion models exist, such as [1], [9], and [16]. Further, while not a focus of this literature review, segmentation approaches exist for other kinds of media such as audio data and multimedia ([14], [21]).

Audio topic segmentation (in the case of [14] working on broadcast news recordings) often makes use of information not available in a transcript, such as prosodic and pitch-change cues in the recorded voice signal. This is an alternative approach to topic segmentation in spoken dialogue, and while complimentary to present research it is outside its scope.

Multimedia segmentation looks for different kinds of segments entirely; in [21], Wilcox and Boreczky work on audio and video data, using Hidden Markov Models to detect camera shot changes, transitions, fades, and dissolves. While useful, this is a different kind of segmentation, analogous to using paragraph breaks as topic markers in text: it relies on the original document containing markers hoped to coincide with topic change, and does not attempt to find topic change in the content itself.



Subsections
James Ballantine 2005-02-19