
Methodology

Outline of development methodology:

Transcription

Normalisation of transcription (filtering)

Count of words, calculation of a priori probabilities

Efficient organisation of language model as tree structure

Classification in terms of perplexity

Test of speech recogniser with and without language model
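
The middle steps of this outline (normalisation, counting, a priori probabilities, tree organisation, perplexity) can be sketched in a few lines of Python. This is only an illustrative sketch, not the method used in the course: the function names, the ASCII-only tokenisation rule, the restriction to bigrams, the sentence markers `<s>` and `</s>`, and the add-one smoothing are all assumptions made for the example.

```python
import math
import re
from collections import Counter

def normalise(line):
    """Lower-case a transcription line and keep only ASCII word tokens (assumed filter)."""
    return re.findall(r"[a-z']+", line.lower())

def unigram_probs(sentences):
    """A priori (unigram) probabilities as relative frequencies."""
    counts = Counter(w for words in sentences for w in words)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def new_node():
    return {"count": 0, "next": {}}

def build_tree(sentences):
    """Bigram counts as a depth-2 tree: root -> history word -> next word."""
    root = new_node()
    for words in sentences:
        padded = ["<s>"] + words + ["</s>"]
        for w1, w2 in zip(padded, padded[1:]):
            hist = root["next"].setdefault(w1, new_node())
            hist["count"] += 1          # how often w1 occurs as a history
            pair = hist["next"].setdefault(w2, new_node())
            pair["count"] += 1          # how often w2 follows w1
    return root

def bigram_prob(tree, vocab_size, w1, w2):
    """P(w2 | w1) with add-one smoothing (an assumption of this sketch)."""
    hist = tree["next"].get(w1, new_node())
    pair = hist["next"].get(w2, new_node())
    return (pair["count"] + 1) / (hist["count"] + vocab_size)

def perplexity(tree, vocab_size, words):
    """exp of the negative mean log probability over all bigrams of one sentence."""
    padded = ["<s>"] + words + ["</s>"]
    log_sum = sum(math.log(bigram_prob(tree, vocab_size, w1, w2))
                  for w1, w2 in zip(padded, padded[1:]))
    return math.exp(-log_sum / (len(padded) - 1))

# Toy transcriptions, invented for the example.
corpus = [normalise(s) for s in ["The cat sat.", "The dog sat!"]]
vocab_size = len({w for s in corpus for w in s} | {"</s>"})
tree = build_tree(corpus)
```

A word sequence that resembles the training transcriptions, such as `perplexity(tree, vocab_size, ["the", "cat", "sat"])`, should receive a lower perplexity than a scrambled one, which is the sense in which perplexity is used to rank language models before they are tested in the recogniser.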

Extensions:

Trigrams, ... (Problem: sparse data, since fewer training examples are available per n-gram as n grows)

Class n-grams

Morphological n-grams (e.g. stem-based)

Integration with multiple knowledge bases, including linguistic parsers
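
As one illustration of these extensions, a class bigram replaces the word-to-word probability by the product P(class2 | class1) * P(word2 | class2), which eases the sparse-data problem because classes are observed far more often than individual word pairs. The following is a minimal sketch under stated assumptions: the word-to-class table, the class labels, the toy sentences and all function names are hypothetical, and no smoothing is applied.

```python
from collections import Counter

def train_class_bigram(sentences, word_class):
    """Collect the counts needed for P(c2 | c1) and P(w | c)."""
    class_pairs = Counter()    # count of (c1, c2) class bigrams
    class_hist = Counter()     # count of c1 as a bigram history
    word_in_class = Counter()  # count of (word, class)
    class_total = Counter()    # count of each class
    for words in sentences:
        classes = [word_class[w] for w in words]
        for w, c in zip(words, classes):
            word_in_class[(w, c)] += 1
            class_total[c] += 1
        for c1, c2 in zip(classes, classes[1:]):
            class_pairs[(c1, c2)] += 1
            class_hist[c1] += 1
    return class_pairs, class_hist, word_in_class, class_total

def class_bigram_prob(model, word_class, w1, w2):
    """P(w2 | w1) approximated as P(c2 | c1) * P(w2 | c2), unsmoothed."""
    class_pairs, class_hist, word_in_class, class_total = model
    c1, c2 = word_class[w1], word_class[w2]
    if class_hist[c1] == 0 or class_total[c2] == 0:
        return 0.0
    p_class = class_pairs[(c1, c2)] / class_hist[c1]
    p_word = word_in_class[(w2, c2)] / class_total[c2]
    return p_class * p_word

# Hypothetical word classes and toy data, invented for the example.
word_class = {"the": "DET", "a": "DET", "cat": "N", "dog": "N",
              "sat": "V", "ran": "V"}
sentences = [["the", "cat", "sat"], ["a", "dog", "ran"]]
model = train_class_bigram(sentences, word_class)
```

In this toy data the word pair "the dog" never occurs, yet `class_bigram_prob(model, word_class, "the", "dog")` is non-zero because a determiner followed by a noun has been observed. Words missing from the class table raise a `KeyError` in this sketch; a real system would need an unknown-word class.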



Dafydd Gibbon
Fri Nov 28 02:24:58 MET 1997