next up previous
Next: References Up: Class notes on Diphone Previous: Diphone database construction

Segmentation and alignment

Many programmes have been developed over the past thirty years for the segmentation of speech signals and the alignment of symbol strings with them. Mark Liberman has provided a broad overview of work on tools of this kind (check for this on the web).

The best known systems are currently the following:

  1. Transcriber 1.2 (freeware, Edouard Geoffrois, Claude Barras, Zhibiao ..., Mark Liberman)
  2. esps/waves+ (Entropic)


Dafydd Gibbon, Mon Dec 21 10:23:16 CET 1998