next up previous
Next: Computer aided lexicon processing Up: Abidjan Course on Hypertext Previous: Computational Lexicography

Computer aided corpus processing

Three phases of computer aided corpus processing:

  1. corpus specification:

    scenario design and selection

  2. corpus acqusition:

    collection, text mining, speech recording

  3. corpus processing:

    1. verification
    2. corpus lexicon extraction
    3. markup (POS; treebanks)
    4. statistical analysis
    5. machine learning


Dafydd Gibbon, Sat Oct 17 18:58:17 CEST 1998