Tasks:
- Check the web for corpus-linguistics-oriented sites. Hints:
  - Birmingham and the COBUILD dictionaries
  - Lancaster and the CLAWS tagger
  - Pennsylvania (LDC; Penn Treebank)
  - Sussex (SUSANNE)
- List the available corpora for English: BNC, ...
- List the available corpora for German: ...
- Find basic information about Perl, the language most frequently used nowadays in Corpus Linguistics.
- What is a "tagger"? What is a "tagset"?
- What is "markup"? What is "annotation"? What is an "annotation graph"? (Check Pennsylvania again.)
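As a starting hint for the tagger and tagset questions, here is a minimal sketch in Python of a lookup tagger. The tagset, the lexicon, and the function name `tag` are all invented for illustration and do not correspond to CLAWS, the Penn Treebank, or any other real resource; real taggers also use context to resolve ambiguous words, which this sketch does not attempt.

```python
# Toy tagset: a hypothetical, much reduced inventory of part-of-speech labels.
TAGSET = {
    "DET": "determiner",
    "NOUN": "noun",
    "VERB": "verb",
    "UNK": "unknown word",
}

# Toy lexicon: one tag per word form (invented for illustration).
LEXICON = {
    "the": "DET",
    "a": "DET",
    "dog": "NOUN",
    "cat": "NOUN",
    "sees": "VERB",
}


def tag(tokens):
    """Pair each token with its LEXICON tag, or with UNK if it is not listed."""
    return [(token, LEXICON.get(token.lower(), "UNK")) for token in tokens]


if __name__ == "__main__":
    # Print the tagged sentence in the common word/TAG notation.
    for word, label in tag("The dog sees a cat".split()):
        print(f"{word}/{label}")
```

In this sketch the tagset is the fixed list of labels, and the tagger is the procedure that assigns one of those labels to every token of a text.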
Dafydd Gibbon, Wed Feb 12 10:50:41 MET 2003 Automatically generated, links may change - update every session.