
10.01.2002: Group Work & Reports

Tasks:

Check the web for

  1. Sites oriented toward corpus linguistics. Hints: Birmingham and the COBUILD dictionaries; Lancaster and the CLAWS tagger.
  2. Pennsylvania (LDC; Penn Treebank)
  3. Sussex (SUSANNE corpus)
  4. List the available corpora for English: BNC, ...
  5. List the available corpora for German: ...
  6. Find basic information about Perl, the language most frequently used nowadays in corpus linguistics.
  7. What is a "tagger"? What is a "tagset"?
  8. What is "markup"? What is "annotation"? What is an "annotation graph"? (check Pennsylvania again).


Dafydd Gibbon, Wed Feb 12 10:50:41 MET 2003