simplex annotations and complex annotations dg 20040427 internal lexicography workshop description of a speech or language corpus general term annotation term used in speech technology labelling term used in text technology markup a simplex annotation eg label is an event eg an occurrence of a word phoneme syllable feature at a specific interval in a corpus a complex annotation consists of one or more tiers of events a tier is a sequence of events of the same type allen relations calculus of intervals 13 relations event logic axiomatic relational logic =def pair of a property and an interval johan van benthem amsterdam applied to phonology in order to explicate autosegmental phonology by steven bird and ewan klein 1989 event phonology event property interval attribute value interval attribute value t_start t_end examples of annotation xwaves espswaves+ propertyt_end eg table 1030 problem the beginning is only implicit and has to be inferred by the user or added in ad hoc fashion implicit partial interval definition sam propertyt_startt_end eg table 1030 1659 corresponding to orth table 1030 1659 praat same as sam but with its own notation tasx same as sam but with xml notation how does this relate to the lexicon? lexical acquisition list of lexical items eg a wordlist problem what is a word? make list by converting the text to a list of words sorting the list of words removing duplicates extract corpus properties of the list items ie microstructure elements which can be inferred from corpus relations by frequency count absolute or relative percent rank ordering lexical representation macrostructure overall structure of dictionary mesostructure generalisations over microstructures definitions of grammar pronunciation cross references eg semantic relations references to corpus eg concordance examples microstructure types of lexical information datcats data categories eg structural properties can be extracted from corpus external context eg collocations internal structure eg derived compound words idioms interpretative properties meaning semantic pragmatic form phonetic orthographic metadata properties local housekeeping properties lexicographer source dates of creation modification note there are global metadata properties which which apply to the whole lexicon eg language corpus used publication details note macrostructure contains mesostructure contains microstructure lexical access