|
Instead of what is usually understood as lemma, the tectogrammatical representation introduces t-lemma. This is, especially in the English part, still mostly identical with the base form of a word, but some parts of speech are already rendered by a string introduced by #. This applies e.g. to personal pronouns and negation particles. The original word is normally present on the lower layers, but in the tectogrammatical tree it is encoded by the given t-lemma and a combination of grammatemes (a set of cognitive and grammatical categories - for more detail see Section Grammatemes). For instance, the pronoun he would get the t-lemma #PersPron and grammatemes for definiteness, gender and number. A few t-lemmas with # (we will call them t-lemma substitutes) do not represent any node present on the lower layers, but they are only present on the tectogrammatical layer. Nodes that do not correspond to any surface nodes are called generated nodes. They either get a t-lemma substitute, or they are copies of nodes located elsewhere in the text. All generated nodes have the attribute value is_generated="1".
|