|
This page last changed on Apr 25, 2007 by j-norstad@northwestern.edu.
This note describes in detail Martin Mueller's "NUPOS" part of speech tagset and makes explicit the structure of the tagset and other related morphology objects such as "spellings", "word classes", "lemmas", and "word parts".
As a convention, in this discussion, when we use the term "word", it means "a specific single occurrence of a word somewhere in a text." For the concept of a "word in general", we will use the terms "headword" and "lemma", which we'll define and discuss in detail later.
The full version of NUPOS can handle both Greek and English texts and part of speech tagging. In this note we only describe the subset of NUPOS that deals with English.
Spellings
The first and most basic attribute of a word is its spelling. This may seem to be a simple concept, but especially for earlier texts from periods before spelling became regularized, it is useful to distinguish among several different meanings of the term "spelling". In NUPOS there are three different "spellings" for each word:
1. The "token spelling". This is the spelling of the word exactly as it appears in the original digital source for the text, including all capitalization and any typographical conventions that might be used in the source as markup for various purposes. For example, the original source for a text might contain a word token "common|lie", where the encoders used the vertical bar character "|" to mark up a soft hyphen at the end of a line. As another example, in some early printed texts, a "y" with a superscript "t" was used to represent the word "that". Such a word might be marked up as "y^t" in the source for such a text. As a final example, the token "@abper;fecit" might appear in the source for an early text. In this example "&abper;" is a symbol used in early typesetting as an abbreviation for "per" or "par".
The token spelling retains as much fidelity as possible with the original digital source. It will often contain various kinds of non-uniform markup, as used by the organizations that digitally encoded the texts. It may be of interest to some researchers, but most people will be more interested in the other two kinds of spellings.
The token spelling may be of importance in contexts where an application wishes to reproduce as much visual fidelity as possible with original printed texts when displaying the text to users.
2. The "standard original spelling". This is a version of the spelling with the typographical conventions normalized, and in most contexts is probably what one thinks of when one uses the general term "the spelling of the word". It is usually identical with the token spelling, but not always. In the examples above, the three tokens become the following "standard original spellings":
3. The "standard modern spelling". This is the standard modern orthographic form of the original spelling. But the morphological form is not modernized. Thus a spelling like "lovyth" is regularized to "loveth". "loveth" is not, however, regularized to "loves", but is rather recognized as a standard archaic form. In the three examples above, the standard modern spellings are as follows:
Note that "perfecit" is a Latin word, and at no point is there an attempt made to translate foreign words into English.
For modern texts, the three spellings are nearly always identical.
Word Parts
Words have spellings, as outlined above. We also want to enumerate and discuss in detail their other tagging attributes, such as word class, part of speech, and lemma. Before we can do this, however, we need to discuss a pesky complexity of texts - contractions.
Consider as an example the first word of Hamlet, "Who's". This is a single lexical word, and in this example all three spellings of the word are the same string "Who's".
In terms of the other attributes, however, this word is properly considered to be a lexical representation of the two separate words "who" and "is". Each part has its own word class, part of speech and lemma. In this particular example, it might also be possible to think of each part as having its own spelling or "sub-spelling", "who" and "'s", but in the general case it might be difficult to reasonably split up a spelling into its pieces, and the current version of NUPOS does not attempt to do this.
In Martin's NUPOS, this word "who's" is tagged as follows:
| word part |
major word class |
word class |
part of speech |
lemma |
| 1 |
wh-word |
crq |
q-crq |
who (crq) |
| 2 |
verb |
va |
vaz |
be (va) |
While we might wish that this complexity didn't exist or could be safely ignored, it can be important when analyzing texts. For example, consider the set of all words in Shakespeare which are instances of the auxiliary verb "be". In NUPOS, the first word of Hamlet is correctly included as a member of this set. It is also a member of the set of all words in Shakespeare which are instances of the wh-word "who".
As another example, consider the general notion of counting different kinds of words in Shakespeare. In NUPOS, the count of the total number of occurrences of the auxiliary verb "be" includes the first word of Hamlet, as it should, as does the count of the total number of occurrences of the wh-word "who". The first word of Hamlet is counted twice, once as "be" and once as "who". Note that as a consequence, the sum of the counts of the number of different kinds of words in Hamlet is equal to the number of word parts in Hamlet, not the number of words.
As a final example, consider an analysis of bigrams in Shakespeare. In NUPOS, the first word of Hamlet is considered to be an instance of the bigram "the lemma who (crq) followed by the lemma be (va)", as well as an instance of the bigram "word class crq followed by part of speech vaz".
In the general case, each word, while it usually only has one part, might have more than one part - two parts in the case of most contractions, but at least conceivably perhaps even more than two parts. While it is words which possess spelling attributes, it is their parts which possess the other morphological attributes, and this is an important distinction to keep in mind.
In the normal case, when a word has only one part, we often use the simple term "word" to refer to its unique part. For example, we say "this word is a verb", when to be precise what we are really saying is "the one and only part of this word is a verb."
Word Classes
In NUPOS, each word part has a "major word class" and a "word class". These concepts provide the coarsest ways to categorize words.
There are 16 major word classes, which should be self-explanatory:
adjective
adv/conj/pcl/prep
adverb
conjunction
determiner
foreign word
interjection
negative
noun
numeral
preposition
pronoun
punctuation
symbol
verb
wh-word
Major word classes are subdivided into a slightly finer categorization by "word class". There are 32 word classes in NUPOS:
| Name |
Description |
Major word class |
| j |
adjective |
adjective |
| jn |
adjective/noun |
adjective |
| jp |
proper adjective |
adjective |
| acp |
adverb/conjunction/particle/preposition |
adv/conj/pcl/prep |
| av |
adverb |
adverb |
| cc |
coordinating conjunction |
conjunction |
| cs |
subordinating conjunction |
conjunction |
| da |
adverb/determiner |
determiner |
| dt |
article |
determiner |
| d |
determiner |
determiner |
| fo |
foreign |
foreign word |
| fr |
French |
foreign word |
| gr |
greek |
foreign word |
| la |
Latin |
foreign word |
| uh |
interjection |
interjection |
| xx |
negative |
negative |
| an |
adverb/noun |
noun |
| n |
noun |
noun |
| np |
proper noun |
noun |
| nu |
numeral |
numeral |
| pp |
preposition |
preposition |
| pf |
preposition "of" |
preposition |
| pi |
indefinite pronoun |
pronoun |
| pn |
personal pronoun |
pronoun |
| px |
reflexive pronoun |
pronoun |
| pu |
punctuation |
punctuation |
| ch |
character |
symbol |
| sy |
symbol |
symbol |
| va |
auxiliary verb |
verb |
| vm |
modal verb |
verb |
| v |
verb |
verb |
| crq |
wh-word |
wh-word |
Note that each word class has a very short string which provides a name for the word class, and that each word class belongs to one and only one of the major word classes.
For example, for the major word class "verb", there are three word classes "va" (auxiliary verb), "vm" (modal verb), and "v" (verb). So in NUPOS, there are three kinds of verbs.
Parts of Speech
NUPOS has a fine-grained part of speech tagset, much finer-grained than the word classes and major word classes. There are 185 total English parts of speech in the current version of NUPOS.
Each part of speech belongs to one and only one word class, so the part of speech tagset in NUPOS represents a subdivision of the word class tagset, in the same way that the word class tagset represents a subdivision of the major word class tagset.
To continue the example of verbs, in NUPOS each of the verb word classes contains a number of parts of speech:
Each part of speech, in addition to belonging to a word class, is also characterized by, and largely defined by, how it is used in various grammatical categories. These categories and their possible values should be mostly self-explanatory to those familiar with English grammar.
As an example, the NUPOS part of speech "vmd2" is used for modal verbs used in the second person singular past tense. It has the following attributes in addition to its name "vmd2":
An example of this part of speech occurs in Act 5, Scene 1 of Hamlet, where Gertrude says "I hoped thou shouldst have been my Hamlet's wife;" In this passage, the word "shouldst" is tagged with the lemma "shall (vm)" and the part of speech "vmd2". By virtue of this tagging, we know all of the following facts about this word:
In a full implementation of NUPOS, any of these attributes can be used as a criterion for searching, grouping, sorting, counting, and analysis. For example, a researcher might compare the use of past tense modal verbs by one author to their use by another author, or he might do a search where he finds all uses of second person singular verbs in the works of Chaucer. Or he might find all of the verbs used in Spenser and generate a report which counts up how many times each of them are used in the various possible combinations of person and number.
The "syntax" attribute is used to specify how the part of speech is used. For example, the part of speech "av-j" is used for adjectives that are used as adverbs. The "syntax" attribute of this part of speech is "av". An example of this part of speech occurs in Act 1, Scene 1 of Hamlet, where Bernardo says "Long live the king!" The word "Long" in this passage in used as an adverb modifying the verb "live" and has the NUPOS part of speech "av-j". Contrast this with the word "long" in Act 3, Scene 1, where Hamlet says "That makes calamity of so long life;". In this passage, the word "long" is tagged with the part of speech "j", the part of speech for "normal" uses of adjectives. Both of the parts of speech "av-j" and "j" have the word class "j" and major word class "adjective", but "av-j" has the syntax attribute "av", while "j" has the syntax attribute "j".
Martin has also mentioned the possibility of more coarse-grained versions of NUPOS, finer grained than word classes but coarser than the full set of 185 parts of speech. These intermediate levels of NUPOS may be useful for data mining and other kinds of analysis. We have not yet worked out the details of this idea.
All of the NUPOS parts of speech are displayed in a table at the end of this note.
Lemmas
A lemma is a dictionary "headword" plus its word class.
For example, consider the verb "love" in Shakespeare. This lemma has the headword "love" and the word class "v". He uses this common lemma in 41 of his 42 works, a total of 1,135 times, in a variety of contexts with quite a few different parts of speech and spellings. For example, he uses it a total of 153 times with the part of speech "vvz", which is the NUPOS part of speech tag for verbs used in the third person singular in the present tense. 150 of these uses are spelled "loves", and three of them are spelled "loveth".
There is, of course, also a noun named "love". In NUPOS, there are two separate lemmas for the headword "love", one for the noun and one for the verb. In general, headwords like "love" are used to form NUPOS lemmas based on their word class, and the word class is listed along with the headword when naming the lemma. In our example, the NUPOS names for the two "love" lemmas are "love (n)" and "love (v)".
The set of all lemmas used in a work or collection of works is called the "lexicon" for the work or collection.
MorphAdorner
Pib's MorphAdorner reads source XML texts, locates sentence and word boundaries, and marks each word with five morphological tags - the three spellings, the NUPOS part of speech, and the lemma headword. For contractions, MorphAdorner emits multiple parts of speech and headwords.
It's important to note that MorphAdorner is more than just a part of speech tagger. It's also a spelling normalizer and a lemma tagger.
This tagging data emitted by MorphAdorner is sufficient to recover all of the information mentioned above for each word and word part, including the major word class, word class, part of speech category values, and lemma (headword plus major word class).
Summary
NUPOS comprises the following objects, attributes, and relationships:
Each word has three spellings: the token, standard original, and standard modern spellings.
Each word has an ordered list of word parts, usually only one except for contractions.
Each word part has a part of speech and a lemma.
Each part of speech has a name, a word class, and values for the grammatical categories of syntax, tense, mood, case, person, number, degree, and negative.
Each lemma has a name, a headword and a word class. The name of each lemma is formed from its headword and the name of its word class.
Each word class has a name and a major word class.
Each major word class has a name.
In a full implementation of NUPOS, all of these objects and their attributes can be used as criteria for searching, grouping, sorting, counting, and analysis.
The following drawing is sometimes useful as a way of summarizing NUPOS. It's not a formal UML diagram, and the drawing has no particular implementation implications, other than as a way of summarizing some of the functionality that any particular full implementation of NUPOS must support. It's just an informal way of making a picture out of the objects, attributes, and relationships enumerated above and described and defined in detail in this note. The double-headed arrow is used to indicate the relationship "may have more than one of", while the single-headed arrow indicates "has one and only one of". The term "list of" in the one-to-many relationship between words and their parts indicates that the parts of a word are ordered - there's a first one, then a second one, and so on. This is important for dealing with n-grams.

Parts of Speech Table
| Name |
Word Class |
Syntax |
Tense |
Mood |
Case |
Person |
Number |
Degree |
Negative |
| . |
pu |
pu |
|
|
|
|
|
|
|
| ; |
pu |
pu |
|
|
|
|
|
|
|
| ( |
|
pu |
|
|
|
|
|
|
|
| ) |
|
pu |
|
|
|
|
|
|
|
| , |
|
pu |
|
|
|
|
|
|
|
| a-acp |
acp |
av |
|
|
|
|
|
|
|
| av |
av |
av |
|
|
|
|
|
|
|
| av-an |
an |
av |
|
|
|
|
|
|
|
| av-c |
av |
av |
|
|
|
|
|
comp |
|
| av-da |
da |
av |
|
|
|
|
|
|
|
| av-j |
j |
av |
|
|
|
|
|
|
|
| av-jc |
j |
av |
|
|
|
|
|
comp |
|
| av-jn |
jn |
av |
|
|
|
|
|
|
|
| av-js |
j |
av |
|
|
|
|
|
sup |
|
| av-n1 |
n |
av |
|
|
|
|
sg |
|
|
| av-s |
av |
av |
|
|
|
|
|
sup |
|
| av-vvg |
v |
av |
pres |
ppl |
|
|
|
|
|
| av-vvn |
v |
av |
past |
ppl |
|
|
|
|
|
| avc-jn |
jn |
av |
|
|
|
|
|
comp |
|
| avs-jn |
jn |
av |
|
|
|
|
|
sup |
|
| avx-da |
da |
av |
|
|
|
|
|
|
no |
| c-acp |
acp |
cs |
|
|
|
|
|
|
|
| c-crq |
crq |
cs |
|
|
|
|
|
|
|
| cc |
cc |
cc |
|
|
|
|
|
|
|
| cc-acp |
acp |
cc |
|
|
|
|
|
|
|
| ccx |
cc |
cc |
|
|
|
|
|
|
nor |
| crd |
nu |
nu |
|
|
|
|
|
|
|
| cs |
cs |
cs |
|
|
|
|
|
|
|
| cst |
cs |
cs |
|
|
|
|
|
|
|
| d |
d |
d |
|
|
|
|
|
|
|
| d-da |
da |
d |
|
|
|
|
|
|
|
| dg |
da |
d |
|
|
gen |
|
|
|
|
| dg-da |
da |
d |
|
|
gen |
|
|
|
|
| dt |
dt |
dt |
|
|
|
|
|
|
|
| dx-da |
da |
d |
|
|
|
|
|
|
no |
| fw-fr |
fr |
fw |
|
|
|
|
|
|
|
| fw-gr |
gr |
fw |
|
|
|
|
|
|
|
| fw-it |
|
fw |
|
|
|
|
|
|
|
| fw-la |
la |
fw |
|
|
|
|
|
|
|
| fw-mi |
fo |
fw |
|
|
|
|
|
|
|
| j |
j |
j |
|
|
|
|
|
|
|
| j-av |
av |
j |
|
|
|
|
|
|
|
| j-jn |
jn |
j |
|
|
|
|
|
|
|
| j-vvg |
v |
j |
pres |
ppl |
|
|
|
|
|
| j-vvn |
v |
j |
past |
ppl |
|
|
|
|
|
| jc |
j |
j |
|
|
|
|
|
comp |
|
| jc-jn |
jn |
j |
|
|
|
|
|
comp |
|
| jc-vvg |
v |
j |
pres |
ppl |
|
|
|
comp |
|
| jc-vvn |
v |
j |
past |
ppl |
|
|
|
comp |
|
| jp |
jp |
jp |
|
|
|
|
|
|
|
| js |
j |
j |
|
|
|
|
|
sup |
|
| js-jn |
jn |
j |
|
|
|
|
|
sup |
|
| js-vvg |
v |
j |
pres |
ppl |
|
|
|
sup |
|
| js-vvn |
v |
j |
past |
ppl |
|
|
|
sup |
|
| lq |
pu |
pu |
|
|
|
|
|
|
|
| n-jn |
jn |
n |
|
|
|
|
|
|
|
| n-vag |
va |
n |
pres |
ppl |
|
|
|
|
|
| n-vvg |
v |
n |
pres |
ppl |
|
|
|
|
|
| n-vvn |
v |
n |
past |
ppl |
|
|
|
|
|
| n1 |
n |
n |
|
|
|
|
sg |
|
|
| n1-an |
an |
n |
|
|
|
|
sg |
|
|
| n1-j |
n |
n |
|
|
|
|
sg |
|
|
| n2 |
n |
n |
|
|
|
|
pl |
|
|
| n2-an |
an |
n |
|
|
|
|
pl |
|
|
| n2-da-x |
n |
n |
|
|
|
|
|
|
no |
| n2-j |
j |
n |
|
|
|
|
pl |
|
|
| n2-jn |
jn |
n |
|
|
|
|
pl |
|
|
| n2-vag |
va |
n |
pres |
ppl |
|
|
pl |
|
|
| n2-vvg |
v |
n |
pres |
ppl |
|
|
pl |
|
|
| ng1 |
n |
n |
|
|
gen |
|
sg |
|
|
| ng1-an |
an |
n |
|
|
gen |
|
sg |
|
|
| ng1-jn |
jn |
n |
|
|
gen |
|
sg |
|
|
| ng2 |
n |
n |
|
|
gen |
|
pl |
|
|
| ng2-jn |
jn |
n |
|
|
gen |
|
pl |
|
|
| njp |
jp |
np |
|
|
|
|
|
|
|
| njp2 |
jp |
np |
|
|
|
|
pl |
|
|
| njpg1 |
jp |
np |
|
|
gen |
|
sg |
|
|
| njpg2 |
jp |
np |
|
|
gen |
|
pl |
|
|
| np-n1 |
n |
np |
|
|
|
|
sg |
|
|
| np-n2 |
n |
np |
|
|
|
|
pl |
|
|
| np-ng1 |
n |
np |
|
|
gen |
|
sg |
|
|
| np1 |
np |
np |
|
|
|
|
sg |
|
|
| np2 |
np |
np |
|
|
|
|
pl |
|
|
| npg1 |
np |
np |
|
|
gen |
|
sg |
|
|
| npg2 |
np |
np |
|
|
gen |
|
pl |
|
|
| ord |
nu |
nu |
|
|
|
|
|
|
|
| p-acp |
acp |
pp |
|
|
|
|
|
|
|
| pc-acp |
acp |
pc |
|
|
|
|
|
|
|
| pi |
pi |
p |
|
|
|
|
|
|
|
| pi2 |
pi |
p |
|
|
|
|
pl |
|
|
| pig |
pi |
p |
|
|
gen |
|
|
|
|
| pix |
pi |
p |
|
|
|
|
|
|
|
| pn22 |
pn |
p |
|
|
|
2nd |
pl |
|
|
| pn31 |
pn |
p |
|
|
|
3rd |
sg |
|
|
| png11 |
pn |
p |
|
|
gen |
1st |
sg |
|
|
| png12 |
pn |
p |
|
|
gen |
1st |
pl |
|
|
| png21 |
pn |
p |
|
|
gen |
2nd |
sg |
|
|
| png22 |
pn |
p |
|
|
gen |
2nd |
pl |
|
|
| png31 |
pn |
p |
|
|
gen |
3rd |
sg |
|
|
| png32 |
pn |
p |
|
|
gen |
3rd |
pl |
|
|
| pno11 |
pn |
p |
|
|
obj |
1st |
sg |
|
|
| pno12 |
pn |
p |
|
|
obj |
1st |
pl |
|
|
| pno21 |
pn |
p |
|
|
obj |
2nd |
sg |
|
|
| pno31 |
pn |
p |
|
|
obj |
3rd |
sg |
|
|
| pno32 |
pn |
p |
|
|
obj |
3rd |
pl |
|
|
| pns11 |
pn |
p |
|
|
subj |
1st |
sg |
|
|
| pns12 |
pn |
p |
|
|
subj |
1st |
pl |
|
|
| pns21 |
pn |
p |
|
|
subj |
2nd |
sg |
|
|
| pns31 |
pn |
p |
|
|
subj |
3rd |
sg |
|
|
| pns32 |
pn |
p |
|
|
subj |
3rd |
pl |
|
|
| po11 |
pn |
p |
|
|
|
1st |
sg |
|
|
| po12 |
pn |
p |
|
|
|
1st |
pl |
|
|
| po21 |
pn |
p |
|
|
|
2nd |
sg |
|
|
| po22 |
pn |
p |
|
|
|
2nd |
pl |
|
|
| po31 |
pn |
p |
|
|
|
3rd |
sg |
|
|
| po32 |
pn |
p |
|
|
|
3rd |
pl |
|
|
| pp |
pp |
pp |
|
|
|
|
|
|
|
| pp-f |
pf |
pp |
|
|
|
|
|
|
|
| px11 |
px |
p |
|
|
|
1st |
sg |
|
|
| px12 |
px |
p |
|
|
|
1st |
pl |
|
|
| px21 |
px |
p |
|
|
|
2nd |
sg |
|
|
| px22 |
px |
p |
|
|
|
2nd |
pl |
|
|
| px31 |
px |
p |
|
|
|
3rd |
sg |
|
|
| px32 |
px |
p |
|
|
|
3rd |
pl |
|
|
| pxg21 |
px |
p |
|
|
gen |
2nd |
sg |
|
|
| q-crq |
crq |
q |
|
|
|
|
|
|
|
| r-crq |
crq |
r |
|
|
|
|
|
|
|
| rq |
pu |
pu |
|
|
|
|
|
|
|
| sy-sy |
sy |
sy |
|
|
|
|
|
|
|
| uh |
uh |
uh |
|
|
|
|
|
|
|
| uh-av |
|
uh |
|
|
|
|
|
|
|
| uh-crq |
crq |
uh |
|
|
|
|
|
|
|
| uh-jn |
jn |
uh |
|
|
|
|
|
|
|
| uh-n |
n |
uh |
|
|
|
|
sg |
|
|
| uh-v |
v |
uh |
|
inf |
|
|
|
|
|
| uh-x |
da |
uh |
|
|
|
|
|
|
no |
| va2 |
va |
va |
pres |
|
|
2nd |
sg |
|
|
| va2-imp |
va |
va |
pres |
impt |
|
2nd |
pl |
|
|
| va2x |
va |
va |
pres |
|
|
2nd |
sg |
|
not |
| vab |
va |
va |
pres |
|
|
|
|
|
|
| vad |
va |
va |
past |
|
|
|
|
|
|
| vad2 |
va |
va |
past |
|
|
2nd |
sg |
|
|
| vad31 |
va |
va |
|
|
|
3rd |
sg |
|
|
| vadp |
va |
va |
past |
|
|
|
pl |
|
|
| vadx |
va |
va |
past |
|
|
|
|
|
not |
| vag |
va |
va |
pres |
ppl |
|
|
|
|
|
| vai |
va |
va |
|
inf |
|
|
|
|
|
| vam |
va |
va |
pres |
|
|
1st |
sg |
|
|
| vamx |
va |
va |
pres |
|
|
1st |
sg |
|
not |
| van |
va |
va |
past |
ppl |
|
|
|
|
|
| vap |
va |
va |
pres |
|
|
|
pl |
|
|
| vaz |
va |
va |
pres |
|
|
3rd |
sg |
|
|
| vazx |
va |
va |
pres |
|
|
3rd |
sg |
|
not |
| vm2 |
vm |
vm |
pres |
|
|
2nd |
sg |
|
|
| vm2x |
vm |
vm |
pres |
|
|
2nd |
sg |
|
not |
| vmb |
vm |
vm |
pres |
|
|
|
|
|
|
| vmb1 |
vm |
vm |
pres |
|
|
1st |
sg |
|
|
| vmbx |
vm |
vm |
pres |
|
|
|
|
|
not |
| vmd |
vm |
vm |
past |
|
|
|
|
|
|
| vmd2 |
vm |
vm |
past |
|
|
2nd |
sg |
|
|
| vmd2x |
vm |
vm |
past |
|
|
2nd |
sg |
|
not |
| vmd31 |
vm |
vm |
past |
|
|
3rd |
sg |
|
|
| vmdp |
vm |
vm |
past |
|
|
|
pl |
|
|
| vmdx |
vm |
vm |
past |
|
|
|
|
|
not |
| vmi |
vm |
vm |
|
inf |
|
|
|
|
|
| vmn |
vm |
vm |
past |
ppl |
|
|
|
|
|
| vmp |
vm |
vm |
pres |
|
|
|
pl |
|
|
| vv2 |
v |
v |
pres |
|
|
2nd |
sg |
|
|
| vv2-imp |
v |
v |
pres |
impt |
|
2nd |
pl |
|
|
| vv2x |
v |
v |
pres |
|
|
2nd |
sg |
|
not |
| vvb |
v |
v |
pres |
|
|
|
|
|
|
| vvbx |
v |
v |
pres |
|
|
|
|
|
not |
| vvd |
v |
v |
past |
|
|
|
|
|
|
| vvd2 |
v |
v |
past |
|
|
2nd |
sg |
|
|
| vvd2x |
v |
v |
past |
|
|
2nd |
sg |
|
not |
| vvdp |
v |
v |
past |
|
|
|
pl |
|
|
| vvdx |
v |
v |
past |
|
|
|
|
|
not |
| vvg |
v |
v |
pres |
ppl |
|
|
|
|
|
| vvi |
v |
v |
|
inf |
|
|
|
|
|
| vvn |
v |
v |
past |
ppl |
|
|
|
|
|
| vvp |
v |
v |
pres |
|
|
|
pl |
|
|
| vvz |
v |
v |
pres |
|
|
3rd |
sg |
|
|
| xx |
xx |
xx |
|
|
|
|
|
|
not |
| z-sy |
ch |
sy |
|
|
|
|
|
|
|
| zz |
|
zz |
|
|
|
|
|
|
|
In this table, a few of the parts of speech have missing word classes. This is a bug that Martin will fix.
|