Korean IL0 Manual
Features
on All Nodes
Nouns
Verbs and Auxiliary verbs
Adjectives and
Adverbs
Copular
Constructions
Conjunctions
Punctuation
Each node in the dependency tree
can be thought of as an attribute-value matrix, i.e., a bundle of features with
values. All values must be set for each node in the tree. This will require
checking each node before finishing the analysis. Here is a list of features:
Position (wpos). The linear position of the word in the sentence. This should not be modified or annotated, except for new empty nodes created by the annotator, which should always be given the wpos 100.
Word (lex). This is the inflected word form associated with the node. It is almost always correctly displayed already.
Part-of-Speech (POS). This is the lexical class, taken from a short list. Example: verb. Specific options:
¡Æ
V --
verbs
¡Æ
N --
common nouns
¡Æ
PN
-- proper nouns
¡Æ
Adj
-- adjectives (including indicative adjectives)
¡Æ
Adv
-- adverbs (including conjunctive adverbs)
¡Æ
Conj
– coordinating conjunctions
¡Æ
Pun
-- punctuation marks
¡Æ
Sym
-- various symbols (dollar signs and the like)
¡Æ
Uh
-- speech-specifc sounds, even if meaningful (such as
/UH HUH/)
¡Æ
Misc
-- everything else
Base form (Root). This is the base form (lexeme) of the inflected form. A first "guess" will be included, which needs to be checked and corrected.
English Translation (Trans). This is an English translation for the given word and this should be added. If there exist more than one proper translation, all they should be given in the form of "gloss:translation1/translation2/translation3".
Deep role (DRole). This is the role of the node with respect to its mother, in some deeper representation. This is a little murky. We will use strictly syntactic criteria. DRole reflects the argument patterns of the the verb if it were in its active, non-ergative form.
¡Æ
Subj
-- subject
¡Æ
Obj
-- object
¡Æ
Obj2
-- indirect object
¡Æ
Mod
-- Modifier
¡Æ
Root
-- Root of sentence
Done. This feature is only a check to make sure that the default values have been checked. Set it to "Y" when you are done with the features for one node.
Noun is (almost always) combined with Particle(we may call it noun-suffix). Particle gives grammatical function to noun such as conjunction, possession, object, subject, location, time, comparison, and so on. We do not analyze particle at this point. We have a single node for a noun+particle and set noun as a base form of the node.
Nominal
modifiers
The head of a noun phrase is the
head noun. Any adnominals (e.g. demonstrative (pro)nouns ÀÌ ¡°this¡±, Àú
¡°that¡±, ±× ¡°the¡±) and adjectives are dependents of
the noun. If there are multiple adjectives, the default structure will simply
have each adjective as a direct dependent of the noun. For example, both the
demonstrative adnominal ÀÌ "this" and the adjectival
noun-based adnominal ÀÛÀº "small" are direct dependents of
the noun µµ½Ã "city" in the noun phrase ÀÌ ÀÛÀº µµ½Ã
"this small city".
Compound
Nouns
Compound noun phrases, when
clear, can have multiple noun phrases as dependents. For example, ÇÒÀÎÇ×°ø¿ä±Ý ¡°discount air fare¡± will have ¿ä±Ý ¡°fare¡± as the head and ÇÒÀÎ ¡°discount¡± and Ç×°ø ¡°air¡±
as its direct dependents. A good test for this is to remove each noun in turn,
to see if the phrase still retains part of its original sense. Because ÇÒÀÎÇ×°ø¿ä±Ý ¡°discount air fare¡± is a air fare and a
discount fare, this analysis is the one we want.
In contrast, a phrase like Ç×°ø¿ä±ÝÇÒÀÎ ¡°air fare discount¡± should be annotated
with ÇÒÀÎ ¡°discount¡± as the head, ¿ä±Ý ¡°fare¡± as its dependent, and Ç×°ø ¡°air¡± as a dependent of ¿ä±Ý ¡°fare¡±.
In cases where it's not clear
whether or which nouns modify each other, the default compound structure will
have all modifying nouns as direct dependents on the rightmost noun.
Proper
Nouns
Proper nouns should have the
value PN for feature POS. They are treated largely like nouns, except that
compound proper nouns are not analyzed syntactically as if they were common
nouns, but rather given single nodes. (The intuition is that they are really
fixed phrases.)
Dependent
Nouns (Functional Nouns)
Dependent nouns are nouns because they work as nouns in a sentence, but they are dependent because they are required to be modified by VP, ADJP, or NP, which means they cannot be used independently. Dependent nouns sometimes add minor meanings (e.g. »Ó ¡°only¡±), but mainly they are used in functional purpose. For example, there are ºÐ, °Í, ¹Ù, µûÀ§ (dependent noun-general), Áö, ¼ö, ¸® (dependent noun-subjective), µû¸§, »Ó, ÅÍ (dependent noun- descriptive), ÁÙ (dependent noun-objective), µí, ¸¸Å (dependent noun-adverbial), and ¸í, °³, ±×·ç (dependent noun-unit).
Dependent noun phrase can be subject, object or any other part by adding an appropriate particle (noun-suffix) as other noun phrases, but it is often translated into gerund, to-infinitive, or relative clause (with ¡°what¡±) in English. For example, the dependent noun °Í "thing" (actually it's not really "thing" since it is not used independently as "thing") is used in the following.
±×´Â ¾Æ¸§´Ù¿î °ÍÀ» ÁÁ¾ÆÇÑ´Ù.
"He beautiful thing like" (He likes beautiful things)
±×¸¦ ¼³µæÇÏ´Â °ÍÀº ºÒ°¡´ÉÇÏ´Ù. "him persuade thing+eun impossible" (To persuade him is impossible)
³»°¡ ÇÒ ¼ö ÀÖ´Â °ÍÀº ÀÌ°Í »ÓÀÌ´Ù. "I do can thing this only be" (What I can do is this)
Dependent noun is head of the phrase and VP, ADVP, or NP modifying this dependent noun is a dependent of the noun.
Numeral
Nouns and Counters
Numeral nouns such as ÀÌ½Ê ¡°twenty¡±, ½Ê»ç
¡°fourteen¡± are usually followed by counters such as ¸í
¡°people¡±, ³â ¡°year¡±, ¹ø
¡°times¡±. The counter is one kind of
dependent nouns (unit) and it is head of numeral noun phrase as explained in Dependent Nouns section. So in ¾ÆÈ© ¸í ¡°nine
people¡±, ¡°people¡± is head and ¡°nine¡± modifies ¡°people¡± giving the information
on how many people are there.
Numeral noun phrases may be
preceded by adjectives such as Á¦
¡°ordinal number¡±, ¾à ¡°about¡±, ÃÑ ¡°in
total¡±. For example, in the phrase
of ¾à ÀÌ½Ê ¸í ¡°about
twenty people¡±, both ¡°about¡± and ¡°twenty¡± are independent dependents of
¡°people.¡±
Particles
(Noun-suffixes, postpositions)
Nouns are followed by particles
to have grammatical functions in a sentence. There are many kinds of particles such
as °¡ ¡°ka¡± (subjective), ¸¦ ¡°reul¡± (objective), ¿¡°Ô
¡°eghye¡±
(indirect objective), ¿¡¼
¡°eseo¡±
(location). They don¡¯t appear as
separate nodes but in one node with nouns.
(see the first part of Nouns section)
Verbs
are heads of sentences and clauses.
Choosing
a Head
The head of any complete clausal
utterance is the main verb. Incomplete utterances (NPs) should have as their head
the usual head for that type of phrase. Verb form changes by adding a suffix to
the stem word. Different suffix can make a verb into different closing forms
(declarative, interrogative, imperative, exclamatory), conjunctive forms
(coordinate, subordinate), or derivative forms (noun, adjective).
Verbs are often followed by
auxiliary verbs to add more meanings such as negative (¾Ê´Ù
¡°not¡±), continuous (ÀÖ´Ù ¡°ing¡±), in
these cases, auxiliary verbs are dependents of a main verb.
Arguments
and adjuncts
In distinguishing between
arguments and adjuncts, consistency is the most important thing. This
distinction will matter most for annotating empty categories. In addition, each
argument will be annotated with a feature encoding its grammatical role. All non-arguments
will be annotated as adjuncts, including function words.
Grammatical
relations
The role of each argument
(subject, object, indirect object) must be annotated as a feature of its node.
Empty
categories and missing constituents
Missing arguments will appear as
empty nodes. Missing adjuncts will not.
¡¤
Empty nominal nodes: big-PRO:
1.
These
are cases of empty nodes where the meaning can be derived from the syntactic
context: when a sentence contains subordinate sentences (phrases). Please see this example sentence, °¡ÀÔÀÚ°¡ ¹øÈ£ À̵¿À»
½ÅûÇÏ¸é »ç¾÷ÀÚ¸¦ º¯°æÇÒ
¼ö ÀÖ´Ù
¡°subscriber number move request provider change can¡± ( If a subscriber requests
to change a number, he can change the provider.) In this sentence, the phrase °¡ÀÔÀÚ°¡ ¹øÈ£ À̵¿À»
½ÅûÇÏ¸é ¡°If a subscriber requests to change a
number¡± is a subordinate of the phrase »ç¾÷ÀÚ¸¦
º¯°æÇÒ ¼ö ÀÖ´Ù. ¡°can change the provider¡±. The subject °¡ÀÔÀÚ°¡ ¡°subscriber¡± is a subject for both phrases, so we introduce
an empty node for the subordinate phrase and identify the node with which it is
co-referential. We then copy the
co-referential node's word and lexeme values to the empty node, but add
brackets around the value: "<°¡ÀÔÀÚ°¡>".
¡¤
Empty nominal nodes: little-pro, missing
argument in passive
1.
The
missing argument in agentless passives. For example, in ³ª¿¡°Ô Ã¥ÀÌ
ÁÖ¾îÁ³´Ù "I was given a book", we know
from syntax that there is another argument role which is not explicitly filled,
namely the ddep subject. This role is added at IL0. We cannot tell syntactically what this
is, only pragmatically.
2.
Arbitrary
empty subjects, usually in adjunct clauses. For example, in "Eating jicama is good for children's brain growth", the
subject of "eating" is not specified.
In these cases, we label both
the lexeme and the word feature of the new node "<pro>". In case of doubt ("<pro>"
or "<°¡ÀÔÀÚ°¡>"), ask yourself: can I tell
from syntax alone what this node means?
If no, <pro>". If yes, fill in the lexeme.
Questions
Questions are treated as any other sentence, with the question word taking the same position and grammatical relation as the answer to the question would take.
Imperatives
If an overt subject is not
present, include an empty noun; otherwise an imperative will have the same
analysis as a declarative sentence.
As mentioned in Choosing a Head
section, the auxiliary verbs are dependents of a main verb. For example, in ²ÉÀ» ÆÈ¾Æ
ÁÖ¼¼¿ä ¡°flowers sell –juseyo¡±, ÆÈ¾Æ
¡°sell¡± is a main verb and ÁÖ¼¼¿ä ¡°-juseyo¡± is an
auxiliary verb adding a meaning of ¡°do something for others¡±, yes, the sentence
means ¡°Buy flowers¡±. So, ÆÈ¾Æ
¡°sell¡± is head of the
sentence.
Passive
A sentence can have passive meaning by changing a suffix of a verb. Following the meaning, the causer is given a role of subject and the caused is given a role of object. For example, ±×°¡ ³ª¸¦ ¹«½ÃÇÏ´Ù ¡°he me ignore¡± (He ignores me) can be changed into passive form of ³ª´Â ±×¿¡°Ô ¹«½Ã´çÇÑ´Ù. ¡°I him-from ignore-danghanda¡± (I am ignored by him), and both sentences have a subject of ¡°he¡± and an object of ¡°I¡±.
Causative
A sentence can have causative
meaning by changing a suffix of a verb. To give a proper role for each, make a
separate node for causing suffix and let it be a head of the sentence. The dependents will be the subject and the
caused action. For example, in ³»°¡ ±×¸¦ °øºÎ½ÃŲ´Ù ¡°I
him study-sikinda¡±
(I make him study), ½ÃŲ´Ù ¡°-sikinda¡± (make) is the head of
the sentence, and ³»°¡ ¡°I¡± and ±×¸¦
°øºÎ ¡°him study¡± are dependents of the head as a subject and an object
respectively. ±×¸¦ °øºÎ
¡°him study¡± are analyzed as °øºÎ ¡°study¡± is a head of the phrase and ±×¸¦ ¡°him¡±
is a dependent as a subject of the head ¡°study¡±.
Non-finite
clauses
There is no non-finite clause in
Korean.
Relative
clauses
There is no relative clause in
Korean.
Adjectives
Adjectives modify nouns. For
example, in the phrase of ¾Æ¸§´Ù¿î
¼Ò³à ¡°beautiful girl¡±, the adjective ¾Æ¸§´Ù¿î ¡°beautiful¡± modifies ¼Ò³à ¡°girl¡± by identifying the appearance of
the girl.
Predicate Adjectives
Adjectives can be predicates in
sentences. In this case, adjectives
almost look like verbs grammatically, but they are not about action or movement
but about quality or status.
Predicate adjectives can have different forms by changing a suffix like
verbs, but they show slightly different conjugation from verbs. Predicative
adjectives are simply thought as ¡°be + adjective¡± in English. For example, in ±×³à´Â ¾Æ¸§´ä´Ù ¡°she beautiful¡± (She is beautiful), ¾Æ¸§´ä´Ù ¡°beautiful¡± is head of a sentence.
Auxiliary Adjectives
As auxiliary verbs, there are
auxiliary adjectives. They can
follow verbs or predicate adjectives, and they add meanings about status. They are dependent of main verbs or
adjectives. For example, in ÇÏ°í ½Í´Ù
¡°do want¡± (want to do), ÇÏ°í ¡°do¡± is head.
Adverbs
Adverbs modify verbs. For
example, ±×³à´Â ¿ì¾ÆÇÏ°Ô ÃãÃè´Ù
"she danced gracefully.", the adverb ¿ì¾ÆÇÏ°Ô "gracefully" modifies the verb by specifying the
manner in which the action was performed.
When a noun is used as a
predicate, copula ¡°ÀÌ´Ù¡± is added to the end of the noun. The predicate is a head of the sentence
and the subject will have a role of ¡°subj¡±. For example, in ³ª´Â ÇлýÀÌ´Ù
¡°I student-ida¡± (I am a student), ÇлýÀÌ´Ù ¡°student-ida¡± is the head.
Conjunction has its own part-of-speech (Conj).
The
coordinating conjunction is placed as a dependent of the first conjunct with
role Mod, and the second conjunct is a dependent of the conjunction with role Obj. If a comma
acts as a conjunction, it is treated as such (given part-of-speech Conj and
analyzed as in the above sentence).
If noun particle or verb suffix is used to connect two phrase or
sentences without explicit conjunction, the second conjunct is a dependent of the
head of the first conjunct.
Subordinating
conjunctions are dependents the head of the principal clause in the sentence.
Subordinates are dependents of the conjunctions.
Remove all punctuation, except meaningful punctuation.
Examples:
Quotes -- leave them (open and closed) attached to the constituent which is quoted. If the quoted passage is not a constituent, quote each pience separately.
Commas -- leave them when they are used as conjunction.
Do remove:
All non-conjunction commas.
All sentence-final punctuation.
All dashes and so on.