265x Filetype PDF File size 0.58 MB Source: aclanthology.org
GF Modern Greek Resource Grammar
Ioanna Papadopoulou
University of Gothenburg
ioannapapa78@hotmail.com
Abstract whilst each of the syntactic parts of the sentence
(subject, object, predicate) is a carrier of a certain
The paper describes the Modern Greek (MG) case, a fact that allows various word order
Grammar, implemented in Grammatical structures. In addition, the language presents a
Framework (GF) as part of the Grammatical dynamic syllable stress, whereas its position
Framework Resource Grammar Library depends and alternates according to the
(RGL). GF is a special-purpose language for morphological variations. Moreover, MG is one
multilingual grammar applications. The RGL of the two Indo-European languages1 that retain a
is a reusable library for dealing with the productive synthetic passive formation. In order
morphology and syntax of a growing number to realize passivization, verbs use a second set of
of natural languages. It is based on the use of morphological features for each tense.
an abstract syntax, which is common for all
languages, and different concrete syntaxes 2 Grammatical Framework
implemented in GF. Both GF itself and the
RGL are open-source. RGL currently covers GF (Ranta, 2011) is a special purpose
more than 30 languages. MG is the 35th programming language for developing
language that is available in the RGL. For the
purpose of the implementation, a morphology- multilingual applications. It can be used for
driven approach was used, meaning a bottom- building translation systems, multilingual web
up method, starting from the formation of gadgets, natural language interfaces, dialogue
words before moving to larger units systems and natural language resources. GF is
(sentences). We discuss briefly the main capable of parsing and generating texts, while
characteristics and grammatical features of working from a language-independent
MG, and present some of the major difficulties representation of meaning. The GF Grammar is
we encountered during the process of based on two different modules. An abstract
implementation and how these are handled in module provides category and function
the MG grammar. declarations, thus it constitutes a representation
1 Introduction of a set of possible trees that reflect the
semantically relevant structure of a language,
Greek is a member of the Indo-European family and one or more concrete modules that contain
of languages and constitutes by itself a separate linearization type definitions and rules, therefore
branch of that family. Modern Greek (MG) can managing to relate the tree structures with linear
be easily traced back to Ancient Greek in the tree representations. The RGL contains the set of
form of letters, word roots and structures, despite grammars of natural languages that are
the fact that the language has undergone a series implemented in GF. The parallelism of the
of transformations through the ages and has been grammars is inevitable, given that their
a subject of considerable simplification. MG development is based on the same rules and
makes use of the Greek alphabet since the 8th functions that are defined in a common abstract
century B.C. Today the language is spoken by syntax. At the moment RGL covers 34
approximately 13.1 million people worldwide. languages2 that originate not only from the
Some of the general characteristics of MG refer European continent, but from all over the world.
to the diversity of the morphology and the use of The common API defines around 60 hierarchical
an extremely large number of morphological
features in order to express grammatical 1 The other one being Albanian
notations. Words are in their majority declinable, 2 http://www.grammaticalframework.org/lib/doc/status.html
126
Proceedings of the Student Research Workshop associated with RANLP 2013, pages 126–133,
Hissar, Bulgaria, 9-11 September 2013.
grammatical categories, and a large number of all the specific features which may affect their
syntactic functions. MG constitutes the newest formation. The creation of nouns includes 17
addition to the RGL and its implementation distinct functions that are categorized depending
consists of 28 concrete modules. on the noun ending, the stress movement,
3 Morphology whether the noun is parisyllabic or
imparisyllabic, or whether the noun augments its
Morphology constitutes the most important syllables when inflected. These functions also
aspect of the Greek Language. The words are in handle specific phenomena of the MG language,
their majority declinable, produced via a such as the change of gender of a noun in the
combination of meta-linguistic elements, such as plural form, or nouns that originate from Ancient
a stem and an ending. The endings are assigned Greek, and are still used nowadays, retaining
proportionally with the part of speech and the intact their form and endings. Similarly 6
type, and act as carriers of grammatical functions create adjectives, where we also
notations, indicating the gender, the number, the introduce the degree parameter that creates
case or the person, or in the case of verbs the additional forms for all three adjective genders.
tense, the mood, the voice and the aspect as well. The formation of the pronouns is of special
Appendix A presents the parameter types and interest, as MG makes use of two distinct types,
the emphatic and the weak. The weak form3
operations that are defined in the grammar. The occurs more often, whilst the use is always in
implementation of the GF MG morphology close connection with verbs, nouns or adverbs.
started from scratch. All declinable words Our grammar introduces both forms of the
needed to undergo a first simplistic pronoun, but it also alternates between them
categorization in order to create basic declension when the syntactic structure requires the use of a
tables, before moving to sub-categorizations that particular form. Greek proper nouns follow all
allowed us to treat the various irregularities that the declension patterns and irregularities of
govern the morphological structure of MG. One common nouns morphology, meaning that they
of the main aspects of MG is the presence of a are primarily inflected for gender, case and
dynamic syllable stress, a phenomenon that number. Moreover, they present a major
created additional difficulties in the differentiation comparing to other languages,
implementation of the morphology. A stress can which refers to the introduction of the proper
move from a stem to an ending but in many cases noun with a definite article that takes its form
the movement is realized inside the stem. Such according to the grammatical features of the
issues are handled in GF with the introduction of modified proper noun. The morphology of the
pattern matching functions and pattern macros. verb in MG consists of a complex inflection
The MG grammar includes 25 pattern matching system, as shown in Appendix B. Whilst in many
functions and macros that indentify stressed languages, the grammatical notations are
vowels, while at the same time they perform over expressed with the use primarily of syntax, MG
a string, checking the matches, transforming the uses the combination of a stem and an
stressed vowels into their unstressed form, and inflectional ending to express grammatical
assigning the stress to the correct character. They categories such as person, number, tense, voice,
also serve to assigning the appropriate case aspect and mood. The fact that MG retains a
ending or handle irregularities, such as the productive synthetic passive formation increases
addition of extra consonants and reduplication drastically the number of possible forms of the
cases. verb, as most verbs have a second set of
3.1 Declinable Parts of Speech morphological forms for each tense in order to
All nouns, proper nouns, adjectives, determiners, express passivization. Whilst Greek verbs are
quantifiers, pronouns, participles, articles and divided in two main categories, the second one
verbs in MG are declinable and each category subdivided into two smaller ones, (Holton et al
presents its own characteristics and irregularities. ,1999; Iordanidou, 1999), the creation of the verb
The implementation of the above categories morphology in our grammar imposed the
follows a similar pattern: we first divide them consideration of a number of specific parameters,
into the main conjugations that grammars among them the stress movement, the number of
propose and then we make an exhaustive list of syllables which affects on the creation of the
all the rules that specify their creation, as well as
3 Clitic pronoun
127
imperative forms, the active stem forms upon oper
which we create the passive stems, the formation Noun : Type = {s : Number => Case => Str ; g :
of the passive perfective participle, reduplication Gender} ;
patterns, internal augmentation phenomena. In
addition to the above, we needed to handle mkNoun_agori : (s: Str) -> Gender -> Noun =
various irregularities, which referred mainly to \agOri, g ->
the formation of the imperative or dependent let
forms, the passivization or not of the verb, the agori = mkStemNouns agOri;
occurrence of a participle, the formation of the in {
s = table { Sg => table {
active or passive simple past with the use of Nom | Acc | Vocative|CPrep P_se |CPrep PNul =>
ending forms borrowed from Ancient Greek. All agOri ;
the above parameters resulted in the creation of Gen |CPrep P_Dat=> mkGenSg agori} ;
26 main functions that handle the changes in the Pl => table {
inflected endings of the verbs, and 39 smaller Nom | Acc | Vocative|CPrep P_se |CPrep PNul =>
functions that are connected to the main mkNomPl agOri;
functions and help us handle the modifications Gen |CPrep P_Dat=> mkGen agOri}} ; g = g } ;
that the stem is subjected to, when conjugated.
Moreover, we must emphasize on the necessity mkStemNouns : Str -> Str = \s -> case s of {
to create a series of pattern matching functions c + v@(#stressedVowel) + x@(_ + _) =>c + unstress
that form and alter stems, for the production of v + x } ;
the passive perfective according to the active mkGenSg : Str -> Str = \s ->
perfective or imperfective, the passive imperative case s of
and the participles. A separate concrete module {x + "ος" => x + "οσς"; .............
was created in order to deal exclusively with the x + ("ι" | "σ") => x + "ιού"; };
complex MG verb morphology. Finally, as in the
case of personal pronouns, another alternation mkGen : Str -> Str = \s -> case s of {
appears in the formation of the possessive c + "άι" => c + "αγιών" ; .............
pronouns. Weak and emphatic forms of the c + v@(#stressedVowel) + x@(_ + _) + ("ι" | "σ")
possessive pronoun are both used in order to =>c + unstress v + x + "ιών" ; ............. } ;
express possession. The first one being the
stressedVowel : pattern Str = #("ά" | "ό" | "ί"| "έ" |
enclitic genitive form of the personal pronoun, "ή" | "ύ"| "ώ" | "εύ");
while the latter one, expressed via a combination
of the definite article, the adjective δικός dikós stress : Str -> Str = \x -> case x of {
“own” and the enclitic genitive form of the "α" => "ά" ;
personal pronoun. Both forms are assigned via "ο" => "ό" ; ........ };
two different functions, defined in the abstract MG Paradigms : Paradigms module
syntax: mkN = overload {
PossPron : Pron -> Quant ; mkN : (dentro : Str) -> N
PossNP : CN -> NP -> CN ; = \n -> lin N (regN n) ;
Table 1 presents an example of the main mkN : (s : Str) -> Gender -> N
procedure, based on which we created the noun = \n,g -> lin N (mkN1 n g) ;..................};
morphology and it is also representative of the
process that was followed in order to handle the mkN1 : Str -> Gender -> N = \x,g ->
morphology of the main declinable parts of case x of {................
c + ("ι"|"σ"|"όι"|"άι") => mkNoun_agori x g ;
speech. The example concerns the creation of ................ } ** {lock_N = <>} ;
nouns of neuter gender, ending in –ι, such as the
noun αγόρι agóri “boy”. Lexicon :abstract fun boy_N : N ;
Lexicon MG lin boy_N = mkN "αγόρι"
Common abstract grammar : categories Neut;
Cat N ; Lexicon English lin boy_N = mkN masculine
MG Resource grammar : Resource module (regN "boy") ;
Param Parsing into the abstract categories
Number = Sg | Pl ; Lang> parse –cat=N –lang=Gre “αγοριών”
Case = Nom | Gen | Acc | Vocative |CPrep Prepos; boy_N
Gender = Masc | Fem | Neut | Change;
Lang> parse –cat=N –lang=Eng “boys’”
128
boy_N Future and denoting whether the action
Generating the full inflectional paradigms expressed by the verb is viewed either as
Lang> linearize -lang=Gre -table boy_N occurring repeatedly, as a completed event, or as
s Sg Nom : αγόρι an event completed in the past, whose
s Sg Gen : αγοριού completion is relevant to some other point in
s Sg Acc : αγόρι time. Noun phrases (NP) represent the subject of
s Sg Vocative : αγόρι the sentence and thus, they appear in the
s Sg (CPrep P_se) : αγόρι nominative case, while agreement rules pass the
s Sg (CPrep PNul) : αγόρι grammatical features of the NP to the form of the
s Sg (CPrep P_Dat) : αγοριού verb. For the creation of the predication rule in
s Pl Nom : αγόρια our grammar, which forms a clause, we needed
s Pl Gen : αγοριών
s Pl Acc : αγόρια to take into consideration the presence of subject
s Pl Vocative : αγόρια NPs that present a negative connotation (i.e.
s Pl (CPrep P_se) : αγόρια κανένας kanénas “nobody”) and impose the use
s Pl (CPrep PNul) : αγόρια of a negative polarity in the clause. Accordingly,
s Pl (CPrep P_Dat) : αγοριών we are making a distinction between the different
Lang> linearize -lang=Eng -table boy_N moods, in order to assign the relevant particles
s Sg Nom : boy that introduce the clause and which also vary
s Sg Gen : boy's depending on the polarity. Interrogative
s Pl Nom : boys sentences do not defer from declarative
s Pl Gen : boys' sentences, in the sense that they use the exact
Table 1: The Noun Morphology same rules applied in declarations, while they are
4 Syntax simply characterized by the addition of the
The GF abstract syntax provides rules for all the question mark (;). Wh–questions are introduced
common phrase structures: noun phrases with an interrogative word which may be
(constructed of pronouns, proper nouns or indeclinable τι ti “what” or declinable for gender,
common nouns and their modifiers), adjectival number and case: ποιός-ποιά-ποιό poiós-poiá-
and verb phrases with their complements. The poió “who”. The selection of the appropriate
MG grammar covers all the above structures and gender of the interrogative word in our grammar
successfully correlates the language with the is a subject of interest. Whilst in most cases the
various languages included in the RGL. Due to masculine gender is used as an abstract gender
the fact that MG is a highly inflected language when introducing wh-questions, in particular
and given that the various morphological features contexts, when the gender of the subject under
express grammatical notations, the word order in question is known, the interrogative word should
a phrase is relatively free. Although all six be labeled by the gender of the known subject,
logical permutations of the major clausal without that implying that the use of the
constituents are usually considered masculine gender in such cases in considered
4 semantically incorrect. Relative clauses on the
grammatically correct (Tzanidaki, 1995), SVO other hand, present a more complex syntactic
remains the predominant word order. The structure and a number of possible alternations,
implemented rules in our grammar cover mainly as they are introduced by two main types of
the most common word order, unless the relative markers: the indeclinable ποσ pou “that,
syntactic mechanisms of the phrase itself require which” or the declinable relative pronoun ο
otherwise. οποίος o opoíos “which”. The MG grammar
4.1 Clauses provides both forms and utilizes the two different
relative markers, as the form alternates when its
The formation of the clause relies on a number of syntactic function in the relative clause requires a
parameters, namely the order, the tense, the genitive, or when it appears in a prepositional or
polarity and the mood. In main indicative clauses adverbial phrase. The antecedent of a relative
the tense defines the point of time of the verb in sentence might appear in the form not only of a
relation to the time of speaking. MG has 8 noun phrase but also of a sentence, as in the
tenses that are divided in three major categories: phrase “She sleeps, which is good”. When the
those that refer to the Present, the Past and the antecedent is sentential, the relative clause can be
introduced either with ποσ pou “that” or with the
4 Subject-Verb-Object relative pronoun ο οποίος o opoíos “which”,
129
no reviews yet
Please Login to review.