Pregroup grammar

Pregroup grammar (PG) is a grammar formalism intimately related to categorial grammars. Much like categorial grammar (CG), PG is a kind of type logical grammar. Unlike CG, however, PG does not have a distinguished function type. Rather, PG uses inverse types combined with its monoidal operation.

Definition of a pregroup

A pregroup is a partially ordered algebra (A, 1, \cdot, -^l, -^r, \leq) such that (A, 1, \cdot) is a monoid, satisfying the following relations:

The contraction and expansion relations are sometimes called Ajdukiewicz laws.

From this, it can be proven that the following equations hold:

x^l and x^r are called the left and right adjoints of x, respectively.

The symbol \cdot and \leq are also written \otimes and \to respectively. In category theory, pregroups are also known as autonomous categories[1] or (non-symmetric) compact closed categories.[2] More typically, x \cdot y will just be represented by adjacency, i.e. as xy.

Definition of a pregroup grammar

A pregroup grammar consists of a lexicon of words (and possibly morphemes) L, a set of atomic types T which freely generates a pregroup, and an relation : that relates words to types. In simple pregroup grammars, typing is a function that maps words to only one type each.

Examples

Some simple, intuitive examples using English as the language to model demonstrate the core principles behind pregroups and their use in linguistic domains.

Let L = {John, Mary, the, dog, cat, met, barked, at}, let T = {N, S, N0}, and let the following typing relation holds:

\text{John} : N \qquad \text{Mary} : N \qquad \text{the} : N \cdot N_0^l \qquad \text{dog} : N_0 \qquad cat : N_0
met : N^r \cdot S \cdot N^l \qquad barked : N^r \cdot S \qquad at : S^r \cdot N^{rr} \cdot N^r \cdot S \cdot N^l

A sentence S that has type T is said to be grammatical if T \leq S. We can prove this by use of a chain of \leq. For example, we can prove that John\ met\ Mary : N \cdot N^r \cdot S \cdot N^l \cdot N is grammatical by proving that N \cdot N^r \cdot S \cdot N^l \cdot N \leq S:

N \cdot N^r \cdot S \cdot N^l \cdot N ~\leq~ S \cdot N^l \cdot N ~\leq~ S

by first using contraction on N \cdot N^r and then again on N^l \cdot N. A more convenient notation exists, however, that indicates contractions by connecting them with a drawn link between the contracting types (provided that the links are nested, i.e. don't cross). Words are also typically placed above their types to make the proof more intuitive. The same proof in this notation is simply

A more complex example proves that the dog barked at the cat is grammatical:

Historical notes

Pregroup grammars have been introduced by Joachim Lambek in 1993 as a development of his syntactic calculus, replacing the quotients by adjoints.[3] Such adjoints had already been used earlier by Harris but without iterated adjoints and expansion rules. Adding such adjoints was interesting to handle more complex linguistic cases, where the fact that a^{ll} \neq a is needed. It was also motivated by a more algebraic viewpoint: the definition of a pregroup is a weakening of that of a group, introducing a distinction between the left and right inverses and replacing the equality by an order. This weakening was needed because using types from a free group would not work: an adjective would get the type N \cdot N^{-1} = 1, hence it could be inserted at any position in the sentence.[4]

Pregroup grammars have then been defined and studied for various languages (or fragments of them) including English,[5] Italian,[6] French,[7] Persian[8] and Sanskrit.[9] Languages with a relatively free word order such as Sanskrit required to introduce commutation relations to the pregroup, using precyclicity.

Semantics of pregroup grammars

Because of the lack of function types in PG, the usual method of giving a semantics via the λ-calculus or via function denotations is not available in any obvious way. Instead, two different methods exist, one purely formal method that corresponds to the λ-calculus, and one denotational method analogous to (a fragment of) the tensor mathematics of quantum mechanics.

Purely formal semantics

The purely formal semantics for PG consists of a logical language defined according to the following rules:

Some examples of terms are f(x), g(a,h(x,y)), g(x,b) \cdot [x]. A variable x is free in a term t if [x] does not appear in t, and a term with no free variables is a closed term. Terms can be typed with pregroup types in the obvious manner.

The usual conventions regarding α conversion apply.

For a given language, we give an assignment I that maps typed words to typed closed terms in a way that respects the pregroup structure of the types. For the English fragment given above we might therefore have the following assignment (with the obvious, implicit set of atomic terms and function symbols):

I(John : N) = j : E
I(Mary : N) = m : E
I(the : N \cdot N_0^l) = \iota(p) \cdot [p] : E \cdot E_0^l
I(dog : N_0) = dog : E_0
I(cat : N_0) = cat : E_0
I(met : N^r \cdot S \cdot N^l) = [x] \cdot met(x,y) \cdot [y] : E^r \cdot T \cdot E^l
I(barked : N^r \cdot S) = [x] \cdot barked(x) : E^r \cdot T
I(at : S^r \cdot N^{rr} \cdot N^r \cdot S \cdot N^l) = [x] \cdot y \cdot [y] \cdot at(x,z) \cdot [z] : T^r \cdot E^{rr} \cdot E^r \cdot T \cdot E^l

where E is the type of entities in the domain, and T is the type of truth values.

Together with this core definition of the semantics of PG, we also have a reduction rules that are employed in parallel with the type reductions. Placing the syntactic types at the top and semantics below, we have

For example, applying this to the types and semantics for the sentence John\ met\ Mary : N \cdot (N^r \cdot S \cdot N^l) \cdot N (emphasizing the link being reduced)

For the sentence the\ dog\ barked\ at\ the\ cat : (N \cdot N_0^l) \cdot N_0 \cdot (N^r \cdot S) \cdot (S^r \cdot N^{rr} \cdot N^r \cdot S \cdot N^l) \cdot (N \cdot N_0^l) \cdot N_0:

See also

References

  1. Selinger, Peter (2011). "A survey of graphical languages for monoidal categories". New Structures for Physics. Lecture Notes in Physics. Springer. pp. 289–233.
  2. Preller, Anne; Mehrnoosh Sadrzadeh (2011). "Semantic Vector Models and Functional Models for Pregroup Grammars". Journal of Logic, Language and Information 20 (4): 419–443.
  3. Lambek, Joachim (1999). "Type Grammar revisited". In Alain Lecomte (ed.). Logical Aspects of Computational Linguistics. LNAI. Heidelberg: Springer. pp. 1–27.
  4. Lambek, Joachim (2008). "Pregroup Grammars and Chomsky’s Earliest Examples". Journal of Logic, Language and Information 17 (2): 141–160.
  5. Lambek 2008
  6. Casadio, Claudia; Joachim Lambek (2001). "An algebraic analysis of clitic pronouns in Italian". Logical Aspects of Computational Linguistics. Springer. pp. 110–124. ISBN 3540422730.
  7. Preller, Anne; Violaine Prince; et al. (2008). "Pregroup grammars with linear parsing of the French verb phrase". CL2008: 53–84.
  8. Sadrzadeh, Mehrnoosh (2008). "Pregroup analysis of Persian sentences". Computational algebraic approaches to natural language, Polimetrica, Milano, Italy: 121–144.
  9. Casadio, Claudia; Mehrnoosh Sadrzadeh (2014). "Word Order Alternation in Sanskrit via Precyclicity in Pregroup Grammars". In Franck van Breugel, Elham Kashefi, Catuscia Palamidessi, Jan Rutten (eds.). Horizons of the Mind. A Tribute to Prakash Panangaden. Lecture Notes in Computer Science 8464. Springer International Publishing. pp. 229–249. ISBN 978-3-319-06879-4.
This article is issued from Wikipedia - version of the Thursday, July 16, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.