Advanced search×

A Context-theoretic Framework for Compositionality in Distributional Semantics

Audio, Transactions of the IRE Professional Group on (2011)

Techniques in which words are represented as vectors have proved useful in many applications in computational linguistics, however there is currently no general semantic formalism for representing meaning in terms of vectors. We present a framework for natural language semantics in which words, phrases and sentences are all represented as vectors, based on a theoretical analysis which assumes that meaning is determined by context. In the theoretical analysis, we define a corpus model as a mathematical abstraction of a text corpus. The meaning of a string of words is assumed to be a vector representing the contexts in which it occurs in the corpus model. Based on this assumption, we can show that the vector representations of words can be considered as elements of an algebra over a field. We note that in applications of vector spaces to representing meanings of words there is an underlying lattice structure; we interpret the partial ordering of the lattice as describing entailment between meanings. We also define the context-theoretic probability of a string, and, based on this and the lattice structure, a degree of entailment between strings. We relate the framework to existing methods of composing vector-based representations of meaning, and show that our approach generalises many of these, including vector addition, component-wise multiplication, and the tensor product.

Version: za2963e q8za4 q8zb3 q8zc3 q8zd0 q8ze8 q8zf1 q8zg5

Similar articles you may find interesting…

  1. Mapping QTL for grain yield and other agronomic traits in post-rainy sorghum [Sorghum bicolor (L.) Moench].

    Theor Appl Genet (2013) PMID 23649648

    Sorghum, a cereal of economic importance ensures food and fodder security for millions of rural families in the semi-arid tropics. The objective of the present study was to identify and validate quantitative trait loci (QTL) for grain yield and other agronomic traits using replicated...
  2. Effects of Multimedia Vocabulary Instruction on Adolescents With Learning Disabilities.

    j learn disabil (2013) PMID 23649222

    The purpose of this experimental study is to investigate the effects of using content acquisition podcasts (CAPs), an example of instructional technology, to provide vocabulary instruction to adolescents with and without learning disabilities (LD). A total of 279 urban high school st...
  3. On the Dynamics of Non-Relativistic Flavor-Mixed Particles

    arXiv:1305.1306 [astro-ph.CO] 6 May 2013

    Evolution of a system of interacting non-relativistic quantum flavor-mixed particles is considered both theoretically and numerically. It was shown that collisions of mixed particles not only scatter them elastically, but can also change their mass eigenstates thus affecting particles' flavor compo...