@mastersthesis{St-Onge1,
  author = "David St-Onge",
  title = "Detecting and correcting malapropisms with lexical chains",
  school = "Department of Computer Science, University of Toronto",
  month = "March",
  year = "1995",
  note = "Published as technical report CSRI-319",
  abstract = "<P>Because chains of semantically related words express semantic
              continuity, such lexical chains can play an important role in the
              detection of malapropisms.  A <I>malapropism</I> is a correctly
              spelled word that does not fit in the context where it is used because
              it is the result of a spelling error on a different word that was
              intended.</p>
              <P>I first assume that such a word has much less probability of being
              inserted in any chain with other words.  If this assumption is
              correct, words that failed to be inserted with other words can be
              considered as potential malapropisms.  A mechanism that generates
              spelling replacements can then be used to generate replacement
              candidates.  The second assumption is that whenever a spelling
              replacement can be inserted in a chain with other words, this
              replacement is likely to be the intended word for which a malapropism
              has been substituted.</p>
              <P>The algorithm proposed here to detect lexical chains uses the on-line
              thesaurus WordNet to automatically quantify semantic relations between
              words.  Chains identified by the algorithm may have two major
              problems: over- or under-chaining.  Under-chaining---the inability to
              link a pair of related words---might be caused by an inadequacy of
              WordNet's set of relations, a lack of connections in WordNet's set of
              relations, a lack of connections in WordNet, a lack of consistency in
              the semantic proximity expressed by WordNet's links, and a poor
              algorithm for chaining.  Over-chaining---the linking of two poorly
              related words---might happen whenever two semantically distant words
              are close to each other in WordNet's graph.  Over-chaining often
              results in the merging of two chains.</p>
              <P>The results of the experiment show the validity of the basic
              assumptions.  However, improvements to the lexical-chaining algorithm
              are required before the malapropism detection algorithm can be
              integrated into a commercial spelling checker.</p>",
  download = "ftp://ftp.cs.toronto.edu/csri-technical-reports/319/319.ps.Z",
}


