Conflation Conflation

Conflation - Definition and Overview

A stemming algorithm is a method of reducing words to their stem, base, or root form. The algorithm has been a long-standing problem in computer science; the first paper on the subject was published in 1968. The process of stemming, often called conflation, is useful in search engines, natural language processing, and other text processing problems.

For example, a stemming algorithm reduces the words "fishing", "fished", "fish", and "fisher" to the root word, "fish".

Methods

There are several types of stemming algorithms. Some techniques used are suffix stripping and lookup table replacement. In lemmatization, the part of speech is first detected prior to attempting to find the root since for some languages, the stemming rules change depending on a word's part of speech. pPor While much of the work in this area has focused on the English language (with significant use of the Porter Stemmer algorithm), other languages have been investigated including at least German, French, Italian, Spanish, Portuguese, German, Dutch, Swedish, Norwegian, Danish, Russian, Finnish, Hebrew, and Arabic. Apparently, Hebrew and Arabic are still considered difficult research languages for stemming.

Further reading

  • W. B. Frakes, Stemming algorithms, Information retrieval: data structures and algorithms, Prentice-Hall, Inc., Upper Saddle River, NJ, 1992
  • Lovins, J. B. "Development of a Stemming Algorithm." Mechanical Translation and Computational Linguistics 11, 1968, 22--31.
  • Porter, M. F. "An Algorithm for Suffix Stripping." Program 14, 1980, 130--137.

External links

Example Usage of Conflation

snarkysmachine: @dorianisms I have to remind myself Chita Rivera and Rita Moreno are not the same person. This is a childhood Conflation I struggle to shake
BrettR4763: @RonSupportsYou U should talk.You love Conflation,saying that just b/c something worked in #Iraq naturally means it'll work in #Afghanistan.
loveandgarbage: I think children were most confused on Jigsaw by the Conflation of Noseybonk http://bit.ly/7PUdI0 and Janet Ellis - http://bit.ly/4UtasP
Copyright 2009 WordIQ.com - Privacy Policy  :: Terms of Use  :: Contact Us  :: About Us
This article is licensed under the GNU Free Documentation License. It uses material from the this Wikipedia article.