Text_mining Text_mining

Text mining - Definition and Overview

Related Words: Assembly, Building, Casting, Composition, Construction, Conversion, Creation, Cultivation, Depression, Drawing, Dredging, Drilling, Engraving

Text mining, also known as intelligent text analysis, text data mining or knowledge-discovery in text (KDT), refers generally to the process of extracting interesting and non-trivial information and knowledge from unstructured text. Text mining is a young interdisciplinary field which draws on information retrieval, data mining, machine learning, statistics and computational linguistics. As most information (over 80%) is stored as text, text mining is believed to have a high commercial potential value.

One application of text mining is in bioinformatics, where details of experimental results can be automatically extracted from a large corpus of text and then processed computationally. For example it has been quoted that a support vector machine (SVM) with appropriate training can extract details of protein-protein interaction from the literature with greater than 90 percent accuracy.

Some bioinformaticians have termed the body of literature the textome, which derives its name from the same naming convention which gave us the genome, however this term is far from universal.

One of the largest text mining applications that exist is probably the classified ECHELON surveillance system.

External links

  • Kmining (http://www.kmining.com/info_conferences.html) List of text mining, data mining and KDD scientific conferences
  • KDNuggets (http://www.kdnuggets.com/) Data Mining, Web Mining, and Knowledge Discovery Guide

See also

Example Usage of mining

anne0208: #1 International best-selling author of mining Online Gold with an Offline Shovel will speak at http://internetcitadel.com/blog/jc/anne0208
shaywren: Nearly 100,000 people weigh in on protection of Grand Canyon from new uranium mining http://ow.ly/ARwn
tamonten: How many times do I have to say it? It's not stalking, it's data mining. K?
Copyright 2009 WordIQ.com - Privacy Policy  :: Terms of Use  :: Contact Us  :: About Us
This article is licensed under the GNU Free Documentation License. It uses material from the this Wikipedia article.