|
Text mining - Definition and Overview |
| Related Words: Assembly, Building, Casting, Composition, Construction, Conversion, Creation, Cultivation, Depression, Drawing, Dredging, Drilling, Engraving |
|
|
|
Text mining, also known as intelligent text analysis, text data mining or knowledge-discovery in text (KDT), refers generally to the process of extracting interesting and non-trivial information and knowledge from unstructured text. Text mining is a young interdisciplinary field which draws on information retrieval, data mining, machine learning, statistics and computational linguistics. As most information (over 80%) is stored as text, text mining is believed to have a high commercial potential value.
One application of text mining is in bioinformatics, where details of experimental results can be automatically extracted from a large corpus of text and then processed computationally. For example it has been quoted that a support vector machine (SVM) with appropriate training can extract details of protein-protein interaction from the literature with greater than 90 percent accuracy.
Some bioinformaticians have termed the body of literature the textome, which derives its name from the same naming convention which gave us the genome, however this term is far from universal.
One of the largest text mining applications that exist is probably the classified ECHELON surveillance system.
External links
- Kmining (http://www.kmining.com/info_conferences.html) List of text mining, data mining and KDD scientific conferences
- KDNuggets (http://www.kdnuggets.com/) Data Mining, Web Mining, and Knowledge Discovery Guide
See also
|
|
Example Usage of mining |
 |
anne0208: #1 International best-selling author of mining Online Gold with an Offline Shovel will speak at http://internetcitadel.com/blog/jc/anne0208 |
 |
shaywren: Nearly 100,000 people weigh in on protection of Grand Canyon from new uranium mining http://ow.ly/ARwn |
 |
tamonten: How many times do I have to say it? It's not stalking, it's data mining. K? |
|