Document_classification Document_classification

Document classification - Definition and Overview

Related Words: Appraisal, Assessment, Categorization, Class, Evaluation, Factoring, Family, Genus, Grouping, Identification, Kingdom, Nomenclature, Onomastics, Onomatology, Order, Phylum

Document classification is a problem in information science. The task is to assign a document to one or more categories, based on its contents. Document classification tasks can be divided into two sorts: supervised document classification where some external mechanism (such as human feedback) provides information on the correct classification for documents, and unsupervised document classification, where the classification must be done entirely without reference to external information.

Document classification techniques include:

and approaches based on natural language processing.

A recent notable use of document classification techniques has been spam filtering which tries to discern E-mail spam messages from legitimate emails.

See also

External links

Copyright 2009 WordIQ.com - Privacy Policy  :: Terms of Use  :: Contact Us  :: About Us
This article is licensed under the GNU Free Documentation License. It uses material from the this Wikipedia article.