Nutch Nutch

Nutch - Definition and Overview

Nutch is an effort to build an open source search engine. It uses Lucene for the search and index component. The fetcher (robot) has been written from scratch solely for this project.

Nutch has a highly modular architecture allowing developers to create plugins for the following activities: media-type parsing, data retrieval, querying and clustering.

Tim O'Reilly has a seat in Nutch's board of directors.

Doug Cutting is the lead developer.

It is completely coded in Java, but data is written in language-independent formats. In June 2003 there was a successful 100 million page demo system.

External links

Example Usage of Nutch

094459: just implemented a google search type app for my zillions of local pdf's using Regain - works well and is easier to use than Nutch imho
yuanying: オープンソースの検索エンジン。Ruby 製の Suzaku がビミョーだったので、次は Nutch を試してみよう。
dstuartnz: I am liking #lucene #tika not that impressed with the documentation or lack of it but its starting to come together with my Nutch plugin yea
Copyright 2009 WordIQ.com - Privacy Policy  :: Terms of Use  :: Contact Us  :: About Us
This article is licensed under the GNU Free Documentation License. It uses material from the this Wikipedia article.