|
|||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||||
| Interface Summary | |
| PruneIndexTool.PruneChecker | This interface can be used to implement additional checking on matching documents. |
| Class Summary | |
| DmozParser | Utility that converts DMOZ RDF into a flat file of URLs to be injected. |
| PruneIndexTool | This tool prunes existing Nutch indexes of unwanted content. |
| PruneIndexTool.PrintFieldsChecker | This checker's main function is just to print out selected field values from each document, just before they are deleted. |
| PruneIndexTool.StoreUrlsChecker | This checker's main function is just to store the URLs of each document to be deleted in a text file. |
|
|||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||||