|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
See:
Description
Interface Summary | |
---|---|
IArticleFilter | Interface for a filter which processes all articles from a given wikipedia XML dump file |
Class Summary | |
---|---|
InfoBox | A class abstracting a Wiki infobox. |
PrintArticle | Example filter which prints the rendered HTML articles to System.out The number of processed articles is limited by a maximum counter |
Siteinfo | The site and namespace information found in the header of a Mediawiki dump |
WikiArticle | Represents a single wiki page from a Mediawiki dump. |
WikiPatternMatcher | Can be used as a stand alone class to pattern match parts of the wiki formatted text. |
WikiXMLParser | A Wikipedia XML dump file parser Original version with permission from Marco Schmidt. |
Process an XML Wikipedia dump with the WikiXMLParser class
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |