Package | Description |
---|---|
com.norconex.importer.parser | |
com.norconex.importer.parser.impl | |
com.norconex.importer.parser.impl.wordperfect |
Modifier and Type | Method and Description |
---|---|
protected IDocumentParser |
DefaultDocumentParserFactory.getFallbackParser() |
IDocumentParser |
IDocumentParserFactory.getParser(String documentReference,
ContentType contentType)
Gets a document parser, optionally based on its reference or content
type.
|
IDocumentParser |
DefaultDocumentParserFactory.getParser(String documentReference,
ContentType contentType)
Gets a parser based on content type, regardless of document reference
(ignoring it).
|
Modifier and Type | Method and Description |
---|---|
protected void |
DefaultDocumentParserFactory.registerFallbackParser(IDocumentParser parser) |
protected void |
DefaultDocumentParserFactory.registerNamedParser(ContentType contentType,
IDocumentParser parser) |
Modifier and Type | Class and Description |
---|---|
class |
AbstractTikaParser
Base class wrapping Apache Tika parser for use by the importer.
|
class |
FallbackParser
Parser using auto-detection of document content-type to figure out
which specific parser to invoke to best parse a document.
|
class |
HTMLParser
HTML parser based on Apache Tika
HtmlParser . |
class |
PDFParser
HTML parser based on Apache Tika
PDFParser . |
Modifier and Type | Class and Description |
---|---|
class |
WordPerfectParser
Parser for WordPerfect documents.
|
Copyright © 2009-2014 Norconex Inc.. All Rights Reserved.