public class AbstractTikaParser extends Object implements IDocumentParser
Modifier and Type | Class and Description |
---|---|
protected class |
AbstractTikaParser.RecursiveMetadataParser |
RDF_BASE_URI, RDF_SUBJECT_CONTENT
Constructor and Description |
---|
AbstractTikaParser(Parser parser,
String format)
Creates a new Tika-based parser.
|
Modifier and Type | Method and Description |
---|---|
protected void |
addTikaMetadata(Metadata tikaMeta,
Properties metadata) |
void |
parseDocument(InputStream inputStream,
ContentType contentType,
Writer output,
Properties metadata)
Parses a document.
|
public final void parseDocument(InputStream inputStream, ContentType contentType, Writer output, Properties metadata) throws DocumentParserException
IDocumentParser
parseDocument
in interface IDocumentParser
inputStream
- the document to parsecontentType
- the content type of the documentoutput
- where to save the extracted textmetadata
- where to store the metadataDocumentParserException
protected void addTikaMetadata(Metadata tikaMeta, Properties metadata)
Copyright © 2009-2014 Norconex Inc.. All Rights Reserved.