Modifier and Type | Method and Description |
---|---|
IImportHandler[] |
ImporterConfig.getPostParseHandlers() |
IImportHandler[] |
ImporterConfig.getPreParseHandlers() |
Modifier and Type | Method and Description |
---|---|
void |
ImporterConfig.setPostParseHandlers(IImportHandler... handlers) |
void |
ImporterConfig.setPreParseHandlers(IImportHandler... handlers) |
Modifier and Type | Interface and Description |
---|---|
interface |
IDocumentFilter
Filters documents.
|
Modifier and Type | Class and Description |
---|---|
class |
EmptyMetadataFilter
Accepts or rejects a document based on whether specified metadata properties
are empty or not.
|
class |
RegexMetadataFilter
Accepts or rejects a document based on its property values using
regular expression.
|
Modifier and Type | Interface and Description |
---|---|
interface |
IDocumentTagger
Tags a document with extra metadata information, or manipulate existing
metadata information.
|
Modifier and Type | Class and Description |
---|---|
class |
AbstractCharStreamTagger
Base class for taggers dealing with the body of text documents only.
|
class |
AbstractStringTagger
Base class to facilitate creating taggers based on text content, loading
text into
StringBuilder for memory processing, also giving more
options (like fancy regex). |
Modifier and Type | Class and Description |
---|---|
class |
ConstantTagger
Define and add constant values to documents.
|
class |
CopyTagger
Copies metadata fields.
|
class |
DeleteTagger
Delete the metadata fields provided.
|
class |
ForceSingleValueTagger
Forces a metadata field to be single-value.
|
class |
HierarchyTagger
Given a separator, split a field string into multiple segments
representing each node of a hierarchical branch.
|
class |
KeepOnlyTagger
Keep only the metadata fields provided, delete all other ones.
|
class |
RenameTagger
Rename metadata fields to different names.
|
class |
ReplaceTagger
Replaces an existing metadata value with another one.
|
class |
SplitTagger
Splits an existing metadata value into multiple values based on a given
value separator.
|
class |
TextBetweenTagger
Extracts and add values found between a matching start and
end strings to a document metadata field.
|
Modifier and Type | Interface and Description |
---|---|
interface |
IDocumentTransformer
Transformers allow to manipulate and convert extracted text and
save the modified text back.
|
Modifier and Type | Class and Description |
---|---|
class |
AbstractCharStreamTransformer
Base class for transformers dealing with text documents only.
|
class |
AbstractRestrictiveTransformer
Deprecated.
Since 1.1.0. Use
AbstractRestrictiveHandler |
class |
AbstractStringTransformer
Base class to facilitate creating transformers on text content, loading
text into
StringBuilder for memory processing, also giving more
options (like fancy regex). |
Modifier and Type | Class and Description |
---|---|
class |
ReduceConsecutivesTransformer
Reduces specified consecutive characters or strings to only one
instance (document content only).
|
class |
ReplaceTransformer
Replaces every occurrences of the given replacements
(document content only).
|
class |
StripAfterTransformer
Strips any content found after first match found for given pattern.
|
class |
StripBeforeTransformer
Strips any content found before first match found for given pattern.
|
class |
StripBetweenTransformer
Strips any content found between a matching start and end strings.
|
Copyright © 2009-2014 Norconex Inc.. All Rights Reserved.