- IDocumentFilter - Interface in com.norconex.importer.filter
-
Filters documents.
- IDocumentParser - Interface in com.norconex.importer.parser
-
Implementations are responsible for parsing a document (InputStream) to
extract its text and metadata.
- IDocumentParserFactory - Interface in com.norconex.importer.parser
-
Factory providing document parsers for documents.
- IDocumentTagger - Interface in com.norconex.importer.tagger
-
Tags a document with extra metadata information, or manipulate existing
metadata information.
- IDocumentTransformer - Interface in com.norconex.importer.transformer
-
Transformers allow to manipulate and convert extracted text and
save the modified text back.
- IImportHandler - Interface in com.norconex.importer
-
Identifies a class as being an import handler.
- importDocument(InputStream, Writer, Properties) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(InputStream, ContentType, Writer, Properties, String) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(File, File, Properties) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(File, ContentType, File, Properties, String) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- Importer - Class in com.norconex.importer
-
Principal class responsible for importing documents.
- Importer() - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with default configuration.
- Importer(ImporterConfig) - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with the given configuration.
- IMPORTER_PREFIX - Static variable in class com.norconex.importer.Importer
-
- ImporterConfig - Class in com.norconex.importer
-
Importer configuration.
- ImporterConfig() - Constructor for class com.norconex.importer.ImporterConfig
-
- ImporterConfigLoader - Class in com.norconex.importer
-
Importer configuration loader.
- ImporterException - Exception in com.norconex.importer
-
Runtime exception thrown by many of the importer classes upon encountering
issues.
- ImporterException() - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(String) - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(Throwable) - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(String, Throwable) - Constructor for exception com.norconex.importer.ImporterException
-
- INT_ARRAY - Static variable in class com.norconex.importer.parser.impl.wordperfect.WordPerfectInputStream
-
- IOnMatchFilter - Interface in com.norconex.importer.filter
-
Tells the collector that a filter is of "OnMatch" type.
- isCaseSensitive() - Method in class com.norconex.importer.filter.impl.RegexMetadataFilter
-
- isCaseSensitive() - Method in class com.norconex.importer.tagger.impl.TextBetweenTagger
-
- isCaseSensitive() - Method in class com.norconex.importer.transformer.impl.ReduceConsecutivesTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.transformer.impl.ReplaceTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.transformer.impl.StripBetweenTransformer
-
- isInclusive() - Method in class com.norconex.importer.tagger.impl.TextBetweenTagger
-
- isInclusive() - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
- isInclusive() - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
- isInclusive() - Method in class com.norconex.importer.transformer.impl.StripBetweenTransformer
-
- isRegex() - Method in class com.norconex.importer.tagger.impl.ReplaceTagger.Replacement
-
- isRegex() - Method in class com.norconex.importer.tagger.impl.SplitTagger.Split
-
- saveToXML(XMLStreamWriter) - Method in class com.norconex.importer.AbstractRestrictiveHandler
-
Convenience method for subclasses to save metadata restrictions.
- saveToXML(XMLStreamWriter) - Method in class com.norconex.importer.AbstractTextRestrictiveHandler
-
Convenience method for subclasses to save content type regex.
- saveToXML(XMLStreamWriter) - Method in class com.norconex.importer.filter.AbstractOnMatchFilter
-
Convenience method for subclasses to save the "onMatch" attribute
to an XML file when XMLConfiguration
is used.
- saveToXML(Writer) - Method in class com.norconex.importer.filter.impl.EmptyMetadataFilter
-
- saveToXML(Writer) - Method in class com.norconex.importer.filter.impl.RegexMetadataFilter
-
- saveToXML(Writer) - Method in class com.norconex.importer.parser.DefaultDocumentParserFactory
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.ConstantTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.CopyTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.DeleteTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.ForceSingleValueTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.HierarchyTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.KeepOnlyTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.RenameTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.ReplaceTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.SplitTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.tagger.impl.TextBetweenTagger
-
- saveToXML(XMLStreamWriter) - Method in class com.norconex.importer.transformer.AbstractRestrictiveTransformer
-
Deprecated.
Convenience method for subclasses to save metadata restrictions.
- saveToXML(Writer) - Method in class com.norconex.importer.transformer.impl.ReduceConsecutivesTransformer
-
- saveToXML(Writer) - Method in class com.norconex.importer.transformer.impl.ReplaceTransformer
-
- saveToXML(Writer) - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
- saveToXML(Writer) - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
- saveToXML(Writer) - Method in class com.norconex.importer.transformer.impl.StripBetweenTransformer
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.filter.impl.RegexMetadataFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.tagger.impl.TextBetweenTagger
-
Sets whether to ignore case when matching start and end text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.transformer.impl.ReduceConsecutivesTransformer
-
Sets whether to ignore case when matching characters or string
to reduce.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.transformer.impl.ReplaceTransformer
-
Sets whether to ignore case when matching characters or string
to reduce.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
Sets whether to ignore case when matching start and end text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
Sets whether to ignore case when matching start and end text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.transformer.impl.StripBetweenTransformer
-
Sets whether to ignore case when matching start and end text.
- setContentTypeRegex(String) - Method in class com.norconex.importer.AbstractTextRestrictiveHandler
-
Sets the regular expression to match the content type.
- setFormat(String) - Method in class com.norconex.importer.parser.DefaultDocumentParserFactory
-
- setInclusive(boolean) - Method in class com.norconex.importer.tagger.impl.TextBetweenTagger
-
Sets whether start and end text pairs should themselves be stripped or
not.
- setInclusive(boolean) - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
Sets whether start and end text pairs should themselves be stripped or
not.
- setInclusive(boolean) - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
Sets whether start and end text pairs should themselves be stripped or
not.
- setInclusive(boolean) - Method in class com.norconex.importer.transformer.impl.StripBetweenTransformer
-
Sets whether start and end text pairs should themselves be stripped or
not.
- setOnMatch(OnMatch) - Method in class com.norconex.importer.filter.AbstractOnMatchFilter
-
- setParserFactory(IDocumentParserFactory) - Method in class com.norconex.importer.ImporterConfig
-
- setPostParseHandlers(IImportHandler...) - Method in class com.norconex.importer.ImporterConfig
-
- setPreParseHandlers(IImportHandler...) - Method in class com.norconex.importer.ImporterConfig
-
- setProperties(String...) - Method in class com.norconex.importer.filter.impl.EmptyMetadataFilter
-
- setProperty(String) - Method in class com.norconex.importer.filter.impl.RegexMetadataFilter
-
- setReductions(String...) - Method in class com.norconex.importer.transformer.impl.ReduceConsecutivesTransformer
-
- setRegex(String) - Method in class com.norconex.importer.filter.impl.RegexMetadataFilter
-
- setRestriction(String, String, boolean) - Method in class com.norconex.importer.AbstractRestrictiveHandler
-
Sets what this handler should be restricted to.
- setRestriction(String, String, boolean) - Method in class com.norconex.importer.transformer.AbstractRestrictiveTransformer
-
Deprecated.
- setStripAfterRegex(String) - Method in class com.norconex.importer.transformer.impl.StripAfterTransformer
-
- setStripBeforeRegex(String) - Method in class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
- SplitTagger - Class in com.norconex.importer.tagger.impl
-
Splits an existing metadata value into multiple values based on a given
value separator.
- SplitTagger() - Constructor for class com.norconex.importer.tagger.impl.SplitTagger
-
- SplitTagger.Split - Class in com.norconex.importer.tagger.impl
-
- SplitTagger.Split(String, String, String, boolean) - Constructor for class com.norconex.importer.tagger.impl.SplitTagger.Split
-
- StripAfterTransformer - Class in com.norconex.importer.transformer.impl
-
Strips any content found after first match found for given pattern.
- StripAfterTransformer() - Constructor for class com.norconex.importer.transformer.impl.StripAfterTransformer
-
- StripBeforeTransformer - Class in com.norconex.importer.transformer.impl
-
Strips any content found before first match found for given pattern.
- StripBeforeTransformer() - Constructor for class com.norconex.importer.transformer.impl.StripBeforeTransformer
-
- StripBetweenTransformer - Class in com.norconex.importer.transformer.impl
-
Strips any content found between a matching start and end strings.
- StripBetweenTransformer() - Constructor for class com.norconex.importer.transformer.impl.StripBetweenTransformer
-