Tika Extractor

Extract files contents by querying a running Apache Tika server

Parameters

Label (internal name) Type Default Description
Tika Service Address (serverAddress) string localhost:9998 Use HOSTNAME|IP:PORT, http://host:port will be used to query Tika server
Extract and store textual content (extractContent) string pydio-binaries/tika-{{.Node.Uuid}}.gz Tika can extract content from many files types, that can be indexed by search server
Compress extracted contents (compressContent) boolean true If switched on, extracted content is compressed before storing
Additional metadata fields (additionalMeta) string Content-Type Try to find additional known keys in metadata extracted by Tika. Use comma separated list of field names
Back to top