Skip to content
This repository has been archived by the owner on Jul 10, 2019. It is now read-only.

Tika module

butlermh edited this page Jun 2, 2011 · 3 revisions

Tika commands are found in behemoth-tika.job.

usage: com.digitalpebble.behemoth.tika.TikaDriver -i <input> -o <output> [-t <TikaProcessorClass>
-m <mimeType>]    
-i, --input           The input path
-o, --output          The output path
-t, --tikaProcesssor  The fully qualified name of a TikaProcessor class that handles the extraction
-m, --mimeType        The mime type to use

Parses the Behemoth corpus using Tika.

Behemoth Modules | Home

Clone this wiki locally