Skip to content

docker files for standalone Tika document-extraction service

Notifications You must be signed in to change notification settings

butter/docker-tika

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Supported tags and respective Dockerfile links

What is Tika?

Apache Tika is a content detection and analysis framework, written in Java, stewarded at the Apache Software Foundation. It detects and extracts metadata and text from over a thousand different file types, and as well as providing a Java library, has Server and Command Line editions suitable for use from other programming languages.

wikipedia.org/Apache_Tika

logo

How to use this image

Create a Dockerfile in your Tika project

FROM butter/tika-server:1.14
ENTRYPOINT java -jar /usr/local/bin/tika.jar -h 0.0.0.0 -c /tika-config.xml

License

View license information for Tika.

About

docker files for standalone Tika document-extraction service

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •