Apache Tika Available With OSGi Bundle


The Apache Lucene project has announced the release of version 0.6 of its Apache Tika subproject. Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Version 0.6 adds OSGi bundle packaging, which makes it possible to use Tika features in an OSGi environment, and upgrades the Apache POI dependency, used for parsing Microsoft Office file formats, to version 3.6. A Flash Video (video/x-flv) parser has also been added.

The full change log is available on the Apache Lucene website, along with links to download the content analysis toolkit.
