Apache Tika Available With OSGi Bundle
The Apache Lucene project has announced the release of version 0.6 of its Apache Tika subproject. Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Version 0.6 adds OSGi bundle packaging, which makes it possible to use Tika features in an OSGi environment, and upgrades the Apache POI dependency, used for parsing Microsoft Office file formats, to version 3.6. A Flash Video (video/x-flv) parser has also been added.
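For readers unfamiliar with content-type detection, the general idea can be illustrated with a minimal, standalone Java sketch that checks a stream for the three-byte "FLV" signature that Flash Video files begin with. This is not Tika's code or API: the class name `FlvSniffer`, the method `detect`, and the fallback type `application/octet-stream` are assumptions made purely for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical illustration only; not part of Apache Tika.
public class FlvSniffer {
    // ASCII "FLV": the signature at the start of a Flash Video file.
    private static final byte[] FLV_MAGIC = {0x46, 0x4C, 0x56};

    // Returns "video/x-flv" if the stream starts with the FLV signature,
    // otherwise a generic binary type (an assumed fallback for this sketch).
    static String detect(InputStream in) throws IOException {
        byte[] head = new byte[FLV_MAGIC.length];
        int filled = 0;
        // read() may return fewer bytes than requested, so loop until
        // the header buffer is full or the stream ends.
        while (filled < head.length) {
            int n = in.read(head, filled, head.length - filled);
            if (n < 0) {
                break;
            }
            filled += n;
        }
        if (filled == FLV_MAGIC.length) {
            boolean match = true;
            for (int i = 0; i < FLV_MAGIC.length; i++) {
                if (head[i] != FLV_MAGIC[i]) {
                    match = false;
                    break;
                }
            }
            if (match) {
                return "video/x-flv";
            }
        }
        return "application/octet-stream";
    }

    public static void main(String[] args) throws IOException {
        byte[] flvHeader = {0x46, 0x4C, 0x56, 0x01};
        byte[] other = {0x25, 0x50, 0x44, 0x46};
        System.out.println(detect(new ByteArrayInputStream(flvHeader)));
        System.out.println(detect(new ByteArrayInputStream(other)));
    }
}
```

A real toolkit would typically combine signature checks like this with other hints, such as file names, before handing the stream to a format-specific parser.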