Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Common:
CLI Mode:
Client::setEnvVars() to add environment variables (fixes #28)
CLI mode:
Client::getXHTML() to get an XML-compliant output (fixes #27)
Client::setJavaArgs() to add arguments for the java binary
$client->getRecursiveMetadata() returns an array as expected
Client::getSupportedVersions() and Client::isVersionSupported() methods cannot be called statically
Client::getAvailableDetectors() and Client::getAvailableParsers() return an array with a new format
NOTE: this feature was planned to be released with the 1.x branch because it has limited functionality (see #23). The contribution of @vuthaihoc added it to the 0.x branch.
Client::setCallback() to save memory
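The memory saving from a callback comes from handling output chunk by chunk instead of collecting it into one large string. The sketch below illustrates only that general pattern with plain PHP streams. The `streamChunks()` helper, its parameters and the sample data are hypothetical and are not part of this library's API or its actual implementation.

```php
<?php
// Hypothetical helper (not part of the library): reads a stream in
// fixed-size chunks and hands each chunk to a callback, so the full
// content is never held in memory at once.
function streamChunks($stream, callable $callback, int $chunkSize = 1048576): int
{
    $total = 0;
    while (!feof($stream)) {
        $chunk = fread($stream, $chunkSize);
        if ($chunk === false || $chunk === '') {
            break; // read error or nothing left to read
        }
        $total += strlen($chunk);
        $callback($chunk); // the caller processes and discards the chunk
    }
    return $total; // number of bytes passed to the callback
}

// Sample input: an in-memory stream standing in for extracted text.
$stream = fopen('php://memory', 'w+');
fwrite($stream, str_repeat('lorem ipsum ', 1000));
rewind($stream);

// Count the chunks without accumulating the text.
$chunks = 0;
$bytes = streamChunks($stream, function (string $chunk) use (&$chunks) {
    $chunks++;
}, 4096);
fclose($stream);
```

With this shape, peak memory is bounded by the chunk size rather than by the document size, which is the trade-off a callback-based API is aiming for.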