Transforms PDF, Documents and Images into Enriched Structured Data
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learni...
Extract internal monitoring data from application logs for collection in...
A library for audio and music analysis
Provides functions to read and write from/to an object or array using a ...
The Apache Tika toolkit detects and extracts metadata and text from over...
Visual Novels resource browser
Node.js module for extracting text from html, pdf, doc, docx, xls, xlsx,...
Tika-Python is a Python binding to the Apache Tika™ REST services allowi...
🦜⛏️ Did you say you like data?
Stanford Open Information Extraction made simple!
A C++ static library offering a clean and simple interface to the 7-zip ...
A program to extract files from the RPA archive format.
Feed PDFs, docs, slides, web pages and more into GPT-4-Vision in one lin...
File Injector is a script that allows you to store any file in an image ...