This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.edu.
This release provides:
- SherlockModel class with the scikit-learn API (i.e. with fit, predict, and predict_proba methods)

Contributions by: @lowecg @madelonhulsebos
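To illustrate what the scikit-learn-style fit / predict / predict_proba convention mentioned above looks like in practice, here is a minimal sketch. It does not use Sherlock itself: ToyTypeDetector is a hypothetical stand-in written for this example, and its value-counting logic, argument shapes, and sample data are all assumptions. The actual SherlockModel signatures may differ, so check the repository before relying on them.

```python
from collections import Counter, defaultdict


class ToyTypeDetector:
    """Hypothetical stand-in showing the fit / predict / predict_proba convention.

    This is NOT Sherlock's model: it only counts which labels co-occur
    with the values seen in each training column.
    """

    def fit(self, X, y):
        # X: list of columns (each a list of strings); y: one semantic type per column.
        self.classes_ = sorted(set(y))
        self._counts = defaultdict(Counter)
        for column, label in zip(X, y):
            for value in column:
                self._counts[value][label] += 1
        return self

    def predict_proba(self, X):
        # Returns one row per column, with one probability per class in classes_ order.
        probabilities = []
        for column in X:
            # Start every class at 1 (add-one smoothing) so unseen values give a uniform row.
            votes = {label: 1 for label in self.classes_}
            for value in column:
                for label, count in self._counts.get(value, {}).items():
                    votes[label] += count
            total = sum(votes.values())
            probabilities.append([votes[label] / total for label in self.classes_])
        return probabilities

    def predict(self, X):
        # Pick the most probable class for each column.
        return [
            self.classes_[max(range(len(row)), key=row.__getitem__)]
            for row in self.predict_proba(X)
        ]


# Made-up example data: each column is a list of cell values.
train_columns = [["Paris", "Berlin", "Rome"], ["France", "Germany", "Italy"]]
train_labels = ["city", "country"]

model = ToyTypeDetector().fit(train_columns, train_labels)
predictions = model.predict([["Berlin", "Paris"], ["Italy", "France"]])
probabilities = model.predict_proba([["Berlin", "Paris"]])
```

The point of the convention is that code written against these three methods can swap one estimator for another without changing the calling code.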
This release reflects the code that was used for the experiments in the paper "Sherlock: a deep learning approach to semantic data type detection" (link to the paper on arXiv). This release provides code for:
This release contains inefficiencies and bugs, so we recommend using the latest release of this project for production settings and new research projects. More about this project can be found on this website.