:id: A python library for accurate and scalable fuzzy matching, record d...
A C library for parsing/normalizing street addresses around the world. P...
A powerful and modular toolkit for record linkage and duplicate detectio...
Straightforward fuzzy matching, information retrieval and NLP building b...
:id: Command line tool for deduplicating CSV files
:id: Examples for using the dedupe library
A list of free data matching and record linkage software.
🔎 Finds fuzzy matches between CSV files
PyTorch library for transforming entities like companies, products, etc....
Spark RDD with Lucene's query and entity linkage capabilities
Resources for tackling record linkage / deduplication / data matching pr...
Record Linkage ToolKit (Find and link entities)
Link Wikidata items to large catalogs
Python package for deduplication/entity resolution using active learning
Python implementation of anonymous linkage using cryptographic linkage keys