data-matching
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
Here are 4 public repositories matching this topic...
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
-
Updated
Jul 12, 2025 - Java
Weka Comparator to match rules to test data with filtering abilites
-
Updated
Jan 10, 2024 - Java
Identity Reconciliation is a Java-based project focused on resolving and merging duplicate identities across systems to ensure consistent and accurate user data management.
-
Updated
Jul 22, 2025 - Java
Created by Halbert L. Dunn
Released 1946
- Followers
- 44 followers
- Organization
- entity-resolution
- Website
- github.com/topics/entity-resolution
- Wikipedia
- Wikipedia