Data collected by organizations is increasing at a relentless pace, but can still give a misleading or fragmented view of the real world.
For example, a person could appear multiple times or have multiple digital entities within the same database, due to typos, name changes, aggregation of different systems and so on. So, how do we match entities when the ID systems may be different or contain errors?
Entity resolution (ER) helps get to the truth. Entity resolution, which is the disambiguation of real-world entities in a database, is an essential data quality tool.
Graph provides an efficient approach for the entity resolution problem. A native graph database with massive parallel computing capability is the best tool to implement the approach.