Try the Pipeline

Ghost detection filters bad data — matching engine resolves the rest.

1Ghost Detection
2Entity Matching