Scenarios
In this section
- Creating a clean data set from the suspect pairs labeled by tMatchPredict and the unique rows computed by tMatchPairing
- Selecting the best-of-breed data from a group of duplicates to create a survivor
- Modifying the rule file manually to code the conditions you want to use to create a survivor
- Converting the Standard Job to a Spark Batch Job
- Merging the content of several rows using different columns as rank values