Fusion Customer Hub
Data Cleansing - Duplicate Identification and Resolution
Author: Roberto Negro
Date: Dec. 2007
A Day in the Life of a Data Steward
Customer Data Mastering:
- Data Profiling
- Data Cleansing
- Data Normalization & Mapping
- Data Import
- Duplicates Identification
- Duplicates Resolution
- Data Quality Analysis
- Data Certification
- Data Archiving
Complete View of Data Management – Customer Data Steward Dashboard
- Consolidated dashboard providing an overview of all data management tasks and status
- Single point of entry for the data steward's day-to-day tasks, with easy navigation to individual work areas
- Integrated Hub 360 tree for a comprehensive view of all customer profiles and important relationship and child entities
Data Steward – Data Cleansing Duplicate Identification and Resolution
Data Steward – Data Cleansing Duplicate Identification
Data Steward – Data Cleansing Duplicate Identification – Regional Search
Data Steward – Data Cleansing Duplicate Identification – Local Area Basic Search
Data Steward – Data Cleansing Duplicate Identification – Local Area Advanced Search
Data Steward – Data Cleansing Duplicate Identification – Search Results
Data Steward – Data Cleansing Duplicate Identification Workarea
Duplicate Identification Create Duplicate Identification Batch (Party Type = Person)
Duplicate Identification Create Duplicate Identification Batch (Party Type = Person)
Person: Created by, Creation date, First name, Last modified date, Last name, Last modified by, Name, Source system reference, Taxpayer identification number
Address: Address line 1, Address line 2, City, Country, Created by module, Created by, Creation date, Validation indicator, Last modified date, Last modified by, Source system reference, Postal code, State, Time zone
Billing account: Account description, Account number, Created by, Creation date, Last modified date, Last modified by, Source system reference
Duplicate Identification Create Duplicate Identification Batch (Party Type = Organization)
Duplicate Identification Create Duplicate Identification Batch (Party Type = Organization)
Organization: Created by, Creation date, D-U-N-S number, HQ branch indicator, Last modified date, Last modified by, Organization name, Source system reference, SIC code, Taxpayer identification number
Address: Address line 1, Address line 2, City, Country, Created by module, Created by, Creation date, Validation indicator, Last modified date, Last modified by, Source system reference, Postal code, State, Time zone
Billing account: Account description, Account number, Created by, Creation date, Last modified date, Last modified by, Source system reference
Data Steward – Data Cleansing Duplicate Identification – Assign Batch
Data Steward – Data Cleansing Duplicate Identification – Batch Details
Exercise
- Connect as Data_Steward_Mgr/Welcome1
- Try to identify duplicates in the data that was imported using File Import or Data Import
Questions & Answers Oracle - RPA Confidential - Do not share outside of Oracle or RPA
Data Steward – Data Cleansing Duplicates Resolution
Duplicate Resolution Options – Merge vs Link
- Flexible options for duplicate resolution:
  - Merge: creates a single record and re-points all transactions to the new record
  - Link: creates an internal cross reference between all the records participating in the link
- Satisfies business needs when duplicates must be maintained on purpose while still maintaining a single view of the customer
- Supports registry and transactional style deployments
© 2008 Oracle Corporation – Proprietary and Confidential
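The difference between the two options can be illustrated with a minimal in-memory sketch. Everything in it is hypothetical (the `Hub` class, its fields, and the party and transaction ids are illustration only, not the product's data model), and it assumes the "single record" produced by a merge is the surviving master record.

```python
from dataclasses import dataclass, field


@dataclass
class Hub:
    """Toy in-memory hub; every name here is hypothetical."""
    parties: dict = field(default_factory=dict)       # party_id -> attribute dict
    transactions: dict = field(default_factory=dict)  # txn_id -> owning party_id
    links: list = field(default_factory=list)         # groups of cross-referenced party ids

    def merge(self, survivor_id, victim_ids):
        """Keep one record and re-point the victims' transactions to it."""
        for txn_id, owner in self.transactions.items():
            if owner in victim_ids:
                self.transactions[txn_id] = survivor_id
        for victim_id in victim_ids:
            self.parties.pop(victim_id)
        return survivor_id

    def link(self, party_ids):
        """Keep every record and store a cross reference between them."""
        group = frozenset(party_ids)
        self.links.append(group)
        return group


hub = Hub(
    parties={1: {"name": "John Smith"}, 2: {"name": "Jon Smith"}, 3: {"name": "J. Smith"}},
    transactions={"T1": 1, "T2": 2, "T3": 3},
)
hub.merge(survivor_id=1, victim_ids={2})  # party 2 disappears, T2 now belongs to party 1
hub.link({1, 3})                          # parties 1 and 3 both remain, cross-referenced
```

After the merge only one record owns the transactions of both duplicates; after the link both records still exist, which is the case where duplicates are kept on purpose.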
Data Steward – Duplicate Resolution – Productivity: Task Assignment and Escalation
- Ability to assign duplicate resolution requests to data stewards
- Supports assignment of multiple items
- Notification support when resolution requests are assigned, rejected, completed, or in error
- Multi-step review and approval of the duplicate resolution process
Data Steward – Duplicate Resolution – Preserving the Right Information
- Ability to select the master record to keep after merge
- Ability to resolve conflicts in attribute values, and to enter or modify attribute values during the merge process
- Ability to resolve conflicts in selected child entities
- Ability to save the merge process for further review and approval
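As a rough illustration of attribute-level conflict resolution, the sketch below starts from the chosen master record, reports attributes whose values disagree, and lets the steward's explicit choices win. The function name, the attribute names, and the policy of filling empty master attributes from duplicates are all assumptions made for this example, not documented product behavior.

```python
def build_merged_record(master, duplicates, overrides=None):
    """Return (merged, conflicts) for a master record and its duplicates.

    merged    : master values, empty ones filled from duplicates, then overrides applied
    conflicts : attribute -> list of the differing non-empty values that were seen
    """
    merged = dict(master)
    conflicts = {}
    for dup in duplicates:
        for attr, value in dup.items():
            if value in (None, ""):
                continue  # an empty value never conflicts
            current = merged.get(attr)
            if current in (None, ""):
                merged[attr] = value  # assumption: fill gaps in the master
            elif current != value:
                seen = conflicts.setdefault(attr, [current])
                if value not in seen:
                    seen.append(value)
    merged.update(overrides or {})  # the steward's decisions take precedence
    return merged, conflicts


master = {"first_name": "Jon", "last_name": "Smith", "postal_code": ""}
duplicate = {"first_name": "John", "last_name": "Smith", "postal_code": "94065"}
merged, conflicts = build_merged_record(
    master, [duplicate], overrides={"first_name": "John"}
)
```

Here `conflicts` lists the two competing first names so they can be shown to the steward, and `merged` carries the value the steward selected.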
Data Steward – Duplicate Resolution – Enforce Governance via Merge Rules
- Ability to specify merge rules declaratively using a business rules engine
- Enforces enterprise governance policies without the need for coding
- Agreement rules are a set of merge prevention rules. Running the agreement rules against a merge request produces a decision: approve or reject the request.
- Use cases:
  - If the Surviving Party Tax Profile Type is not the same as the Victim Party Tax Profile Type, the merge cannot proceed.
- Prevents improper data consolidation
- Reduces data steward workload
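The approve/reject decision can be sketched in a few lines. This is plain Python standing in for the decision logic only; in the product the rules are declared in a business rules engine rather than coded, and the function names, the `tax_profile_type` key, and the status strings below are hypothetical.

```python
def tax_profile_types_match(survivor, victim):
    """Use case from the slide: both parties must share a tax profile type."""
    if survivor.get("tax_profile_type") != victim.get("tax_profile_type"):
        return "Surviving and victim party tax profile types differ"
    return None  # no objection from this rule


AGREEMENT_RULES = (tax_profile_types_match,)


def evaluate_merge_request(survivor, victims, rules=AGREEMENT_RULES):
    """Run every rule on every survivor/victim pair; any objection rejects the request."""
    reasons = []
    for victim in victims:
        for rule in rules:
            reason = rule(survivor, victim)
            if reason is not None:
                reasons.append(reason)
    return ("REJECTED", reasons) if reasons else ("APPROVED", [])


survivor = {"party_id": 100, "tax_profile_type": "ORGANIZATION"}
victim_ok = {"party_id": 101, "tax_profile_type": "ORGANIZATION"}
victim_bad = {"party_id": 102, "tax_profile_type": "PERSON"}

decision_ok = evaluate_merge_request(survivor, [victim_ok])
decision_bad = evaluate_merge_request(survivor, [victim_ok, victim_bad])
```

Because a rejected request never reaches a steward's queue, rules of this kind both prevent improper consolidation and reduce manual review.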
Data Steward – Duplicate Resolution – Merge History
- Comprehensive history captured for all customer record operations, including insert, update, delete, and merge/link
- Attribute-level audit trail capability available
- Detailed records of all data management operations to prevent redundant effort and reduce the possibility of mistakes
- Provides a basis for reversal of any changes, if needed
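To show why an attribute-level audit trail gives a basis for reversal, here is a minimal sketch that stores the old and new value for every change and reverses a change by writing the old value back as a new audited entry. The `AuditTrail` class and its field names are hypothetical and are not the hub's actual history tables.

```python
from datetime import datetime, timezone


class AuditTrail:
    """Append-only list of attribute-level changes (illustrative only)."""

    def __init__(self):
        self.entries = []

    def record(self, party, attribute, new_value, changed_by, operation="update"):
        """Apply a change to the party and log the old and new values."""
        entry = {
            "party_id": party["party_id"],
            "operation": operation,
            "attribute": attribute,
            "old_value": party.get(attribute),
            "new_value": new_value,
            "changed_by": changed_by,
            "changed_at": datetime.now(timezone.utc),
        }
        self.entries.append(entry)
        party[attribute] = new_value
        return entry

    def revert(self, party, entry, changed_by):
        """Reverse an earlier change; the reversal is itself audited."""
        return self.record(
            party, entry["attribute"], entry["old_value"], changed_by, operation="revert"
        )


trail = AuditTrail()
party = {"party_id": 7, "city": "Redwood City"}
change = trail.record(party, "city", "Redwood Shores", changed_by="steward_1")
trail.revert(party, change, changed_by="steward_2")
```

Nothing is ever deleted from `entries`, so the history stays complete even after a reversal.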
Data Steward – Data Cleansing Duplicate Resolution – Search
Data Steward – Data Cleansing Duplicate Resolution – Basic Duplicate Resolution Search
Data Steward – Data Cleansing Duplicate Resolution – Advanced Duplicate Resolution Search
Data Steward – Data Cleansing Duplicate Resolution – Duplicate Resolution Search Results
Data Steward – Data Cleansing Duplicate Resolution – Duplicate Resolution Search Results Details
Data Steward – Data Cleansing Duplicate Resolution – Request Selection Process
Data Steward – Data Cleansing Duplicate Resolution – Override System Mapping
Data Steward – Data Cleansing Duplicate Resolution – Reject
Data Steward – Data Cleansing Duplicate Resolution – Duplicate Resolution Statistics
Exercise
- Connect as Data_Steward_Mgr/Welcome1
- Try to complete the resolution process on the duplicates identified by the Batch Duplicate Identification process
Questions & Answers
Data Steward – Data Cleansing Data Enrichment
Questions & Answers
Data Steward – Data Management Party Management
Data Steward – Data Management Party Management - People
CDMD – People Search
CDMD – Basic People Search
CDMD – Advanced People Search
CDMD – Advanced People Search Results (1)
Data Steward – Data Management Party View
Data Steward – Data Management Hierarchies Management
Questions & Answers
THANK YOU!