Lecture 14: Data Repairing
Today’s Agenda
1. Recap and Data Repairing
2. Constraint-based Data Repairing
3. Probabilistic Data Repairing
1. Recap and Data Repairing
What is a Data Error?
Example
Error Detection Taxonomy
Example: No constraint violations, but did we fix the errors?
Data Repairing Taxonomy
How to automate?
Most automatic repairing techniques adopt the “minimality of repairs” principle.
In practice, repairing techniques are manual, or semi-automatic at best.
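The minimality principle can be made concrete with a small sketch. It assumes the common reading of minimality, namely that among candidate repairs we prefer the one that changes the fewest cells. The table, the two candidate repairs, and the helper name `repair_cost` are all hypothetical and exist only for illustration.

```python
def repair_cost(original, repaired):
    """Count the cells whose value differs between the two tables."""
    return sum(
        1
        for row_o, row_r in zip(original, repaired)
        for attr in row_o
        if row_o[attr] != row_r[attr]
    )

# Hypothetical dirty table: the second row has a misspelled city.
original = [
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Madsion"},
    {"zip": "53703", "city": "Madison"},
]

# Two candidate repairs that both make zip -> city hold.
candidate_a = [
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Madison"},
]
candidate_b = [
    {"zip": "53703", "city": "Madsion"},
    {"zip": "53703", "city": "Madsion"},
    {"zip": "53703", "city": "Madsion"},
]

# Minimality prefers the candidate with the fewest changed cells.
best = min([candidate_a, candidate_b], key=lambda r: repair_cost(original, r))
```

Both candidates remove the violation, so the constraint alone cannot tell them apart; the cost is what separates them. A minimal repair is still not guaranteed to be the correct one, which is the concern raised in the earlier example.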
Data Repairs by Value Update
2. Constraint-based Data Repairing
FD-based Repairing
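As a rough illustration of FD-based repairing, the sketch below detects violations of a single functional dependency (here `zip -> city`) and repairs them by value update, using the majority right-hand-side value in each violating group. The majority heuristic is an assumption made for this example, not necessarily the method presented in the lecture, and the function names and data are hypothetical.

```python
from collections import Counter, defaultdict

def fd_violations(rows, lhs, rhs):
    """Return {lhs value: row indices} for groups that disagree on rhs."""
    groups = defaultdict(list)
    for i, row in enumerate(rows):
        groups[row[lhs]].append(i)
    return {
        key: idxs
        for key, idxs in groups.items()
        if len({rows[i][rhs] for i in idxs}) > 1
    }

def repair_by_majority(rows, lhs, rhs):
    """Value-update repair: set rhs to the most frequent value per group."""
    repaired = [dict(r) for r in rows]  # leave the input untouched
    for idxs in fd_violations(rows, lhs, rhs).values():
        majority, _ = Counter(rows[i][rhs] for i in idxs).most_common(1)[0]
        for i in idxs:
            repaired[i][rhs] = majority
    return repaired

rows = [
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Madsion"},
    {"zip": "53703", "city": "Madison"},
    {"zip": "60601", "city": "Chicago"},
]
violations = fd_violations(rows, "zip", "city")
repaired = repair_by_majority(rows, "zip", "city")
```

Handling one FD at a time like this ignores interactions between constraints, which is the motivation for a holistic approach.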
Holistic Constraint-based Repairing
Step 1: Minimal Vertex Cover
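The vertex-cover step can be sketched as follows, assuming a conflict hypergraph whose nodes are cells, written as `(row_id, attribute)`, and whose hyperedges are the sets of cells involved in one constraint violation. Finding a minimum cover is NP-hard in general, so this sketch swaps in a simple greedy heuristic. The function name, the tie-breaking rule, and the example edges are hypothetical.

```python
def greedy_vertex_cover(hyperedges):
    """Greedily pick cells until every violation contains a picked cell."""
    uncovered = [set(e) for e in hyperedges]
    cover = set()
    while uncovered:
        # Count how many uncovered violations each cell participates in.
        counts = {}
        for edge in uncovered:
            for cell in edge:
                counts[cell] = counts.get(cell, 0) + 1
        # Pick the most frequent cell; sorting makes tie-breaking deterministic.
        best = max(sorted(counts), key=counts.get)
        cover.add(best)
        uncovered = [e for e in uncovered if best not in e]
    return cover

# Two violations of zip -> city: (row 0, row 1) and (row 1, row 2).
edges = [
    {(0, "zip"), (0, "city"), (1, "zip"), (1, "city")},
    {(1, "zip"), (1, "city"), (2, "zip"), (2, "city")},
]
cover = greedy_vertex_cover(edges)
```

The cells in the cover are the ones chosen to be changed; the following steps then determine which values those cells should take.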
Step 2: Collect Repair Requirements
Step 3: Get Updates
3. Probabilistic Data Repairing
Lots of Possible Repairs
Probabilistic Data Repairs
Error Detection as a Black Box
A Model for Data Repairs
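To give a feel for what a probabilistic repair model produces, here is a deliberately simple sketch. It assumes the error detector (treated as a black box) has already returned a set of noisy cells, and it estimates a distribution over candidate values for one noisy cell from co-occurrence counts in the remaining cells. This frequency estimate is a stand-in chosen for illustration, not the model from the lecture; `value_distribution` and the data are hypothetical.

```python
from collections import Counter

def value_distribution(rows, noisy_cells, row_id, attr, given):
    """Empirical P(attr = v | given), using only cells not flagged as noisy."""
    context = rows[row_id][given]
    counts = Counter(
        r[attr]
        for i, r in enumerate(rows)
        if i != row_id and r[given] == context and (i, attr) not in noisy_cells
    )
    total = sum(counts.values())
    return {v: c / total for v, c in counts.items()} if total else {}

rows = [
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Madsion"},
    {"zip": "53703", "city": "Madison"},
    {"zip": "53703", "city": "Verona"},
]
noisy_cells = {(1, "city")}  # assumed output of the error detector

dist = value_distribution(rows, noisy_cells, row_id=1, attr="city", given="zip")
most_likely = max(dist, key=dist.get)
```

Instead of committing to a single repair, the output is a distribution over candidate values, so a downstream step can pick the most likely value or defer low-confidence cells to a human.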
Scaling Inference
Relaxing Constraints
Data Repairing as an ML Problem