A Method for the Comparison of Criminal Cases using digital documents A New Distance Measure T.K. Cocx, tcocx@liacs.nl 2/26/2019
2/26/2019 T.K. Cocx, tcocx@liacs.nl
2/26/2019 T.K. Cocx, tcocx@liacs.nl
Comparing Documents Data mining: the search for knowledge in large amounts of data. Data: digital documents found on crime scene or fabricated by police describing the crime scene Knowledge: what crime labs may be setup by the same group of criminals Data mining tools: Text mining: extraction of entities from documents Distance measure on extracted output: document similarity Visualization: clustering of documents on screen 2/26/2019 T.K. Cocx, tcocx@liacs.nl
Coupled investigation table 4-step paradigm Documents Extraction table Investigation Amount Type Entity Text mining Coupled investigation table In Common Investigation 2 Investigation 1 Transformation Distance Matrix 0.92 0.27 … 0.51 2 1 Distance Measure Clustering Visualization 2/26/2019 T.K. Cocx, tcocx@liacs.nl