Temporal Analysis of Platelet Data in Chronic Viral Hepatitis Dataset Shoji HiranoShusaku Tsumoto Department of Medical.

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

Agency for Healthcare Research and Quality (AHRQ)
Random Forest Predrag Radenković 3237/10
An Introduction to Multivariate Analysis
Fast Algorithms For Hierarchical Range Histogram Constructions
Iterative Optimization and Simplification of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial.
G. Alonso, D. Kossmann Systems Group
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Journal Club Alcohol and Health: Current Evidence March-April 2007.
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
CMPT 855Module Network Traffic Self-Similarity Carey Williamson Department of Computer Science University of Saskatchewan.
Journal Club Alcohol and Health: Current Evidence March-April 2005.
Iterative Optimization of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial Intelligence.
Iterative Optimization and Simplification of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Overview of Data Quality Issues in MICS.
Clustering analysis workshop Clustering analysis workshop CITM, Lab 3 18, Oct 2014 Facilitator: Hosam Al-Samarraie, PhD.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Copyright © 2011 Pearson Education, Inc. All rights reserved. Doing Research in Behavior Modification Chapter 22.
Similarity searching modell with Excel Zoltán Varga PhD student SZIU.
Severity Index for patients with Liver Cirrhosis disease Health Decision Analysis Yokabid Worku
Measuring disease and death frequency
POSTER TEMPLATE BY: om Sex Differences in Associations between Fear of Negative Evaluation (FNE) and Substance Use Lesley A.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Alignment and classification of time series gene expression in clinical studies Tien-ho Lin, Naftali Kaminski and Ziv Bar-Joseph.
A Prospective Study of the Impact of Bladder Incontinence Surgery on Sexual Satisfaction K. Witzke, DO, Gregory McIntosh, DO, FACOS, Jeffrey Schock, DO,
Evidence Based Medicine
Presented by Tienwei Tsai July, 2005
Analysis of Constrained Time-Series Similarity Measures
Research Methods.
Lecture 11. Microarray and RNA-seq II
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Study of cytokine gene polymorphism and graft outcome in live-donor kidney transplantation By Rashad Hassan MD Amgad El-Agroudy, Ahmad Hamdy, Amani Mostafa.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
1 Virtual COMSATS Inferential Statistics Lecture-16 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Discovery of Temporal Patterns in Course-of-Disease Medical Data Jorge C. G. Ramirez Ph.D. Candidate Lynn L. Peterson and Diane J. Cook Supervising Professors.
The Dirichlet Labeling Process for Functional Data Analysis XuanLong Nguyen & Alan E. Gelfand Duke University Machine Learning Group Presented by Lu Ren.
Obesity, Medication Use and Expenditures among Nonelderly Adults with Asthma Eric M. Sarpong AHRQ Conference September 10, 2012.
Page 1 Inferring Relevant Social Networks from Interpersonal Communication Munmun De Choudhury, Winter Mason, Jake Hofman and Duncan Watts WWW ’10 Summarized.
ECML/PKDD-2005 Workshop on Discovery Challenge Guide to the Hepatitis Dataset Shusaku TsumotoShoji Hirano Department.
Clustering of Trajectory Data obtained from Soccer Game Record -A First Step to Behavioral Modeling Shoji Hirano Shusaku Tsumoto
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Effects of emotion induced by music on experimental pain Roy, Mathieu (1), Peretz, Isabelle (1), Rainville, Pierre (2) (1) Département de psychologie,
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
An Eyetracking Analysis of the Effect of Prior Comparison on Analogical Mapping Catherine A. Clement, Eastern Kentucky University Carrie Harris, Tara Weatherholt,
Intermittency Analysis and Spatial Dependence of Magnetic Field Disturbances in the Fast Solar Wind Sunny W. Y. Tam 1 and Ya-Hui Yang 2 1 Institute of.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Hierarchical clustering approaches for high-throughput data Colin Dewey BMI/CS 576 Fall 2015.
An unsupervised conditional random fields approach for clustering gene expression time series Chang-Tsun Li, Yinyin Yuan and Roland Wilson Bioinformatics,
The Natural History of Liver Fibrosis Progression Rate in Hepatitis C Infection David Yamini, Benjamin Basseri, Anush Arakelyan, Pedram Enayati, Tram T.
Henry Masur, MD Bethesda, Maryland
Rapid Fibrosis and Significant Histologic Recurrence of Hepatitis C After Liver Transplant Is Associated With Higher Tumor Recurrence Rates in Hepatocellular.
Adefovir Suppresses HBV DNA Levels in Lamivudine-Resistant HIV/HBV Patients Slideset on: Benhamou Y, Thibault V, Vig P, et al. Safety and efficacy of adefovir.
More complex (multidimensional) methods
Types of Research Studies Architecture of Clinical Research
Novel Serum Markers of Fibrosis Progression for the Follow-Up of Hepatitis C Virus- Infected Patients  Frédérique Caillot, Martine Hiron, Odile Goria,
SMOKING AND HISTOLOGICAL STAGE OF CHRONIC VIRAL HEPATITIS
Long-term impact of response to interferon-based therapy in patients with chronic HCV in relation to liver function, survival and cause of death Philip.
Hierarchical clustering approaches for high-throughput data
I don’t need a title slide for a lecture
Winter 2018 CISC101 12/2/2018 CISC101 Reminders
Data Mining – Chapter 4 Cluster Analysis Part 2
Differences in the temporal dynamics of daily activity between chronic pain patients and healthy controls P. Montoya1, P. Geha2, M. Baliki2, A. V. Apkarian2,
InferentIal StatIstIcs
CPSC 641: Network Traffic Self-Similarity
Presentation transcript:

Temporal Analysis of Platelet Data in Chronic Viral Hepatitis Dataset Shoji HiranoShusaku Tsumoto Department of Medical Informatics, Shimane University, School of Medicine, Japan

Outline Introduction Cluster analysis system for time-series medical data Experimental results 1. Cluster analysis of platelet (PLT) sequences 2. PLT-based analysis of the progress speed of liver fibrosis Conclusion & Future Work

Introduction Platelet (PLT) decrease in chronic viral hepatitis The liver plays an important role in generating platelets Bleeding is often observed on the patients at the terminal stage of liver cirrhosis Liver fibrosis -> dysfunction -> PLT decrease -> bleeding Use of PLT count as an noninvasive index for evaluating liver fibrosis [F0 (no fibrosis), F1, F2, F3, F4 (liver cirrhosis)] PLT counts correlate with fibrotic stage (Matsuura et al., J. Viral Hepat. 2000), according to PLT counts at the time of biopsy However, Time-series analysis of the PLT data have rarely been performed Features of temporal patterns/courses have not been examined

Introduction (cont ’ d) Difficulties in time series comparison Irregular sampling intervals, irregular sequence length Existence of events with different length and phase Difficult to find the corresponding pairs of points be compared 3-6M 4 years 1-2 Weeks IFN 6m 9 years

Cluster Analysis System 4 years 1-2 weeks PLT IFN 6 m 9 years IFN 6 m Seq 1 Seq 2 Global Local Scale Similar Match Structural Dissimilarities Time-series Medical Data Multiscale Comparison Cluster representation of time series Grouping

Experiment 1: Cluster Analysis of PLT Sequences Objective Discovery of common temporal courses in platelet sequences Discovery of relations between platelet courses and fibrotic stages Used data Chronic hepatitis dataset provided by Chiba University Total 488 cases (Type B: 193, Type C w/o IFN: 99, Type C with IFN: 196) (Male 343, Female 145 (145:48, 61:38, 137:59)) Procedure Generate a dissimilarity matrix using the modified multi-scale matching Generate a dendrogram from the matrix using agglomerative hierarchical clustering (average linkage) and perform cluster analysis

Experiment 1 Cluster Constitution w.r.t. Fibrotic Stages Type C, IFN (total 196 cases) ※ clusters of N < 3 omitted F4 Major F1 Major

Experiment 1 Grouped Sequences F4, F3 major clusters (Type C, IFN) Cluster 5: N=11 (0/2/1/2/6) Normal H (350 x10^3 /ul ) Normal L (120 x10^3 /ul ) Cluster 8: N=40 (0/9/6/9/16) ※ first 16 cases Y Number of cases (F0/F1/F2/F3/F4) Male 10, Female 1 Male 27, Female 13

Experiment 1 Grouped Sequences F1, F2 major clusters (Type C, IFN) Cluster 11: N=46 (2/24/9/9/3)Cluster 12: N=42 (1/22/10/6/3) ※ First 16 cases Y Male 34, Female 12 Male 32, Female 10

Experiment 1 Grouped Sequences Clusters containing remarkable increase/decrease cases Cluster 4: N=4 (0/3/1/0/0) Cluster 6: N=3 (0/1/0/1/1) Cluster 10: N=5 (0/1/0/1/3) Y Male 2, Female 2 Male 2, Female 1 Male 4, Female 1

Experiment 1 Relationships between PLT and GPT F4, F3 major clusters (Type C, IFN) Cluster 5: N=11 (0/2/1/2/6) Normal H (350 x10^3 /ul ) Normal L (120 x10^3 /ul ) Y Normal H (40 IU/L) Normal L (7 IU/L) Y PLT GPT

Experiment 1 Relationships between PLT and GPT F1, F2 major clusters (Type C, IFN) Cluster 11: N=46 (2/24/9/9/3) Y PLT GPT ※ First 16 cases

Experiment 1 Relationships between PLT and GPT Clusters containing remarkable increase/decrease cases Cluster 4: N=4 (0/3/1/0/0) Cluster 6: N=3 (0/1/0/1/1) Cluster 10: N=5 (0/1/0/1/3) Y PLT GPT PLT GPT PLT

Experiment 1 Summary Fibrotic stage and PLT course F4/F3 major clusters Lower PLT level, below the normal range Decreasing, or flat courses Some F1/F2 cases may follow similar courses (ex. Type C IFN cluster 5) F1/F2 major clusters Moderate PLT level within the normal range (ex. Type C with IFN clusters 11, 12, 23) Mildly decreasing, or flat courses PLT level may correspond to fibrotic stages (ex. Type C with IFN clusters 23 > 12 > 11 )

Experiment 1 Summary Fibrotic stage and PLT course Fast increase/decrease cases Some cases rapidly descent from normal range regardless of fibrotic stage (Type C with IFN clusters 4,6) Some cases rapidly ascent toward normal range (after IFN therapy)(Type C with IFN cluster 10) Relationships between PLT and GPT Chronically high GPT level may induce the descent of PLT, progressing the fibrotic stage Difficult interpretation of GPT patterns; high level followed by flat low level Cured cases or terminal cases ?

Experiment 2: PLT-based Analysis of Progress Speed of Liver Fibrosis Assumption: Diminishing PLT (below the normal low limit; NL) for long time -> F4 stage Using PLT-based stage estimation and biopsy information, we analyzed Years for reaching F4 stage Years elapsed between stages 1Y Normal Low (NL) Normal High (NH) PLT diminishing below the NL MID 466 (7.92Y) F4(PLT) F1(biopsy)

Experiment 2 Preprocess: Sequence Selection Excluded sequences 1. No biopsy information 2. Short: # of exam < 3, or duration of exam < 2Y 3. Inhomogeneous: SD of exam intervals > 1Y Interval 14 Y Inhomogeneous seq. Interval 1M MID 215 Normal Low (NL) Normal High (NH)

Experiment 2 Preprocess: Sequence Smoothing Removal of short-term changes Convolution with Gaussian kernel with 6M width(  =2.8)

Experiment 2 Discrimination of Diminishing Cases Criteria 1. Keeps diminished PLT counts (below the normal level) at least for 6 month 2. Once diminished, never maintain normal PLT level for 6 month at any time until the last examination 1Y 6 M Diminished Case MID 466 Normal Low (NL) Normal High (NH)

Experiment 2 Discrimination of Diminished Cases Criteria 1. Keeps diminished PLT count (below the normal level) at least for 6 month 2. Once diminished, never maintain normal PLT level for 6 month at any time until the last examination 6 M Not Diminished Case 6 M Recovery MID 37 Normal Low (NL) Normal High (NH)

Experiment 2 Selection Results PLT data exist, but no biopsy information available # of exam < 3, or Duration of exam < 2Y SD of exam interval > 1Y Not diminished below NL Excluded from Analysis Subject for Analysis Selection Results (Total 720 Cases) Numbers in (): ratio (%)

Experiment 2 Years for Reaching F4: Two Measures First-exam basis:Years from the first PLT examination First-biopsy basis : Years from the first biopsy Fibrotic stage at starting date: stage observed at the first biopsy Years = 0 if the date for diminishment is earlier than the first biopsy Diminished date (below NL) F4 (PLT) First biopsy F1 (biopsy) First PLT exam F1 (stage at first biopsy) First-exam basis First-biopsy basis MID 216 Normal Low (NL) Normal High (NH)

Experiment 2 Years for Reaching F4: Summary for 97 Diminishing Cases (First-exam basis)

Experiment 2 Years for Reaching F4: Summary for 97 Diminishing Cases (First-biopsy Basis)

Experiment 2 Years Elapsed between Stages: Summary for 97 Diminishing Cases (First-biopsy Basis)

Experiment 2 Summary PLT-based estimation about years for reaching F4 and years elapsed between stages Assumption: Diminishing PLT (below NL) over long time -> F4 stage Keeps diminished PLT below NL at least for 6 month Cannot maintain normal PLT level over 6 month at any time from the time it diminished below NL to the last examination Analyzed a total of 97 diminished cases (31 Type B, 40 Type C with IFN, 26 Type C w/o IFN cases) Years for F4:2.95 Y (av. 97 cases, first-exam basis) 2.22 Y (av. 97 cases, first-biopsy basis) Years between stages: 1.60 Y/stage ( av. 97 cases, first-biopsy basis)

Conclusion Presented the results of temporal analysis of platelet data in chronic viral hepatitis dataset Findings: Temporal courses of PLT might be classified into some patterns according to their levels and trends which might be further related to fibrotic stages. In some exacerbating cases, liver fibrosis may proceed a few times faster than the natural courses. Further validation of the clinical reasonability of the results Application of the analysis scheme on other items/datasets. Future Work

Cluster Constitution w.r.t. Fibrotic Stages Type C w/o IFN (total 99 cases)

Grouped Sequences F4, F3 Major Clusters (Type C, w/o IFN) Cluster 3: N=47 (1/27/6/7/6) ※ First 16 cases Cluster 1: N=22 (0/4/4/5/9) ※ First 16 cases Y Male 14, Female 8 Male 27, Female 20

Relationships between PLT and GPT F4, F3 major clusters (Type C, w/o IFN) Cluster 1: N=22 (0/4/4/5/9) ※ First 16 cases Y PLT GPT ※ First 16 cases