Top 40 Motifs from Artificial Book with Different Masking Ratios


Dataset: 128 pages of our artificial book (14-segment display). Note that the book contains 12,800 random characters.

Top 40 Motifs when pmask=60%

Top 40 Motifs when pmask=50%

Top 40 Motifs when pmask=40%

Top 40 Motifs when pmask=30%

Top 40 Motifs when pmask=20%

Average Motif Distance on the 128-Page Dataset

Column groups: Top 10 Motifs, Top 20 Motifs, Top 40 Motifs
Column headers: Masking Ratio, Total Dist, Error

Masking ratio 60: 45.5, 1.00, 101.0, 223.5
Masking ratio 50: 46.0, 0.83, 103.0, 1.01, 226.5, 1.02
Masking ratio 40: 46.5, 0.67, 104.5, 230.5, 1.03
Masking ratio 30: 49.0, 0.50, 109.5, 1.08, 246.0
Masking ratio 20: 52.5, 0.33, 119.0, 1.15, 265.0, 1.18

Total Random Distance
10 pairs: 400
20 pairs: 800
40 pairs: 1600

Do you want me to repeat the plot for different dataset sizes? I am not sure whether it makes sense to plot different sizes in the same figure.

Effect of Masking Ratio on Average Distance

[Figure: two panels, each plotted against Number of pages (2 to 512), with one series per masking ratio (Mask 60%, Mask 50%, Mask 40%, Mask 30%, Mask 20%). First panel: "Average Distance of Top 20 Motifs", y-axis Average Distance. Second panel: "Average Distance on Different Masking Ratio", y-axis Ratio to 60% masking ratio. The figure also carries the annotation "Baseline at 60%".]
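The "Ratio to 60% masking ratio" quantity can be illustrated with the 128-page totals from the table above. The following Python sketch assumes that the first, second, and third large value in each table row are the total distances for the top 10, top 20, and top 40 motifs, and that the ratio is simply a total divided by the corresponding total at 60% masking. The names `total_distance` and `ratio_to_baseline` are hypothetical, introduced only for this illustration.

```python
# Total distances per masking ratio (%), read from the 128-page table.
# Assumption: tuple order is (top 10, top 20, top 40) motifs.
total_distance = {
    60: (45.5, 101.0, 223.5),
    50: (46.0, 103.0, 226.5),
    40: (46.5, 104.5, 230.5),
    30: (49.0, 109.5, 246.0),
    20: (52.5, 119.0, 265.0),
}


def ratio_to_baseline(table, baseline=60):
    """Divide every total by the total at the baseline masking ratio."""
    base = table[baseline]
    return {
        mask: tuple(round(value / b, 2) for value, b in zip(values, base))
        for mask, values in table.items()
    }


ratios = ratio_to_baseline(total_distance)
for mask in sorted(ratios, reverse=True):
    print(f"mask {mask}%: {ratios[mask]}")
```

Under these assumptions the totals grow by less than 20% when the masking ratio drops from 60% to 20%.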

Effect of Parameters on Avg Distance

Top 20 Motifs when pmask=60%

Top 14 Motifs from Brute Force: run through all sliding windows
Avg distance = 7.5
Poor quality
Very slow
Running time:
1 page: 1 hour+
2 pages: 4.5 hours
4 pages: ~18 hours
8 pages: ~70 hours (3 days)

Top 20 Motifs when pmask=60%
Avg distance = 15.50
Worse distance
Higher quality motifs
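The brute-force running times above grow about fourfold each time the page count doubles, which is what one would expect if every sliding window is compared with every other. A minimal Python sketch of that arithmetic follows. It treats "1 hour+" as 1.0 hour and the "~" values as exact, both of which are simplifications, and the names `running_time_hours` and `doubling_exponents` are hypothetical.

```python
import math

# (pages, hours) as reported on the slide.
# Assumption: "1 hour+" is taken as 1.0 and "~" values are taken as exact.
running_time_hours = [(1, 1.0), (2, 4.5), (4, 18.0), (8, 70.0)]


def doubling_exponents(samples):
    """Estimate k in time ~ pages**k between consecutive measurements."""
    exponents = []
    for (p0, t0), (p1, t1) in zip(samples, samples[1:]):
        exponents.append(math.log(t1 / t0) / math.log(p1 / p0))
    return exponents


exponents = doubling_exponents(running_time_hours)
for (p0, _), (p1, _), k in zip(running_time_hours, running_time_hours[1:], exponents):
    print(f"{p0} -> {p1} pages: exponent {k:.2f}")
```

Each estimated exponent is close to 2, consistent with quadratic growth in the number of pages.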

Top 20 Motifs from Brute Force: run only on potential windows
Avg distance = 14.60
Running time is not bad

Top 20 Motifs when pmask=60%
Avg distance = 15.50
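As a rough illustration of what a brute-force search over a set of candidate windows involves, the Python sketch below compares every pair of windows and keeps the k closest pairs. The slides do not state the window representation or the distance measure, so equal-length binary tuples and Hamming distance are assumptions made here, and the function names are hypothetical. The sketch also does not exclude overlapping windows, which a real motif search would normally do.

```python
import heapq
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length windows differ."""
    return sum(x != y for x, y in zip(a, b))


def top_k_motif_pairs(windows, k):
    """Return the k closest pairs as (distance, i, j), comparing all pairs."""
    scored = (
        (hamming(windows[i], windows[j]), i, j)
        for i, j in combinations(range(len(windows)), 2)
    )
    return heapq.nsmallest(k, scored)


def average_distance(pairs):
    """Mean distance over a list of (distance, i, j) tuples."""
    return sum(d for d, _, _ in pairs) / len(pairs)


# Toy candidate windows (assumed binary); a real run would use image windows.
windows = [(0, 0, 1, 1), (0, 0, 1, 0), (1, 1, 0, 0), (1, 1, 0, 1)]
best = top_k_motif_pairs(windows, k=2)
print(best, average_distance(best))
```

With n candidate windows the search performs n(n-1)/2 comparisons, so passing only potential windows instead of all sliding windows reduces the work quadratically.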