Machine Learning & Bioinformatics
Tien-Hao Chang (Darby Chang)



What is machine learning?

K-Nearest-Neighbors (KNN)
A very trivial machine learning tool
The predicted class of the query sample depends on the voting among its k nearest neighbors
[Figure: samples of class O and class X scattered around a query sample marked ?]

When k = 3
[Figure: the same samples; the query sample is now labeled O]

When k = 5
[Figure: the same samples; the query sample is now labeled X]
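The voting on the last two slides can be sketched in a few lines of Python. This is only an illustration: the function name `knn_predict`, the use of Euclidean distance, and the coordinates in `TRAINING` are assumptions made up for this example (the slides do not give the actual positions of the O and X samples). The toy data is arranged so that the predicted class changes between k = 3 and k = 5.

```python
import math
from collections import Counter

def knn_predict(training, query, k):
    """Predict the class of `query` by majority vote among its k nearest
    training samples (Euclidean distance, an assumption of this sketch)."""
    # Sort the known instances by distance to the query and keep the k closest.
    neighbors = sorted(training, key=lambda s: math.dist(s[0], query))[:k]
    # Count the labels of those neighbors; the most frequent label wins.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical 2-D samples: (feature vector, class). Not taken from the slides.
TRAINING = [
    ((0.5, 0.0), "O"),
    ((0.0, 0.7), "O"),
    ((1.0, 0.0), "X"),
    ((0.0, -1.2), "X"),
    ((-1.5, 0.0), "X"),
    ((3.0, 3.0), "O"),
    ((-3.0, 3.0), "X"),
]
QUERY = (0.0, 0.0)

print(knn_predict(TRAINING, QUERY, 3))  # two O and one X among the 3 nearest
print(knn_predict(TRAINING, QUERY, 5))  # two O and three X among the 5 nearest
```

With two classes, an odd k avoids tied votes, which is one reason 3 and 5 are convenient choices.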

Although KNN is very trivial, it can:
Example: in vitro fertilization
–Given: embryos described by 60 features
–Problem: selection of embryos that will survive
–Data: historical records of embryos and outcome
Given a set of known instances
Predict the outcome for newly arriving instances
So, KNN learnt something related to the definition of embryo goodness

Can machines really learn?
Notice that here we call KNN a machine
Definitions of learning from the dictionary:
–To get knowledge of by study, experience, or being taught
–To become aware by information or from observation
–To commit to memory
–To be informed of, ascertain; to receive instruction
Operational definition:
–Things learn when they change their behavior in a way that makes them perform better in the future
Difficult to measure
Trivial for computers
Does a slipper learn?

In short, machine learning is
Machine: e.g. KNN
Training data: a set of known instances
Testing data: a query instance
Outcome: class of the query instance
Knowledge/Information

Furthermore, learning is
Machine: e.g. KNN
Training data: a set of known instances
Testing data: a query instance
Outcome: class of the query instance
Knowledge/Information
When the training data increases, the machine delivers a better outcome (e.g. higher accuracy)

Usually, we don't reinvent the wheel
Machine: e.g. KNN
Training data: a set of known instances
Testing data: a query instance
Outcome: class of the query instance
Knowledge/Information
Converting data (e.g. embryos) to vectors is not trivial

Feature

Data representation
Format (for LIBSVM)
–1 1: :0.25 3: :
–though this is for LIBSVM, a famous implementation of the support vector machine (SVM), all other machine learning tools share the same concept
–in the example line, the first field is the Label and the remaining index:value fields are the Feature
Label is also called the answer or class of a sample
Feature is also called features or feature vector
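As an illustration of the label/feature split, here is a minimal sketch that reads one line in the LIBSVM text format, where a line has the general form `<label> <index>:<value> <index>:<value> ...` with 1-based feature indices and omitted indices meaning zero. The function names `parse_libsvm_line` and `to_dense` and the sample line are hypothetical; the numbers are invented for this example and are not the values from the slide.

```python
def parse_libsvm_line(line):
    """Split one LIBSVM-format line into (label, sparse feature dict)."""
    parts = line.split()
    label = float(parts[0])            # first field: the label (answer/class)
    features = {}
    for item in parts[1:]:             # remaining fields: index:value pairs
        index, value = item.split(":")
        features[int(index)] = float(value)
    return label, features

def to_dense(features, dimension):
    """Expand a sparse {index: value} dict into a dense feature vector.
    LIBSVM indices start at 1; missing indices are treated as 0."""
    return [features.get(i, 0.0) for i in range(1, dimension + 1)]

# Hypothetical sample line (values invented for illustration).
label, features = parse_libsvm_line("+1 1:0.5 2:0.25 4:-1")
print(label)                  # the label
print(to_dense(features, 4))  # the feature vector, index 3 filled with 0.0
```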

Label and feature
Label is defined by the experts
–usually biologists in bioinformatics
Data representation is also called feature encoding or feature extraction
–you may not know which feature is important
–you may not have the key feature
–you need domain knowledge to design good features
–if you don't design new algorithms (most researchers don't), the only thing you can do is to design new features

Evaluation

Evaluation issues
Recall that in the KNN algorithm, predicting the classes of query samples requires comparing the query samples to a collection of reference samples whose classes are known
This collection is called the training set and these reference samples are called training samples
When evaluating, we need to know the classes of the query samples so that we can compare the answers and the predictions
These query samples with known classes are called the testing set or testing samples
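Comparing the answers and the predictions on a testing set can be as simple as counting matches. The sketch below computes accuracy, one common evaluation measure; the function name `accuracy` and the example labels are assumptions for illustration, not part of the course material.

```python
def accuracy(answers, predictions):
    """Fraction of testing samples whose predicted class equals the answer."""
    if len(answers) != len(predictions):
        raise ValueError("answers and predictions must have the same length")
    if not answers:
        raise ValueError("the testing set is empty")
    # Count the positions where the prediction matches the known class.
    correct = sum(a == p for a, p in zip(answers, predictions))
    return correct / len(answers)

# Hypothetical testing set of four samples: three predictions are correct.
answers = ["O", "X", "X", "O"]
predictions = ["O", "X", "O", "O"]
print(accuracy(answers, predictions))  # 0.75
```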

The answer of the query is theoretically unnecessary
Actually, it should not exist; otherwise we would not need to predict. However, we always need to evaluate our methods/features, and thus we always have the answers of the testing set in this course.

Sample arrangement
How to split n samples, whose classes are known, into training and testing sets?
It gets worse if the algorithm has parameters
–is KNN a method?
–are 3NN and 5NN different methods?
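One common arrangement, shown here only as a sketch, is a random hold-out split: shuffle the n samples and set a fixed fraction aside as the testing set. The function name `split_samples`, the 20% default, and the fixed seed are assumptions chosen for this example; the slide does not prescribe a particular scheme.

```python
import random

def split_samples(samples, test_fraction=0.2, seed=0):
    """Shuffle the samples and split them into (training set, testing set)."""
    shuffled = list(samples)                # copy so the input is not modified
    random.Random(seed).shuffle(shuffled)   # seeded for a reproducible split
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

# Ten hypothetical sample IDs: 8 go to training, 2 to testing.
training, testing = split_samples(range(10), test_fraction=0.2, seed=1)
print(len(training), len(testing))
```

If a parameter such as k is tuned, a usual precaution is to choose it using the training samples only, so that the testing set still measures performance on samples the method has never seen.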

Today's exercise

Single-class prediction
Design your own select, feature, buy and sell programs. Upload and test them in our simulation system. Finally, commit your best version before 23:59 10/1 (Mon).