Lesson Objectives Aims From the spec:

Slides:



Advertisements
Similar presentations
Lindsey Bleimes Charlie Garrod Adam Meyerson
Advertisements

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Research Away Day Algorithms and Complexity Themes.
Analysis of Algorithms CS 477/677
Why teach coding?.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
New Mexico Computer Science For All Introduction to Algorithms Maureen Psaila-Dombrowski.
 1  Outline  stages and topics in simulation  generation of random variates.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
Algorithms and their Applications CS2004 ( ) Dr Stephen Swift 1.2 Introduction to Algorithms.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Fuzzy Genetic Algorithm
PROCESSING, ANALYSIS & INTERPRETATION OF DATA
Problem of the Day  I am thinking of a question and propose 3 possible answers. Exactly one of the following is the solution. Which is it? A. Answer 1.
Applications of Genetic Algorithms TJHSST Computer Systems Lab By Mary Linnell.
CS 127 Introduction to Computer Science. What is a computer?  “A machine that stores and manipulates information under the control of a changeable program”
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
Computer Science Background for Biologists CSC 487/687 Computing for Bioinformatics Fall 2005.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
What’s going on here? Can you think of a generic way to describe both of these?
Introduction to Machine Learning, its potential usage in network area,
Computer Hardware What is a CPU.
Animals Including Humans
A Level Computer Science
Excursions in Modern Mathematics Sixth Edition
Component 1.6.
OPERATING SYSTEMS CS 3502 Fall 2017
Lesson Objectives Aims Key Words
Chapter 7. Classification and Prediction
Lesson Objectives Aims
LECTURE 01: Introduction to Algorithms and Basic Linux Computing
Data Structures Lab Algorithm Animation.
Finite State Machines Dr K R Bond 2009
Lesson Objectives Aims Key Words
Computational Thinking, Problem-solving and Programming: General Principals IB Computer Science.
Algorithms - A technical perspective Are they really a black box?
Creating Functions with Parameters
Behavioral Design Patterns
Done Done Course Overview What is AI? What are the Major Challenges?
Lesson Objectives Aims – Know about…
Genomic Data Clustering on FPGAs for Compression
Exploring Computer Science
Reading: Pedro Domingos: A Few Useful Things to Know about Machine Learning source: /cacm12.pdf reading.
Unsolvable Problems December 4, 2017.
Looping and Random Numbers
AP CSP: Bytes, File Sizes, and Text Compression
Animals Including Humans
Chapter 15 QUERY EXECUTION.
Translators & Facilities of Languages
Print slides for students reference
Heuristics Definition – a heuristic is an inexact algorithm that is based on intuitive and plausible arguments which are “likely” to lead to reasonable.
Objective of This Course
Human Complexity of Software
Lesson Objectives Aims You should be able to:
Scientific Inquiry Unit 0.3.
Lesson Objectives Aims You should be able to:
Theory of Computation Turing Machines.
Lesson Objectives Aims You should know about: – Web Technologies
4. Computational Problem Solving
Introduction to Stream Computing and Reservoir Sampling
RESEARCH BASICS What is research?.
Overview of Workflows: Why Use Them?
Timing analysis research
Applying principles of computer science in a biological context
WJEC GCSE Computer Science
Lab 8: GUI testing Software Testing LTAT
U3L8 Creating Functions with Parameters
Work & Machines Thinking Questions.
Pointer analysis John Rollinson & Kaiyuan Li
Work & Machines Thinking Questions.
Presentation transcript:

Lesson Objectives Aims From the spec: Learners should apply their knowledge of: Backtracking Data Mining Heuristics Performance Modelling Pipelining Visualisation Thinking abstractly

This lesson This will take a couple of lessons It is exam question heavy Some questions you should be able to answer from what you’ve done so far – computational thinking

Backtracking An algorithmic approach to problem solving A set of rules have been defined (the algorithm) and these form a path The path is followed, if a rule fails on a piece of data, then the algorithm returns to the last known good point

Data Mining “Big data” is a big problem and massive opportunity at the same time Modern systems enable us to collect and analyse extraordinary amounts of data Usually from multiple sources Great examples are google, facebook and twitter – why?

Data mining can be used to find relationships between seemingly unrelated data Applies to data with dissimilar structures May throw up unexpected correlations

Applications Supermarkets Insurance companies Business Modelling Disease pattern modelling

Methodology The method of searching through seemingly unconnected data Methods are “pattern matching” and “anomaly detection” May include some method of correlation calculation Does not need a pre-determined “matching criteria” – it’s job is to find what matches! Only possible because of modern processing power on high speed machines (a brute force approach)

Exam Questions – Data mining

Heuristics Many problems in computer science are either unsolvable or would take too long to solve in a sensible amount of time. Sat Nav is a classic example – how do you know you’re taking the best, fastest or shortest route without analysing ALL roads?

Heuristics provide a solution by providing a “best fit” or “good enough” answer. This will solve problems much more quickly than an “exact” or brute force Can also find an approximate solution when there is no exact solution available

Heuristics are a trade off: Optimality Completeness Accuracy Precision The outcome of these trade offs are: Speed A solution that is “good enough”

Solving a problem “heuristically” Gather all relevant data, but not necessarily ALL data Data gathered should be the most likely to help in the given situation Make judgements based on rules or previous experiences

Example – Virus scanning Needs to happen quickly Not all viruses are know about! Looks for code or behaviour that is similar to known viruses (classes) Has different rules depending on virus type or family If patterns match or are similar then it infers a virus Can work for new/unidentified viruses

Exam Questions - Heuristics

9 Marks

We’ve seen this in CPU’s Pipelining We’ve seen this in CPU’s The output of one task is the input of another Enables jobs to be queued and also run in parallel to increase productivity or throughput Used in: CPU (RISC especially) Command line systems (Linux Pipe command) Task Scheduling

In real use… To pipeline Identify processes that MUST run in sequence Identify processes that may run in parallel Identify when processes must converge (“obligatory sequence”) What must happen before another process can start? E.g. Underwear must be on before trousers.

Exam Question - Pipelining

Answers

Performance Modelling Modelling or simulating real world objects or situations For the purpose of: Finding out effectiveness Tweaking and changing parameters to observe their effects Safety Experimentation Finding optimal solutions

Simply converting raw data into a visual form. Examples: Visualisation Simply converting raw data into a visual form. Examples: Graphs Heat Maps Mapping (in general) Touch maps (football?) A much more powerful method of interpreting data

Question 2 – Comp. thinking

Answers

Exam Questions - Abstraction

Review/Success Criteria You should know Lots of things.