CSIS-400: Bioinformatics Dr. Eric Breimer. Good News & Bad News Remind me to tell you the good news.

Slides:



Advertisements
Similar presentations
IT253: Computer Organization Lecture 6: Assembly Language and MIPS: Programming Tonga Institute of Higher Education.
Advertisements

The Scientific Method. Science relies on facts, not opinions. is not static or fixed. Science is a process that changes as observations & measurements.
Economics 1 Principles of Microeconomics Instructor: Ted Bergstrom.
CS 197 Computers in Society Fall, Welcome, Freshmen!
Exit A key to the successful Science Fair Project A key to the successful Science Fair Project Goal The Researcher’s Strategy Results Method Hypothesis.
My Policies and Some Advice for Doing Well in this Course.
CSIS-385: Algorithm Analysis ► I am Dr. Breimer ► Br-Eye-mer ► I like ► Office hours: 1:35-2:30 everyday ► Drop by randomly at your own risk.
CSC 212 – Data Structures Prof. Matthew Hertz WTC 207D /
This material in not in your text (except as exercises) Sequence Comparisons –Problems in molecular biology involve finding the minimum number of edit.
My Policies and Some Advice for Doing Well in this Course.
CSIS-116: Survey of Information Technology Dr. Eric Breimer.
Binary Arithmetic Math For Computers.
Models and Designs Investigation 1.  Label your new section Models and Designs  Draw pictures of a “model” and “design”
A single, one slide introduction Appropriate for first day.
What do you think it means… if I told you that learning about idioms is a piece of cake? But, how did you know what a piece of cake means? You’re right!
Welcome to Biology 102! Please pick up a syllabus (if you don’t have one yet) and a clicker at the front desk. You will need to rent a clicker from the.
CSCI 200 Introduction To Programming with Visual Basic Bob Bradley.
Please open Daily Quiz 34. A scientific calculator may be used on this quiz. You can keep your yellow formula sheets out when you take the quiz. Remember.
1 CSE 1341 Southern Methodist University Lyle School of Engineering Computer Science and Engineering Dept. Fall 2013 Don Evans.
BIT 115: Introduction To Programming1 Sit in front of a computer Log in –Username: 230class –password: –domain: student Bring up the course web.
COMP 111 Programming Languages 1 First Day. Course COMP111 Dr. Abdul-Hameed Assawadi Office: Room AS15 – No. 2 Tel: Ext. ??
Student Forum March5, pm - Collaborate Students will share their thoughts on topics including: --experiences with online courses --ways instructors.
CSc 2310 Principles of Programming (Java) Dr. Xiaolin Hu.
Prof. Matthew Hertz SH 1029F /
Prof. Matthew Hertz WTC 207D /
Computer Science 10: Introduction to Computer Science Dr. Natalie Linnell with credit to Cay Horstmann and Marty Stepp.
1 My Experiences as Faculty Member and Researcher Dr. Kalim Qureshi.
Which Language is Better?
1 Introduction to Physics 250 Dr. Phil Womble Applied Physics Institute Office Hours: MWF 8:00-9:00 TCCW 232.
Computer Programming TCP1224 Chapter 3 Completing the Problem-Solving Process and Getting Started with C++
The Guessing Game The entire business of Statistics is dedicated to the purpose of trying to guess the value of some population parameter. In what follows.
Imin Research Techniques in Immunology Techniques used to study immunology & Techniques using products of immune system - e.g. Abs Instructors -Dr.
Welcome to Physics 1D03.
1 8/29/05CS150 Introduction to Computer Science 1 Professor: Shereen Khoja
1 Introduction to Linguistics Teacher: Simon Smith ( 史尚明 ) – “Dr Smith”, “Simon” or “ 老師 ”: OK – “Smith” or “Teacher”: not OK This semester’s course: –
CSC425 - Introduction To Computer Programming 1. 2 Generation Of Programming Languages A set of rules that telling a computer what to do. There are over.
Course Introduction CSE250. Course Overview This course will be difficult Work hard and start early You are adults and I will treat you as such – I won’t.
WOW: WORKING ON THE WORK Tricks, Tips, and Best Practices Doug Curtright and Shane Miller.
1 Design and Integration: Part 2. 2 Plus Delta Feedback Reading and lecture repeat Ambiguous questions on quizzes Attendance quizzes Boring white lecture.
Welcome to PHY2049 Physics for Engineers and Scientists II Dr. Bindell.
Introduction to Python Lesson 1 First Program. Learning Outcomes In this lesson the student will: 1.Learn some important facts about PC’s 2.Learn how.
COP 3275 – Iteration and loops Instructor: Diego Rivera-Gutierrez.
CSCE 1030 Computer Science 1 First Day. Course Dr. Ryan Garlick Office: Research Park F201 B –Inside the Computer Science department.
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
1 Running Experiments for Your Term Projects Dana S. Nau CMSC 722, AI Planning University of Maryland Lecture slides for Automated Planning: Theory and.
CMSC 2021 CMSC 202 Computer Science II for Majors Spring 2002 Sections Ms. Susan Mitchell.
CMSC 2021 CMSC 202 Computer Science II for Majors Spring 2001 Sections Ms. Susan Mitchell.
COMP1927 Course Introduction 16x1
COP3502: Introduction to Computer Science Yashas Shankar Program Translation.
Welcome to Astronomy 113 “ It would seem that you have no useful skill or talent whatsoever, he said.
Warmup 11/18/15 Research says a glass of red wine each day is good for your heart. Who do you think paid for this research? Why? Objective Tonight’s Homework.
广州市教育局教学研究室英语科 Module 8 Unit 1 A land of diversity Period 3 Grammar 广州执信中学 郑卫红.
BIT 115: Introduction To Programming. 2 Today Midterm feedback Getting User Input.
Welcome to Introduction to Psychology! Let’s share a bit about where we are all from…
Week 1 in American Government Tips for this week: Assignments due by Sunday : 1.00, 1.01, 1.02, 1.03, 1.04 Read the Course Information tab– FAQS Add the.
CSE 374 Programming Concepts & Tools
Study Smarter, Not Harder
Done Done Course Overview What is AI? What are the Major Challenges?
Title of your science project
Week 1 Gates Introduction to Information Technology cosc 010 Week 1 Gates
Psychological Science I: Statistics
Introduction to Physical Science & Scientific Method
CS 139 – Programming Fundamentals
Welcome to Physics 1D03.
Welcome to CS220/MATH 320 – Applied Discrete Mathematics Fall 2018
Experiments and Variables Experiment 3.1 Using a Series of Experiments
CMSC201 Computer Science I for Majors Lecture 24 – Sorting
Introduction to Computer Science I.
Classroom Procedures & Disclosure
Professor: Shereen Khoja
Presentation transcript:

CSIS-400: Bioinformatics Dr. Eric Breimer

Good News & Bad News Remind me to tell you the good news

Quizzes At least 6 pop-quizzes – The lowest one gets dropped 20 minutes long – Not difficult if you come to class I may announce them ahead of time – if attendance is not a problem

Projects 3 Projects – Kind of hard Open due dates – Submit early and I’ll give you feedback – Then, submit again to get a better grade – Final submission a week before classes end

Exams First exam (early October) – Fail it and you should probably drop the course – Things are only going to get harder Second exam is a warm up for the final – (i.e. late November) Third exam is cumulative – during finals week.

Lecture My PowerPoint slides are sometimes bare – Not enough detail to study from Textbook is a reference manual – Look things up, not a tutorial Most of the material comes from lecture. – Miss lecture a lot and you will be lost

Grading 25% Cumulative final exam 15% Higher exam grade 10% Lower exam grade 20% Quiz average – 5 quizzes (equally weights) 30% Project average – 3 projects (different weights)

Bioinformatics Computer Science/Math meets Biology/Genetics Data-driven Science – Data collection – wet lab – 5% – Analyzing the data – computer lab – 95% Human Genome Project – best example of bioinformatics Machine Learning & Data Mining – used heavily in bioinformatics

Science (remind me to tell you a story) What is science? Make an observation Develop a hypothesis – explains the observation Run some experiments – to test your hypothesis Collect enough data to support your hypothesis – and it becomes a Theory

Science (again) 1. Observation 2. Hypothesis 3. Experiments 4. Supportive Data 5. Theory

Bad Science Collect some data Look at the data exhaustively and try to find some property/hypothesis Develop tools and run experiments to help you discover some property/hypothesis Then develop a hypothesis that ‘makes sense’ (scientifically speaking)

Bad Science (example) For years, material science and engineering followed the ‘bad science’ model. The search for better goop 1. Mix materials to make new types of goop 2. Test all the new types of goop 3. Pick the best goop 4. Then, try to explain scientifically: Why is this goop so good?

Bad Science (again) 1. Collect Data 2. Experiment 3. Design Tools 4. Analyze Data 5. Hypothesis/Theory

Some History Researcher were shunned for conducting bad science – Shunned by academia – Not shunned by industry Then came the Human Genome Project Industry (Celera) kicked Academia’s (NIH) butt. Q:What was their secret weapon? A: Bad Science Please don’t shun me!

Analogy Problem: You are in a cage with 500 lbs. hungry tiger Only options: 1. Reason with the tiger 2. Shoot the tiger Nobody wants to hurt a tiger, but… Some problems can’t be reasoned with…

Analogy The Human Genome data is like a 500 lbs. hungry tiger. A high-throughput computer is like a shotgun. Pure scientists will die reasoning with the problem Meanwhile, Industry will use computers and ‘bad science’ to perform amazing genetic experiments.

The Genome Project (overly simplified version) Biologists/Chemists figured out a way of reading the genetic material in a cell (sequencing) Unfortunately, they couldn’t read it from beginning to end They could only read it in small chunks And, the process was prone to error.

The Genome Project (overly simplified version) Over time, Biologists/Chemists sequenced a lot of DNA. Lots of different organisms Lots of different segments Before any ‘real’ hypotheses could be made they had to 1. Assemble the data 2. Correct the errors 3. Put the segments together

The Genome Project (overly simplified version) Thus, algorithms, techniques, and methodologies were pioneered just to put together the segments. Assembling entire Genomes is just the beginning The next goal is to generate a complete mapping between sequence and function

Heart formation Mapping Which pieces do What function? Glucose metabolism Liver function Brain development Vision Cancer prob. Blaa I’m not good at making stuff up

Analogy Imagine a C++ Program Imagine a really big one 4 million lines long… int main(void) { int x = 0; int y = 0; cout << "Enter two numbers: "; cin >> x >> y; int z = x + y; cout >> z; }

Analogy Imagine compiling it into assembly language 00424D90 mov eax,dword ptr [ecx] 00424D92 mov edx,7EFEFEFFh 00424D97 add edx,eax 00424D99 xor eax,0FFh 00424D9C xor eax,edx 00424D9E add ecx, DA1 test eax, h 00424DA6 je main_loop (00424d90) 00424DA8 mov eax,dword ptr [ecx-4]

Analogy Imagine what the machine code would look like Shown here in Hex format

Analogy Now picture the hex code as a binary sequence –

Analogy Now change some of the bits (2% error rate) –

Analogy Lets randomly sample segments of the sequences –

Analogy Finally, lets collect 1,000,000 random segments …

Analogy Complete Problem: – Given 1,000,000 random samples. Remember there are random errors – Try to re-construct the original 4 million line program You don’t have to really re-construct it You just have to tell me EXACTLY what it does.

Analogy Complete Here’s the catch: – I’m not even going to tell you what programming language I used – In fact, the only thing I’ll tell you is that it’s a language you’ve never seen in your entire life.