Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Using Molecular Biology to Teach Computer Science High-level Programming with Python Finding Patterns.

Slides:



Advertisements
Similar presentations
Technology Across the Curriculum Programs Keys to Success Beth Secrist, TAC Coordinator Copyright Beth Secrist, This work is the intellectual property.
Advertisements

 The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students.
Presentation to Educational Policy Committee Department of Biology Revised March, 2013 Biology Department: Position Requests.
Copyright © 1997 Carnegie Mellon University Introduction to the Personal Software Process - Lecture 1 1 Introduction to the Personal Software Process Lecture.
National Standards In Pathology Education Barbara Knollmann-Ritschel, M.D.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Essential Computing for Bioinformatics 1 Lecture.
Educational Programs in Bioinformatics at UNO Hesham H. Ali Department of Computer Science College of Info Science and Technology University of Nebraska.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
NICLS: Development of Biomedical Computing and Information Technology Infrastructure Presented by Simon Sherman August 15, 2005.
1 Essential Computing for Bioinformatics Bienvenido Vélez UPR Mayaguez Lecture 4 High-level Programming with Python Part I: Controlling the flow of your.
Information Technology Career Ladder Clayton College & State University Larry Booth, IT Department Head Copyright Larry Booth,
1 The Department of Computational Biology University of Pittsburgh School of Medicine Hagai Meirovitch.
Introduction to Light and Electron Microscopy NEU Gina Sosinsky, Ph. D. Mark Ellisman, Ph. D.
These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center 1 
Establishment of a New BS Biotechnology Program with emphasis in Agrobiotechnology and the creation of the Biotechnology Learning and Research Center Award.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
 The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students.
Building the Integrated Learning Center Having the right people at the table Chris Johnson, Senior Consultant for Learning Technologies University of Arizona.
What is the NIH RePORTER? And How Will it Help My PI?
These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center 1 
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
The Biotechnology Program at JCCC Johnson County Community College Overland Park, Kansas.
PANEL: Growing a Research Development Infrastructure from the Ground Up.
OSU -22 yrs Barb Taylor Biopic BS CU PhD UCSD Res Asst CU & UCHSC Post-doc UW.
Artificial Intelligence Lecture No. 29 Dr. Asad Ali Safi ​ Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.
The NIH Roadmap and the Human Microbiome Project Francis S. Collins, M.D., Ph.D. National Human Genome Research Institute April 22, 2007.
 The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students.
Computing and Communications and Biology Molecular Communication; Biological Communications Technology Workshop Arlington, VA 20 February 2008 Jeannette.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
 The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez 1 Essential Computing for Bioinformatics Lecture.
 The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students.
COMPUTATIONAL BIOLOGIST DR. MARTIN TOMPA Place of Employment: University of Washington Type of Work: Develops computer programs and algorithms to identify.
NOVA Evaluation Report Presented by: Dr. Dennis Sunal.
Biology - 3 Biotechnology – DNK Marine Sciences - 16 Physics - 3 Geology - 5 Mathematics - 1 Chemistry - DNK Dean Office - DNK Civil Eng. - DNK Chemical.
© 2010 Pittsburgh Supercomputing Center Pittsburgh Supercomputing Center RP Update July 1, 2010 Bob Stock Associate Director
Applicant’s Profile Study for Improving Undergraduate Enrollment at the University of Puerto Rico at Mayagüez Dr. David González Barreto Dr. Antonio A.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Implementing Practices That Lead to Institutional Change: Faculty Development Kaci Thompson, University of Maryland Joe Watkins, University of Arizona.
1 Essential Computing for Bioinformatics Bienvenido Vélez UPR Mayaguez Lecture 3 High-level Programming with Python Part III: Files and Directories Reference:
BING 6004: Intro to Computational BioEngineering Spring 2016 Lecture 1: Using Python Expressions and Variables Bienvenido Vélez UPR Mayaguez Reference:
Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist: Learning with Python 1 Introduction to Python programming for Bioinformatics.
Physics Gravitation Science and Mathematics Education Research Group Supported by UBC Teaching and Learning Enhancement Fund Department of Curriculum.
Alex Ropelewski Pittsburgh Supercomputing Center National Resource for Biomedical Supercomputing Bienvenido Vélez
MARC: Developing Bioinformatics Programs Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist: Learning.
Bienvenido Vélez UPR Mayaguez Essential Computing for Bioinformatics 1 BING 6004: Intro to Computational BioEngineering Spring 2016 Lecture 4: Basic File.
MARC: Developing Bioinformatics Programs June 2012 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
MARC: Developing Bioinformatics Programs Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Essential BioPython Manipulating Sequences with Seq 1.
High-level Programming with Python Expressions and Variables Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer.
MARC: Developing Bioinformatics Programs Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist: Learning.
MARC: Developing Bioinformatics Programs Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist: Learning.
Bienvenido Vélez UPR Mayaguez Using Molecular Biology to Teach Computer Science 1 These materials were developed with funding from the US National Institutes.
Health Management Information Systems What is Health Informatics? Lecture a This material Comp6_Unit1 was developed by Duke University, funded by the Department.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Essential Computing for Bioinformatics 1 Lecture.
1 A Short Introduction to Analyzing Biological Data Using Relational Databases Part II: Creating a Relational Database to Model Biological Data Alex Ropelewski.
MARC: Developing Bioinformatics Programs June 2012 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
Introduction to Python
Using Molecular Biology to Teach Computer Science
A Short Introduction to Analyzing Biological Data Using Relational Databases Part III: Writing Simple (Single Table) Queries to Access Relational Data.
Biography Assistant Vice Chancellor for Commercial Translation in the Health Sciences Visiting Associate Professor, Department of Biomedical Informatics.
Using Molecular Biology to Teach Computer Science
Essential Computing for Bioinformatics
LEARN WHY COMPUTERS ARE REVOLUTIONIZING BIOLOGY!
Introduction to Bioinformatics
Introduction to Python
Units – Powers to Prefix (4)
Units – Powers to Prefix (1)
Units – Prefix to Powers (1)
Units – Powers to Prefix (2)
Units – Powers to Prefix (3)
Presentation transcript:

Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Using Molecular Biology to Teach Computer Science High-level Programming with Python Finding Patterns in DNA Sequences 1 These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center

The following material is the result of a curriculum development effort to provide a set of courses to support bioinformatics efforts involving students from the biological sciences, computer science, and mathematics departments. They have been developed as a part of the NIH funded project “Assisting Bioinformatics Efforts at Minority Schools” (2T36 GM008789). The people involved with the curriculum development effort include: Dr. Hugh B. Nicholas, Dr. Troy Wymore, Mr. Alexander Ropelewski and Dr. David Deerfield II, National Resource for Biomedical Supercomputing, Pittsburgh Supercomputing Center, Carnegie Mellon University. Dr. Ricardo González Méndez, University of Puerto Rico Medical Sciences Campus. Dr. Alade Tokuta, North Carolina Central University. Dr. Jaime Seguel and Dr. Bienvenido Vélez, University of Puerto Rico at Mayagüez. Dr. Satish Bhalla, Johnson C. Smith University. Unless otherwise specified, all the information contained within is Copyrighted © by Carnegie Mellon University. Permission is granted for use, modify, and reproduce these materials for teaching purposes. Most recent versions of these presentations can be found at Essential Computing for Bioinformatics

def findDNAPattern(dna, pattern,startPosition=0, endPosition=None): 'Finds the index of the first ocurrence of DNA pattern within DNA sequence’ if (endPosition == None): endPosition = len(dna) dna = dna.lower() # Force sequence and pattern to lower case pattern = pattern.lower() for i in xrange(startPosition, endPosition): # Attempt to match patter starting at position i if (matchDNAPattern(dna[i:],pattern)): return i return -1 Lecture 2 Homework: One Approach Write your own find function: Top-Down Design: From BIG functions to small helper functions 3 These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center

def matchDNAPattern(sequence, pattern): 'Determines if DNA pattern is a prefix of DNA sequence' i = 0 while ((i < len(pattern)) and (i < len(sequence))): if (not matchDNANucleotides(sequence[i], pattern[i])): return False i = i + 1 return (i == len(pattern)) Lecture 2 Homework: One Approach Write your own find function: Top-Down Design: From BIG functions to small helper functions 4 These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center

Lecture 2 Homework: One Approach def matchDNANucleotides(base1, base2): 'Returns True is nucleotide bases are equal or one of them is unknown' return (base1 == 'x' or base2 == 'x' or (isDNANucleotide(base1) and (base1 == base2))) Write your own find function: 5 These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center

Complementing Sequences: Utilities DNANucleotides='acgt' DNAComplements='tgca’ def isDNANucleotide(nucleotide): 'Returns True when n is a DNA nucleotide’ return (type(nucleotide) == type("") and len(nucleotide)==1 and nucleotide.lower() in DNANucleotides) 6 These materials were developed with funding from the US National Institutes of Health grant #2T36 GM to the Pittsburgh Supercomputing Center

7