Session I Database & Data Mining Speaker: Mehmet M. Dalkilic

Slides:



Advertisements
Similar presentations
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Advertisements

Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Database Systems Chapter 1 The Worlds of Database Systems.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
ระบบฐานข้อมูลขั้นสูง (Advanced Database Systems) Lecturer AJ. Suwan Janin Phone:
Bioinformatics for Stem Cell Lecture 1 Debashis Sahoo, PhD.
Chapter 9 Database Management Discovering Computers Fundamental.
Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Decision Support Systems Chapter 10.
How to Give a Talk Amy Bruckman Georgia Institute of Technology.
Introduction to Access. Access 2010 is a database creation and management program.
Evolution of BINF Bioinformatics Certificate Foothill College R. Cormia, L. English, K. Erickson.
Grade Book Database Presentation Jeanne Winstead CINS 137.
LOGO/ICON Keval Mehta School of Informatics Master of Science in Bioinformatics Andrews Dalkilic Team Dr. Mehmet Dalkilic, Dr. Justen Andrews, Dr. John.
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
© 2005 Bioinformatics Indiana University April, ::: Troy Campbell Advisors: Mehmet Dalkilic, Informatics Claudia Johnson, Paleontology Erika Elswick,
Maximal D-segments Maximal-scoring No subsegment has higher score No segment properly containing the segment satisfies the above No supersegment has higher.
There is an inherent meaning in everything. “Signs for people who can see.”
Prepared by: Saatchi, Seyed Mohsen1 Arab Open University - AOU T171 You, Your Computer and the Net: Learning and living in the information age Module 2.
Leading By Convening: A Blueprint for Authentic Engagement September 13, 2014.
By Prof. Dr. Salahuddin Khan
Joins and Relationships The Theory, the Pearls and the Perils
Data Mining – Intro.
Chapter 7. Propositional and Predicate Logic
AP CSP: Cleaning Data & Creating Summary Tables
Introduction to Bioinformatics and Functional Genomics
Databases.
PhD fellowship in bioinformatics
Soliciting Reader Contributions to Software Tutorials
STEM? WHAT IS Put this slide on the screen and ask students:
Done Done Course Overview What is AI? What are the Major Challenges?
Data Resource Management
THE SCIENTIFIC METHOD.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Relational Algebra Chapter 4, Part A
Bioinformatics (or making apples as big as a house)
Define your own road in life.
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Query Execution Presented by Khadke, Suvarna CS 257
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
Impact of Formal Methods in Biology and Medicine
Impact of Formal Methods in Biology and Medicine
Indigene Project: Systems Biology Project
Database Applications (15-415) Relational Calculus Lecture 6, September 6, 2016 Mohammad Hammoud.
A Few Things to Think About
Rick, the SkyServer is a website we built to make it easy for professional and armature astronomers to access the terabytes of data gathered by the Sloan.
Data Warehousing and Data Mining
Relational Algebra Chapter 4, Sections 4.1 – 4.2
Bioinformatics & Social Conundrums
CSCD 506 Research Methods for Computer Science
Database Management Systems CSE594
BIOINFORMATICS Summary
Dr. Fisher, 20 April 2013 For Astronomy class at ISM
STEM? WHAT IS What is STEM
Chapter 7. Propositional and Predicate Logic
CompSci 1: Principles of Computer Science Lecture 1 Course Overview
Automating Profitable Growth™
Chen Li Information and Computer Science
Query Optimization.
Academic & More Group 4 谢知晖 王逸雄 郭嘉宋 程若愚.
Data Mining.
Spatial Data Infrastructure GRS-21306
Applying principles of computer science in a biological context
Dr.s Khem Ghusinga and Alan Jones
Unit 20 New Frontiers Lesson1 Futurology.
Relational Calculus Chapter 4, Part B
Spreadsheet As a Relational Database Engine
Presentation transcript:

Session I Database & Data Mining Speaker: Mehmet M. Dalkilic Content of Talk & Notes: http://www.informatics.indiana.edu/dalkilic/retreat07 Bioinformatics Retreat 02.03.07 © M.M. Dalkilic

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 “Systems biology is the science of discovering, modeling, understanding and ultimately engineering at the molecular level the dynamic relationships between the biological molecules that define living organisms” Leroy Hood Institute for Systems Biology http://www.systemsbiology.org/ Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Outline (I) A cursory overview of Database and Data Mining (II) Examples (a few) (III) Sundry important research questions (IV) Summary & Prelude to Discussion Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Perspectives "There's millions and millions of unsolved problems. Biology is so digital, and incredibly complicated, but incredibly useful. …. It is hard for me to say confidently that, after fifty more years of explosive growth of computer science, there will still be a lot of fascinating unsolved problems at peoples' fingertips, that it won't be pretty much working on refinements of well-explored things. Maybe all of the simple stuff and the really great stuff has been discovered. It may not be true, but I can't predict an unending growth. I can't be as confident about computer science as I can about biology. Biology easily has 500 years of exciting problems to work on, it's at that level." - It is hard for me to say confidently that, after fifty more years of explosive growth of computer science, there will still be a lot of fascinating unsolved problems at peoples' fingertips, that it won't be pretty much working on refinements of well-explored things. I can't be as confident about computer science as I can about biology. Biology easily has 500 years of exciting problems to work on, it's at that level. Donald Knuth Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Perspectives Computer science is no more about computers than astronomy is about telescopes. Edsger Dijkstra Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database SQL → Algebra → Optimized Algebra → Process → Table Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database SQL is essentially a form of First Order Predicate Calculus differs from general field of Mathematical logic * We don’t focus on use of functions (omit them in SQL) * We focus on finitary models Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Database Why can’t I ask any question I’d like in a relational database? Dirk Van Gucht, DSI Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Database Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Database Why can’t I ask any question I’d like in a relational database? Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Database Why can’t I ask any question I’d like in a relational database? Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Database Why can’t I ask any question I’d like in a relational database? Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Database Why can’t I ask any question I’d like in a relational database? Dirk Van Gucht, DSI Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Datamining Dirk Van Gucht, DSI Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Datamining Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Biological processes can be modeled as complex networks of interconnected components. … Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Bioinformatics Retreat @ Bradford Woods © Indiana University 2007 Data Integration Problem how is data meaningfully integrated Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

How are the data related? Messy issues of database & datamining How are the data related? What kind of model? What kind of inferencing? Is the data validated? Is there sufficient reason to use the network? Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

Relational Database currently ignores domains. Significant Challenges Relational Database currently ignores domains. The relational model is poor at modeling biological data and their uncertain nature…no probabilistic means in querying. No advance in querying. Incorporate other successes in dealing with large repositories. Databases have no casual user in mind—they are designed by experts. Datamining has focused almost exclusively on relational modeled data. Ignored actionable results. Viewing and Search are still in their infancy. Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007

John Colbourne Scott Beason Thanks to organizers email me if you’d like to discuss anything Acknowledgements (no special order) Justen Andrews Haixu Tang Sun Kim Jim Costello Rupali Patwardhan Junguk Hur Sumit Middha Brian Ead, Esfandiar Haghverdi John Colbourne Scott Beason Pedja Radivojac Saturday, December 29, 2018 Bioinformatics Retreat @ Bradford Woods © Indiana University 2007