[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Data Mining in Chemistry Markus C. Hemmer Computer-Chemie-Centrum, Universität Erlangen-Nürnberg.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

Analysis of High-Throughput Screening Data C371 Fall 2004.
Particle swarm optimization for parameter determination and feature selection of support vector machines Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen,
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
© Oellien, Ihlenfeldt, Engel, Ertl C3C3 MMWS 2002 Interactive Datamining of Large-Scale Screening Datasets Klaus Engel, Thomas Ertl Visualization and Interactive.
Collaborative Information Management: Advanced Information Processing in Bioinformatics Joost N. Kok LIACS - Leiden Institute of Advanced Computer Science.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Building an Intelligent Web: Theory and Practice Pawan Lingras Saint Mary’s University Rajendra Akerkar American University of Armenia and SIBER, India.
© Franz Kurfess Project Topics 1 Topics for Master’s Projects and Theses -- Winter Franz J. Kurfess Computer Science Department Cal Poly.
Cloud Computing for Chemical Property Prediction Paul Watson School of Computing Science Newcastle University, UK Microsoft Cloud.
8 th Iranian workshop of Chemometrics 7-9 February 2009 Progress of Chemometrics in Iran Mehdi Jalali-Heravi February 2009 In the Name of God.
Basic concepts of Data Mining, Clustering and Genetic Algorithms Tsai-Yang Jea Department of Computer Science and Engineering SUNY at Buffalo.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Social Network Analysis: Tasks and Tools Steven Loscalzo and Lei Yu Department of Computer Science Watson School of Engineering and Applied Science State.
Iris Recognition By Mohammed, Ashfaq Ahmed. Introduction Iris Recognition is a Biometric Technology which deals with identification based on the human.
Deep Learning for Big Data P. Baldi University of California, Irvine Department of Computer Science Institute for Genomics and Bioinformatics Center for.
Data Mining – Intro.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Automatic assignment of NMR spectral data from protein sequences using NeuroBayes Slavomira Stefkova, Michal Kreps and Rudolf A Roemer Department of Physics,
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Data Mining Techniques
1 Data mining of toxic chemicals & database-based toxicity prediction Jiansuo Wang & Luhua Lai Institute of Physical Chemistry, Peking University P. R.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Understanding Data Analytics and Data Mining Introduction.
© Gasteiger et al. C3C3 /slides/VS-C/Dias/terena_eng_d.ppt Networking Education of Chemistry.
Progress in utilization of Mycobacterium tuberculosis cytochrome P450 monooxygenases as novel drug targets Central University of Technology Bloemfontein,
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
NMR Chemical Shift Prediction nmrshiftDB2. HOSE Codes Hierarchically Ordered Spherical Description of Environment Description of chemical environment.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
U N I V E R S I T Y O F S O U T H F L O R I D A Database-centric Data Analysis of Molecular Simulations Yicheng Tu *, Sagar Pandit §, Ivan Dyedov *, and.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
1 A Heuristic Approach Towards Solving the Software Clustering Problem ICSM03 Brian S. Mitchell /
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Data mining. Data mining, at its core, is the transformation of large amounts of data into meaningful patterns and rules.
Structural Browsing Indices, Spotfire and Drug Discovery Mark Johnson 1 and Yong-jin Xu 2 1 Pannanugget Consulting; 2 Pharmacia, Inc. Spotfire Users Conference.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
AN INTELLIGENT AGENT is a software entity that senses its environment and then carries out some operations on behalf of a user, with a certain degree of.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
C3C3 Introduction into CI; SS 03/1st lecture © Gasteiger et al. Chemoinformatics in Europe: Achievements and Perspectives Johann Gasteiger Computer-Chemie-Centrum.
EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos.
Role of Theory Model and understand catalytic processes at the electronic/atomistic level. This involves proposing atomic structures, suggesting reaction.
CZ5225 Methods in Computational Biology Lecture 2-3: Protein Families and Family Prediction Methods Prof. Chen Yu Zong Tel:
Learning disjunctions in Geronimo’s regression trees Felix Sanchez Garcia supervised by Prof. Dana Pe’er.
Use of Machine Learning in Chemoinformatics
Books Visualizing Data by Ben Fry Data Structures and Problem Solving Using C++, 2 nd edition by Mark Allen Weiss MATLAB for Engineers, 3 rd edition by.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Artificial Neural Networks and Their Applications Prof. Les Sztandera.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
It is a web-based tool for the retrieval of chemistry information and data from published literature. The content covers more than 200 years of chemistry.
RESEARCH APPROACH.
Current Status at BioChemtek
Data Warehousing and Data Mining
The halogens / Qualitative tests Module Enthalpy changes
Ligand Docking to MHC Class I Molecules
Standards Development for Metabolomics
BIOINFORMATICS Summary
Data Warehousing Data Mining Privacy
Lecture 4. Niching and Speciation (1)
Presentation transcript:

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Data Mining in Chemistry Markus C. Hemmer Computer-Chemie-Centrum, Universität Erlangen-Nürnberg D Erlangen, Germany

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 What is Data Mining ? Data Mining is an analytical process designed to explore large amounts of data in search for consistent patterns and systematic relationships. „...a non-trivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data“ (Srikant, Agrawal, 1996)

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C Yearly number of documents in Chemical Abstracts Amount of Information in Chemistry Millions Number of registered substances Thousands

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 The Chemical Language C 10 H 13 Cl 2 O 3 PS Dichlophenthion Phosphorothioic acid O-2,4-dichlorophenyl O,O-diethyl ester ClC(C(=C1)OP(=S)(OCC)OCC)=CC(=C1)Cl

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Search for Cancerostatic Drugs similar substratesprotein/substrate complex

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 chemical reactivity biological activity Representation of Properties

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Non-linear Projection onto a Torus

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Comparison of Steroid Surfaces 3,20-Allopregnandion3,20-Pregnandion

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Descriptor of a Polycyclic System

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Visualization of Multidimensional Data

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Research and Projects at the CCC TeleSpec Evaluation of Reactions Drug Design Synthesis Design Structure/Spectrum Correlation Dissertation online SOL Biochemical Pathways ChemVis QSAR/QSPR VS-C

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Software Development at the CCC CORINA 3D structure generator PETRA atomic property calculator ARC descriptor generator KMAP Kohonen network generator CACTVS chemical information system EROS reaction prediction expert system CORA reaction classification system WODCA synthesis design expert system

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Data Mining Dienst – Chemie (Data Mining Service – Chemistry) Pattern Recognition Substructure Search Similarity Search Diversity Search Pattern Analysis Property Search

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Information Sources Simulation Analysis Databases Calculation

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 The Concept of Data Mining Service - Chemistry

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Descriptor Software

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Searching a Substructure substructure search

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Acknowledgements Chemical Information Dr. Thomas Engel Databases & Visualization Dr. Wolf-Dietrich Ihlenfeldt Frank Oellien Expert Systems Achim Herwig Genetic Algorithms Dr. Sandra Handschuh Neural Networks Dr. Andreas Teckentrup Dr. Lothar Terfloth Spectroscopy Dr. Paul Selzer Thomas Kostka Structures & Properties Thomas Kleinöder Christof Schwab Structure Coding Dr. Joao Aires de Sousa Dr. Valentin Steinhauer Synthesis Planning Dr. Matthias Pförtner Markus Sitzmann Team Coordination Prof. Dr. Johann Gasteiger

[vermeer]slides/IR/DataMining.ppt © Gasteiger et al. C3C3 Contact Information WWW: