Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.

Slides:



Advertisements
Similar presentations
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Advertisements

SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Indiana University School of David Wild – CICC Quarterly Meeting, Jan Page 1 Projects 1-4 update David Wild CICC Quarterly Meeting January 27.
Supporting Engagement in Open Access: a Publishers Perspective
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
High Performance Dimension Reduction and Visualization for Large High-dimensional Data Analysis Jong Youl Choi, Seung-Hee Bae, Judy Qiu, and Geoffrey Fox.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Parallel Data Analysis from Multicore to Cloudy Grids Indiana University Geoffrey Fox, Xiaohong Qiu, Scott Beason, Seung-Hee.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
AMBIT Software for Data Management and (Q)SAR Applications Nina Jeliazkova Bulgarian Academy of Sciences Institute for Parallel Processing Sofia Bulgaria.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
1 Challenges Facing Modeling and Simulation in HPC Environments Panel remarks ECMS Multiconference HPCS 2008 Nicosia Cyprus June Geoffrey Fox Community.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
1 Gary Wiggins for Geoffrey Fox April 30, 2007 Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana.
A long tradition. e-science, Data Centres, and the Virtual Observatory why is e-science important ? what is the structure of the VO ? what then must we.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
CS525: Big Data Analytics Machine Learning on Hadoop Fall 2013 Elke A. Rundensteiner 1.
Computational Scientometrics Studying science by scientific means Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Big Data Supporting Drug Discovery Cautionary Tales from the World of Chemistry for Translational Informatics Valery Tkachenko RSC-CSIR/OSDD meeting Pune,
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
ChemModLab: A Web-based Cheminformatics Modeling Laboratory S. Stanley Young + ECCR and ChemSpider Teams.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Parallel Applications And Tools For Cloud Computing Environments Azure MapReduce Large-scale PageRank with Twister Twister BLAST Thilina Gunarathne, Stephen.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Indiana University School of David Wild, Geoffrey Fox, Bioinformatics retreat, February Page 1 Chemoinformatics David Wild, Bioinformatics.
LACBioNet : a Regional BioInformatics/Computational Biology Network for Latin-America and the Caribbean Special emphasis will be given to : Training and.
CNI, 3rd April 2006 Slide 1 UK National Centre for Text Mining: Activities and Plans Dr. Robert Sanderson Dept. of Computer Science University of Liverpool.
SALSA and Cheminformatics SALSA Group February
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
1 Overview of Chemical Informatics and Cyberinfrastructure Collaboratory October Geoffrey Fox Computer Science, Informatics, Physics Pervasive.
SALSASALSASALSASALSA Digital Science Center February 12, 2010, Bloomington Geoffrey Fox Judy Qiu
Parallel Applications And Tools For Cloud Computing Environments CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing.
Use of Machine Learning in Chemoinformatics
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
MDL Information Systems, Inc. Powering the Process of Invention Donna del Rey Director, Business Planning
UKOLN is supported by: Library futures in the new research landscape. Dr Liz Lyon, UKOLN, University of Bath, UK CURL Members Meeting October 2004, London.
Data Mining in Germany IIM Conference, Oct. 24, 2012 Gottfried Schwarz, DLR > Lecture > Author Document > Datewww.DLR.de Chart 1.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Chemical Informatics and Cyberinfrastructure Collaboratory An NIH-Funded Exploratory Center for Cheminformatics Research Project of the IU School of Informatics.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Introduction to PubChem BioAssay
MATLAB Distributed, and Other Toolboxes
Chemical Informatics and Cyberinfrastructure Collaboratory
Modern Data Management
Gary Wiggins for Geoffrey Fox
CICC Combines Grid Computing with Chemical Informatics
Volume 19, Issue 1, Pages (January 2012)
CICC Chemical Compound Mining Workflows
Consortium: National networks in 16 European countries.
Presentation transcript:

Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource including 2D and 3D databases, predictive models, statistics, docking, name to structure conversion, 2D to 3D conversion, similarity and clustering. Education: A leading center in cheminformatics education offering Ph.D., M.S. and innovative Distance Education program aimed at government, academia and industry. Collaboration with University of Michigan. Delivering a CIC distance class at Michigan Research: Major research areas include data mining of PubChem bioassays including network models, predictive models for cytotoxicity, QSAR domain applicability, clustering huge datasets, data mining chemistry literature and documents, and Semantic Web applications 2D Map of PubChem from GTM

Indiana University School of Cheminformatics Web Service Infrastructure Extensive set of (Web) services made available as a community resource including 2D and 3D databases, predictive models, statistics, docking, name to structure conversion, 2D to 3D conversion, similarity and clustering. Teragrid and local supercomputing resources provide scalability to millions of compounds. Prototypes a comprehensive open community infrastructure framework for algorithm & application deployment: Teragrid, IU, and other Grid/Cloud Cyberinfrastructure And Supercomputing Resources Aggregation and Data Mining Algorithms PubChem and Other Public Chemical & Biological Information Sources Cheminformatics, Comp. Chemistry, and Statistical (R) Web Services Knowledge Discovery Tools Educational applications InfrastructureAlgorithms & Applications

Indiana University School of Exemplar Current Project: Pub3D Services Provides 3D structures for 17M PubChem compounds. Scalable to hundreds of millions of structures Accessible via SQL, Web page and Web service interfaces. Can be included in workflows. Will include multiple conformers Backed by novel algorithms and distributed DB architecture to enable fast shape queries. Also enables density of chemical space analysis.

Indiana University School of Current Major Research Areas Data mining of PubChem bioassays using Bayesian models. Using Cytotoxicity models to predict acute toxicity –Collaboration with Stephan Schurer, Scripps (Florida) Algorithms for domain applicability of QSAR models. Exploration of chemical spaces using density of space approach. Virtual Screening for anti-malarials –Collaboration with Jean-Claude Bradley, Drexel University –Two micromolar inhibitors of falcipain-2 identified Supporting predictive model deployment and exchange (PMML) Cheminformatics cyberinfrastructure Fast comparison of toxicity data sets using binary fingerprints.

Indiana University School of Graduate Program in Cheminformatics Unique program in Cheminformatics: we are the only center in the U.S. we are aware of offering a range of formal qualifications in cheminformatics. –As of fall 2008 will have 6 Ph.D. students, 8 M.S. students, and 4 graduate certificate students who come from government, industry & academia. –All courses are available by Distance Education including CIC courseshare with Michigan. –We have received Industry Fellowships from Lilly and Symyx. Delivering a CIC distance class at Michigan General review of cheminformatics education in Drug Discovery Today 11, 9&10 (May 2006), pp Distance Education J. Chm. Inf. Model 2006; 46;

Indiana University School of Future Directions New Initiative: linking bioinformatics and chemical informatics. Parallel deterministic Annealing/GTM clustering and dimensional scaling (MDS) of huge datasets –Exploiting multicore and other advanced chip architectures 2D Map of PubChem from GTM Extracting and mining chemical information in journal articles (SMILES index NLP, and structure/ontology searching) Use of Semantic Web for automatic workflow composition. Compound & Bioassay Network Models allow investigation of cross- assay relationships and the use of PubChem as a source of polypharmacology. Network view of SARs providing a framework to analyze structure activity landscapes.