10-12 Sep 2008, OpenTox kick-off meeting, Basel, Switzerland. Ideaconsult Ltd., Dr. Nina Jeliazkova.

Presentation transcript:

Content
- Introduction
- Main tasks in OpenTox
- Overview of the software developed by Ideaconsult Ltd.
  - Toxtree 1.51
  - Toxmatch 1.05: a chemical similarity evaluation tool
  - Ambit Discovery
  - Ambit Database Tools 1.30
  - QMRF repository
  - Ambit XT (new, under development)

Introduction
- Ideaconsult Ltd. is an SME registered with the Sofia City Court at the end of 2004. It provides:
  - consultancy, technical services and software development in the areas of chemoinformatics, QSAR and data mining;
  - design and feasibility studies of computer networks;
  - support to governmental organizations and small and medium enterprises that require assistance in writing projects in IST and the Environment.
- It develops and maintains several well-known and widely used open source software products relevant to OpenTox, in particular:
  - Toxtree (used in the application of the threshold of toxicological concern)
  - Toxmatch (for encoding and applying chemical similarity indices)
  - Ambit (a QSAR decision support system, including generic database management, structure conversions and searching, as well as applicability domain assessment)
- These tools have been designed and developed in close co-operation with academia, regulatory bodies and industry. Some of the code base could be re-used, adapted and further enhanced to meet the objectives of the project.

Main tasks in OpenTox
- Work package 2 (Framework implementation), leader
  - Building blocks for a (Q)SAR decision support system (WP2)
  - Database for QSAR models (compliant with ECB QMRF) (WP2)
- Work package 3
  - AMBIT database with more than chemical compounds (WP3)
- Work package 4
  - Chemoinformatics and data mining algorithms (WP4)
  - Algorithms for the determination of chemical similarities and applicability domains (WP4)

Toxtree 1.51
- Estimates toxic hazard by applying a decision tree approach.
- Full-featured, flexible and user-friendly open source software
  - New decision trees with arbitrary rules can be built with the graphical user interface or by developing new plug-ins in Java
  - GPL license
- Platform independent
- Input: datasets from various compatible file types, SMILES, or the built-in 2D structure diagram editor
- Output: SDF, MOL, CSV, MS Excel, CML, TXT, PDF, HTML
- Batch mode
- 5 classification schemes (plug-ins) available for the assessment of various endpoints

Toxtree 1.51 plug-ins
- Cramer rules: Cramer G. M., R. A. Ford, R. L. Hall, Estimation of Toxic Hazard - A Decision Tree Approach, J. Cosmet. Toxicol., Vol. 16, pp , Pergamon Press, 1978.
- Toxicity modes of action: Verhaar scheme for predicting toxicity modes of action (Verhaar HJM, van Leeuwen CJ and Hermens JLM (1992) Classifying environmental pollutants. 1. Structure-activity relationships for prediction of aquatic toxicity. Chemosphere 25, ).
- Skin irritation and corrosion potential: a decision tree based on rules published in "The Skin Irritation Corrosion Rules Estimation Tool (SICRET)", John D. Walker, Ingrid Gerner, Etje Hulzebos, Kerstin Schlegel, QSAR Comb. Sci. 2005, 24, pp .
- Eye irritation and corrosion potential: a decision tree based on rules published in "Assessment of the eye irritating properties of chemicals by applying alternatives to the Draize rabbit eye test: the use of QSARs and in vitro tests for the classification of eye irritation", Ingrid Gerner, Manfred Liebsch and Horst Spielmann, Alternatives to Laboratory Animals, 2005, 33, pp .
- Carcinogenicity and mutagenicity: a decision tree based on the rules published in the accompanying document "The Benigni / Bossa rulebase for mutagenicity and carcinogenicity - a module of Toxtree", by R. Benigni, C. Bossa, N. Jeliazkova, T. Netzeva and A. Worth.

Toxtree internals
A series of questions is applied to the query compound and also to its reaction products.

Query compound:
  Q1.  Normal constituent of the body: No
  Q2.  Contains functional groups associated with enhanced toxicity: No
  Q3.  Contains elements other than C, H, O, N, divalent S: No
  Q5.  Simply branched aliphatic hydrocarbon or a common carbohydrate: No
  Q6.  Benzene derivative with certain substituents: No
  Q7.  Heterocyclic: No
  Q16. Common terpene: No
  Q17. Readily hydrolysed to a common terpene: No
  Q19. Open chain: No
  Q23. Aromatic: Yes
  Q27. Rings with substituents: Yes
  Q28. More than one aromatic ring: Yes
  Q29. Readily hydrolysed: Yes

Hydrolysis products:
  Q19. Open chain: Yes
  Q20. Aliphatic with some functional groups: Yes
  Q21. 3 or more different functional groups: No
  Q18. One of the list (see explanation): No
  Result: Low (Class I)

  Q30. Aromatic ring with complex substituents: No
  Q18. One of the list (see explanation): No
  Result: Low (Class I)
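The way a rule-based decision tree walks a compound through yes/no questions and records the answers can be sketched in a few lines. This is only an illustration, not Toxtree's code: the `Rule` class, the `classify` function, the compound flags and the branch targets of the three-question toy tree are all assumptions made for the example, and the toy tree is not the Cramer scheme.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Rule:
    """One yes/no question in the tree (hypothetical structure)."""
    text: str
    test: Callable[[dict], bool]  # predicate evaluated on the compound
    on_yes: str                   # id of the next rule, or a final class label
    on_no: str


def classify(tree: Dict[str, Rule], start: str,
             compound: dict) -> Tuple[str, List[Tuple[str, str, bool]]]:
    """Follow the rules from `start` until a label that is not a rule id is reached.

    Returns the final label and the trace of (rule id, question, answer).
    """
    trace = []
    node = start
    while node in tree:
        rule = tree[node]
        answer = rule.test(compound)
        trace.append((node, rule.text, answer))
        node = rule.on_yes if answer else rule.on_no
    return node, trace


# Toy tree: question texts are taken from the slide, branch targets are invented.
TOY_TREE = {
    "Q1": Rule("Normal constituent of the body",
               lambda c: c["endogenous"], "Low (Class I)", "Q2"),
    "Q2": Rule("Contains functional groups associated with enhanced toxicity",
               lambda c: c["toxic_groups"], "High (Class III)", "Q3"),
    "Q3": Rule("Contains elements other than C, H, O, N, divalent S",
               lambda c: c["other_elements"], "High (Class III)", "Low (Class I)"),
}

if __name__ == "__main__":
    # Compound properties are assumed to be precomputed boolean flags.
    compound = {"endogenous": False, "toxic_groups": False, "other_elements": False}
    label, trace = classify(TOY_TREE, "Q1", compound)
    for rule_id, text, answer in trace:
        print(f"{rule_id}. {text}: {'Yes' if answer else 'No'}")
    print("Result:", label)
```

Keeping the trace alongside the result mirrors the slide, where every question and its answer is shown to the user. Reaction products could be classified by calling `classify` again on each product.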

More on Toxtree
- Structure alerts defined by SMARTS patterns
- Descriptor calculation: LogP, Molar Refractivity, Partial MR, Sterimol, EHOMO, ELUMO, etc.
- Explanations and examples for each rule outcome
- Improved visualization and reporting
- Can be embedded in other standalone and web applications
- Recent publication: G. Patlewicz, N. Jeliazkova, R.J. Safford, A.P. Worth and B. Aleksiev, An evaluation of the implementation of the Cramer classification scheme in the Toxtree software, SAR and QSAR in Environmental Research, Vol. 19, Nos. 3–4, April–June 2008, 1–30

Toxmatch 1.05
- Provides means to compare a chemical or a set of chemicals to a toxicity dataset through the use of similarity indices
  - Intended use is one-to-many or many-to-many quantitative read-across
  - Helps in the systematic formation of groups and in read-across
- Includes datasets for four toxicity endpoints to facilitate endpoint-specific read-across
  - aquatic toxicity
  - bioconcentration factor
  - skin sensitisation
  - skin irritation
- Developed under the terms of a Joint Research Centre (JRC) contract
- Flexible open-source software application
- Platform independent

Toxmatch methods
- Structure representations
  - Descriptors
  - Fingerprints
  - Atom environments
- Similarity indices (pairwise)
  - Euclidean distance
  - Cosine similarity
  - Hodgkin-Richards index
  - Tanimoto distance
  - Tanimoto distance on fingerprints
  - Hellinger distance on atom environments
  - Maximum Common Structure similarity
- Similarity to a set
  - Similarity between a query structure and a representative point of the set (e.g. the dataset centre or a consensus fingerprint)
  - Average similarity between a query structure and the nearest k structures
- Descriptor generation
  - EHOMO, ELUMO, Log P, MW can be calculated
- Verhaar and BfR skin irritation schemes, as available in Toxtree, are included

G. Patlewicz, N. Jeliazkova, A. Gallegos Saliner, A. P. Worth, Toxmatch - a new software tool to aid in the development and evaluation of chemically similar groups, SAR and QSAR in Environmental Research, 19:3, 397-412 (2008)
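Several of the pairwise indices listed above have short textbook definitions, sketched below in plain Python. This is not Toxmatch's implementation: the function names are made up for the example, fingerprints are assumed to be sets of "on" bit positions, descriptors are assumed to be equal-length lists of numbers, and the Tanimoto index is written in its similarity form (the corresponding distance is 1 minus the value).

```python
import math
from typing import Iterable, List, Sequence, Set


def tanimoto(fp_a: Set[int], fp_b: Set[int]) -> float:
    """Tanimoto similarity of two fingerprints: |A and B| / |A or B|."""
    union = len(fp_a | fp_b)
    return len(fp_a & fp_b) / union if union else 1.0


def euclidean(x: Sequence[float], y: Sequence[float]) -> float:
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def cosine(x: Sequence[float], y: Sequence[float]) -> float:
    """Cosine similarity: dot(x, y) / (|x| * |y|)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return dot / norm if norm else 0.0


def hodgkin_richards(x: Sequence[float], y: Sequence[float]) -> float:
    """Hodgkin-Richards index: 2 * dot(x, y) / (|x|^2 + |y|^2)."""
    dot = sum(a * b for a, b in zip(x, y))
    denom = sum(a * a for a in x) + sum(b * b for b in y)
    return 2.0 * dot / denom if denom else 1.0


def mean_knn_similarity(query: Set[int], dataset: Iterable[Set[int]], k: int) -> float:
    """Average Tanimoto similarity between the query and its k nearest structures."""
    sims: List[float] = sorted((tanimoto(query, fp) for fp in dataset), reverse=True)
    top = sims[:k]
    return sum(top) / len(top) if top else 0.0


if __name__ == "__main__":
    query = {1, 2, 3}
    dataset = [{1, 2, 3}, {2, 3, 4}, {7}]
    print(tanimoto(query, dataset[1]))
    print(mean_knn_similarity(query, dataset, k=2))
```

The last function corresponds to the "similarity to a set" idea on the slide: rather than comparing a query to a single structure, it summarises how close the query is to its nearest neighbours in the dataset.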

Toxmatch methods
- Toxmatch is a promising tool to help in the systematic formation of groups and in read-across.
- Quantitative read-across assessments are possible with Toxmatch, either to predict membership of a predefined group or to estimate the endpoint result as the weighted average of the nearest neighbours, as defined by a specific similarity index such as Euclidean distance or the Tanimoto index.
- However, toxicity data is required in order for analogues to be selected.
- There are of course limitations in encoding similarity into a single index, but these are less about the index per se and more about how the parameters are characterised for a given toxicity. An objective numerical measure can be helpful in grouping chemicals.
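The weighted-average read-across mentioned above can be illustrated as follows. This is a sketch under stated assumptions, not Toxmatch's actual scheme: the slides do not say how the weights are chosen, so the example simply uses each neighbour's similarity as its weight, and the function name `read_across` and the default `k=3` are hypothetical.

```python
from typing import List, Tuple


def read_across(neighbours: List[Tuple[float, float]], k: int = 3) -> float:
    """Estimate an endpoint value from the k most similar analogues.

    `neighbours` holds (similarity to the query, measured endpoint value) pairs.
    Assumption: the weight of each analogue is its similarity to the query.
    """
    nearest = sorted(neighbours, key=lambda pair: pair[0], reverse=True)[:k]
    total_weight = sum(sim for sim, _ in nearest)
    if total_weight == 0:
        raise ValueError("no analogue with non-zero similarity")
    return sum(sim * value for sim, value in nearest) / total_weight


if __name__ == "__main__":
    # Three analogues with known endpoint values; only the two nearest are used.
    analogues = [(0.75, 2.0), (0.25, 4.0), (0.125, 10.0)]
    print(read_across(analogues, k=2))
```

The sketch also makes the slide's caveat concrete: without measured endpoint values for the analogues there is nothing to average, so toxicity data is a prerequisite.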

AMBIT
- Developed within the framework of the CEFIC LRI project "Building blocks for a future (Q)SAR decision support system: databases, applicability domain, similarity assessment and structure conversions".
- Consists of a relational database and functional modules allowing a variety of evaluations: flexible structure, similarity and other queries.
- Applications:
  - Ambit Database Tools 1.30
  - Ambit Discovery (applicability domain assessment)
  - Ambit Online

AMBIT Discovery
Software for applicability domain assessment
- Methods:
  - Ranges
  - Euclidean distance
  - City-block distance
  - Probability density
  - Fingerprints
    - Consensus fingerprint + Tanimoto distance
    - Consensus fingerprint + missing fragments
  - Atom environments
    - Consensus atom environments + Hellinger distance
    - kNN + Tanimoto distance
    - Ranking
- More options
  - Threshold
  - Preprocessing (e.g. PCA)
  - Center
- Results from multiple methods are automatically combined.

Joanna Jaworska, Nina Nikolova-Jeliazkova, How can structural similarity analysis help in category formation, SAR and QSAR in Environmental Research, vol 18, 3-4 (2007)
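Two of the simpler applicability domain methods in the list, descriptor ranges and Euclidean distance to the centre of the training set, can be sketched as below. This is an illustration only, not AMBIT Discovery's code: all function names are hypothetical, descriptors are assumed to be equal-length lists of numbers, and the distance threshold is left as a caller-supplied parameter because the slides do not say how it is chosen.

```python
import math
from typing import List, Sequence, Tuple


def fit_ranges(training: Sequence[Sequence[float]]) -> List[Tuple[float, float]]:
    """Per-descriptor (min, max) over the training set."""
    return [(min(col), max(col)) for col in zip(*training)]


def in_ranges(x: Sequence[float], ranges: Sequence[Tuple[float, float]]) -> bool:
    """Ranges method: every descriptor must lie within the training interval."""
    return all(lo <= value <= hi for value, (lo, hi) in zip(x, ranges))


def centroid(training: Sequence[Sequence[float]]) -> List[float]:
    """Centre of the training set (mean of each descriptor)."""
    n = len(training)
    return [sum(col) / n for col in zip(*training)]


def distance_to_centre(x: Sequence[float], centre: Sequence[float]) -> float:
    """Euclidean distance from a compound to the training-set centre."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, centre)))


def in_distance_domain(x: Sequence[float], centre: Sequence[float],
                       threshold: float) -> bool:
    """Distance method: inside the domain if no farther than `threshold`."""
    return distance_to_centre(x, centre) <= threshold


if __name__ == "__main__":
    training = [[1.0, 10.0], [3.0, 30.0]]
    ranges = fit_ranges(training)
    centre = centroid(training)
    query = [2.0, 23.0]
    print(in_ranges(query, ranges))
    print(in_distance_domain(query, centre, threshold=5.0))
```

In practice descriptors on very different scales would usually be preprocessed first (the slide mentions PCA as one option), otherwise the distance is dominated by the descriptor with the largest numeric range.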

AMBIT Extensions
- ECB commissioned an extension to develop a reference site for retrieving robust summaries of (Q)SAR models in the QSAR Model Reporting Format (QMRF)
- AMBIT XT: under development

AMBIT XT
- Built upon the AMBIT software
- Objectives:
  - Develop open source, user-friendly software providing a set of functionalities to facilitate the registration of chemicals for REACH.
  - Improve user-friendliness by introducing workflow capabilities.
  - Develop a set of defined workflows for analogue identification and PBT assessment.
- Close collaboration with industry
- LGPL license

AMBIT XT: emphasis on
- Data provenance: history of the updates to the chemical information
- Data quality: easy comparison between different sources
- Flexible storage of measured data for different endpoints
  - Easy way to extract all relevant information for a chemical; many formats available for toxicological data
- Recording of user actions
- Easy entry of complex structural alerts to facilitate grouping
- Molecular descriptors
- Improved data entry and visualization
- Embedded workflow engine
- Modular application (flexible plug-in support)

Thank you!