CEFIC LRI Tools – Ambit 1.21 Nina Jeliazkova Ideaconsult Ltd. Sofia, Bulgaria.

Slides:



Advertisements
Similar presentations
Visualize Success 2011 Tony Gunter Professional Services Visual South, Inc. Advanced Browse and Excel Interface.
Advertisements

SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
EBSCO Discovery Service
Presumed Asbestos Training Programme. Introduction Getting Started The Interface Top Tool Bar Creating a New Site Type I Survey Type II Survey Type III.
MS-Access XP Lesson 1. Introduction to MS-Access Database Management System Software (DBMS) Store data in databases Database is a collection of table.
Microsoft Office 2010 Office 2010 and Windows 7: Essential Concepts and Skills Mark Worden Instructor Use your spacebar or down arrow key to advance slides.
Chapter 5 Creating, Sorting, and Querying a Table
Integrating Access with the Web and with Other Programs.
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
SciFinder ® : Part of the process™ 2006 Edition. SciFinder ® : Part of the process™ 2006 Edition SciFinder ® 2006 provides new, powerful capabilities.
Lab 03 Windows Operating Systems (Cont.). PYP002 Preparatory Computer ScienceWindows Operating System2 Objectives Develop a good understanding of 1. The.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Tutorial 8 Sharing, Integrating and Analyzing Data
HiVision SNMP Software.
WorkPad 4 Quick Start WorkPad 4 Quick Start  Business Optix brings the rigor and discipline of business modelling and design into.
Microsoft Office 2010 Office 2010 and Windows 7: Essential Concepts and Skills.
Chapter 2 Querying a Database
1 ADVANCED MICROSOFT WORD Lesson 15 – Creating Forms and Working with Web Documents Microsoft Office 2003: Advanced.
Advanced Tables Lesson 9. Objectives Creating a Custom Table When a table template doesn’t suit your needs, you can create a custom table in Design view.
AMBIT Software for Data Management and (Q)SAR Applications Nina Jeliazkova Bulgarian Academy of Sciences Institute for Parallel Processing Sofia Bulgaria.
ARCHIBUS Log On Instructions. Log Into ARCHIBUS Web Central Log In Screen 1.Open your Internet browser. 2.Enter the URL to view the ARCHIBUS Login Page.
A ‘How To’ on Reproducing Data Obtained During The CHEM6128: Mini Project.
Databases and LINQ Visual Basic 2010 How to Program 1.
What’s New in VRS? GUGM May 15, 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
AMBIT Chemoinformatics Software for Data Management Joanna Jaworska Nina Jeliazkova P&G Brussels, Ideaconsult Ltd., Belgium Bulgaria.
Tutorial 121 Creating a New Web Forms Page You will find that creating Web Forms is similar to creating traditional Windows applications in Visual Basic.
Microsoft Access Lesson 1 Lexington Technology Center February 11, 2003 Bob Herring On the Web at
Marcel Casado NCAR/RAP WEATHER WARNING TOOL NCAR.
Management Information Systems MS Access MS Access is an application software that facilitates us to create Database Management Systems (DBMS)
AMBIT Chemoinformatics Software for Data Management Joanna Jaworska Nina Jeliazkova P&G Brussels, Ideaconsult Ltd., Belgium Bulgaria.
Computing Fundamentals Module Lesson 3 — Changing Settings and Customizing the Desktop Computer Literacy BASICS.
Access 2013 Microsoft Access 2013 is a database application that is ideal for gathering and understanding data that’s been collected on just about anything.
Training Guide for Inzalo SOP Users. This guide has been prepared to demonstrate the use of the Inzalo Intranet based SOP applications. The scope of this.
Key Applications Module Lesson 21 — Access Essentials
XP New Perspectives on Microsoft Access 2002 Tutorial 1 1 Microsoft Access 2002 Tutorial 1 – Introduction To Microsoft Access 2002.
MS Access 2007 Management Information Systems 1. Overview 2  What is MS Access?  Access Terminology  Access Window  Database Window  Create New Database.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Creating Graphical User Interfaces (GUI’s) with MATLAB By Jeffrey A. Webb OSU Gateway Coalition Member.
CIS111 PC Literacy Getting Started with Windows XP.
Lesson 01: Introduction to Database Software. At the end of this lesson, students should be able to: State the usage of database software. Start a database.
LANDESK SOFTWARE CONFIDENTIAL Tips and Tricks with Filters Jenny Lardh.
McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Working with Data Lists.
Introduction to KE EMu
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
Microsoft Office 2013 Try It! Chapter 4 Storing Data in Access.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
Institute for the Protection and Security of the Citizen HAZAS – Hazard Assessment ECCAIRS Technical Course Provided by the Joint Research Centre - Ispra.
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
Downloading and Installing GRASP-AF Workshop Ian Robson Information Analyst, North of England Cardiovascular Network.
The Next Step Hudson Fare Files 102 – Import & upload Rev. 10/14.
AdisInsight User Guide July 2015
Visual Basic 2010 How to Program
Computer Literacy BASICS
Business Objects Overview
Microsoft Access 2013 Bobby Wan.
Introduction to Web programming
Office 2010 and Windows 7: Essential Concepts and Skills
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
Optimizing Efficiency + Funding
Learning about Taxes with Intuit ProFile
Microsoft Office Access 2003
Microsoft Office Access 2003
Microsoft Excel 2007 – Level 2
Learning about Taxes with Intuit ProFile
ACE Secure Data Portal - Accounts Tab - Statements
Tutorial 7 – Integrating Access With the Web and With Other Programs
Tutorial Introduction to help.ebsco.com.
Presentation transcript:

CEFIC LRI Tools – Ambit 1.21 Nina Jeliazkova Ideaconsult Ltd. Sofia, Bulgaria

QSAR Awareness Day, JRC, Ispra,Italy2 Outline Ambit overview Demo : 1. Finding basic information about a query compound in the database 2. Complex query in the database –retrieve data meeting multiple criteria from Ambit database 3. Import data from EURAS Gold standard Bioconcentration database

QSAR Awareness Day, JRC, Ispra,Italy3 Introduction – why Ambit ? Limited free, publicly accessible, methodologically transparent software was identified as one of the roadblocks for broadening use of in-silico methods (ICCA Workshop in Setubal 2002, OECD) Realization that efficient use of existing information on chemicals requires better ways for Storage standardized formats, computer automated verification of structures, capability to store large amounts of data Taking advantage of rapidly evolving field of data mining and extraction of relevant information

QSAR Awareness Day, JRC, Ispra,Italy4 IT strategy Ambit - building blocks for Decision Support System High emphasis on interoperability for “plug and play” Flexibility modular design Transparency Open source, relying on open standards. Open source software lowers the user barrier, facilitates the dissemination activities and enables the reproducibility of models and results The cheminformatics functionality relies on the open source Java library – The Chemistry Development Kit The software is based on MySQL database ( which is the most popular open source relational database. Chemical Markup Language (CML) Chemical Markup Language (CML) acknowledged method of encoding chemical data in XML acknowledged method of encoding chemical data in XML Is being adopted by a large number of chemical organisations, from government, through commercial to academia. Is being adopted by a large number of chemical organisations, from government, through commercial to academia. The choice of CML for the internal format makes the database independent of the software which is able to access it, in contrast to some proprietary solutions. The choice of CML for the internal format makes the database independent of the software which is able to access it, in contrast to some proprietary solutions.

QSAR Awareness Day, JRC, Ispra,Italy5 IT strategy Desktop installation: MySQL database and standalone application (AmbitDatabaseTools) on the same PC Intranet installation: MySQL database on a server and standalone application (AmbitDatabaseTools) on the user PCs Internet installation – My SQL Database and web server (JSP and Servlets), Web browser as user interface

QSAR Awareness Day, JRC, Ispra,Italy6 Ambit overview The AMBIT database: stores chemical structures, their identifiers such as CAS, INChI numbers; attributes such as molecular descriptors, experimental data together with test descriptions, and literature references. The database can also store QSAR models. In addition the software can generate a suite of 2D and 3D molecular descriptors. can be searched by identifiers, attribute value or range, experimental data value or range, user defined structure and substructure, structural similarity AMBIT database contains over chemical compounds with data imported from over a dozen databases [ The number of compounds is growing all the time and one the of system’s great strengths is that any dataset can be imported for comparison and analysis. AMBITDatabaseTools 1.21 allows the user to create a local database and to import his own sets of chemical compounds. AMBIT Discovery performs chemical grouping and assesses the applicability domain of a QSAR offering a variety of methods including using different approaches to similarity assessments: statistical that rely on ‘descriptor space’; approaches based on mechanistic understanding; and approaches based on structural similarity. ECB QMRF inventory – a tailored version of Ambit database (under development). Will store information in QMRF. Large effort on standardization

QSAR Awareness Day, JRC, Ispra,Italy7 AMBIT Database Today Not restricted to these datasets! Any dataset can be imported. (e.g. DSSTox, AQUIRE, LLNA …)

QSAR Awareness Day, JRC, Ispra,Italy8 AMBIT Database Schema

QSAR Awareness Day, JRC, Ispra,Italy9 AMBIT Online: Similarity search

QSAR Awareness Day, JRC, Ispra,Italy10 AMBIT Online: Query result

QSAR Awareness Day, JRC, Ispra,Italy11 Links to other databases: example: KEGG

QSAR Awareness Day, JRC, Ispra,Italy12 Information about QSAR models

QSAR Awareness Day, JRC, Ispra,Italy13 Search AQUIRE database online

QSAR Awareness Day, JRC, Ispra,Italy14 Search EURAS Bioconcentration database online

QSAR Awareness Day, JRC, Ispra,Italy15 Ambit Database Tools 1.21 AMBITDatabase main window consists of following areas: Task bar on the left; Task bar Molecule browser (top right); Molecule browser Molecule data tabs (bottom right); Molecule data tabs Fast SMILES entry panel (top); Fast SMILES entry panel Status bar at the bottom. Standalone application available at

QSAR Awareness Day, JRC, Ispra,Italy16 Demo: 1. Finding basic information about a query compound in the database 2. Complex query in the database – retrieve data meeting multiple criteria from Ambit database 3. Import data from EURAS Gold standard Bioconcentration database

QSAR Awareness Day, JRC, Ispra,Italy17 Exercise 1. Finding basic information about a query compound in the database Launch AmbitDatabaseTools 1.20 Start menu/ All Programs/ CEFIC- LRI/Ambit 1.20 Ambit database tools main screen. Various tasks can be started from the menu options at the left panel. This exercise uses Search / CAS RN menu to lookup for compound with specific CAS RN

QSAR Awareness Day, JRC, Ispra,Italy18 Exercise 1a. Lookup by CAS RN An input box appears Enter and click OK. The result appears in top panel (Molecule browser) Click on 3D tab to view the 3D structure Further processing – save, calculate descriptors, etc.

QSAR Awareness Day, JRC, Ispra,Italy19 Exercise 1b. Retrieve descriptors The objective of this exercise is to retrieve values of several descriptors from the database. The descriptors we are interested are LogP Crossectional diameter Maximum diameter Molecular weight Use Molecule/Advanced data retrieval menu

QSAR Awareness Day, JRC, Ispra,Italy20 Exercise 1b. Retrieve descriptors The following window appears Check Read descriptors row The following window appears: Check following descriptors : XLogPDescriptor WeightDescriptor CrossectionalDiameterDescriptor MaximumDiameterDescriptor

QSAR Awareness Day, JRC, Ispra,Italy21 Exercise 1b. Retrieve descriptors The results appear in Descriptors tab Further processing – save, etc.

QSAR Awareness Day, JRC, Ispra,Italy22 Exercise 1c. Retrieve AQUIRE data Use Molecule/AQUIRE menu to retrieve toxicity data for hexaldehyde The results can be observed in bottom panel, EXPERIMENTAL data tab. Click on each row to view more details. Save to a file using File/Save menu (sdf, csv, xls, txt)

QSAR Awareness Day, JRC, Ispra,Italy23 SDF file for hexaldehyde CDK 6/23/07,13: V C C C C C ……… M END > 2596 > > > > O=CCCCCC > LC50=22000,ug/L Pimephales promelas > >

QSAR Awareness Day, JRC, Ispra,Italy24 XLS file for hexaldehyde

QSAR Awareness Day, JRC, Ispra,Italy25 Exercise 2. Complex queries: Use Ambit database to retrieve data that meet multiple criteria Use Search options /options menu to configure desired searches Switch to Similarity tab and set 0,7 for Tanimoto threshold (we will be searching for structures with Tanimoto similarity > 0.7)

QSAR Awareness Day, JRC, Ispra,Italy26 Exercise 2a. Similarity search Use Search/Structure search menu to invoke advanced query window Draw dimetylphtalate as shown at the figure Click Similarity button Browse the 7 compounds found (in Molecule Browser) Go to Search/options and lower threshold to 0.6 Use Search/Structure search/Similarity again with the same compound

QSAR Awareness Day, JRC, Ispra,Italy27 Exercise 2a. Similarity search Now there are 156 compounds with Tanimoto similarity > 0.6 We will be using Molecule/Save as dataset menu to store the query results into the database Hint: you can store query results directly into database, without loading into Molecule Browser, by setting Search Options/Result destination – DATABASE and then performing the query

QSAR Awareness Day, JRC, Ispra,Italy28 Database and datasets - background There can be many Ambit databases running on one MySQL server Within Ambit database the chemical compounds can be grouped in many subsets. Typically, one database consists of multiple subsets (datasets), corresponding to the origin of the data (e.g. the file used to import the compounds) The search results can be marked as a separate subset within Ambit database The search can be performed within entire Ambit database or just on a selected subset. This allows to use results of one query as a input to another and restrict the set of structures step by step Database server (MySQL) Ambit Database 1 (e.g. ambit) Dataset 1 ( structures from NCI) Dataset 2 (600 structures from DSSTox EPA Fathead Minnow) Dataset 3 (AQUIRE) Dataset 4 (DSSTox carcinogenic potency data) Dataset 5 (EURAS Bioconcentration factor data) Dataset 6 (my similarity search results) Ambit Database 2 (e.g. test_database) … Ambit Database N (e.g. my_secret_dataset) Other (non-Ambit) databases

QSAR Awareness Day, JRC, Ispra,Italy29 Exercise 2a. Similarity search Use Molecule/Save as dataset menu to store the query results into the database In the dialog box (as at right), add “+” button to add a new entry for the dataset. Type in the name for the dataset (e.g. “Similarity search Tanimoto > 0.6”) Click OK

QSAR Awareness Day, JRC, Ispra,Italy30 Exercise 2a. Similarity search Now the new dataset is available in the datasets list and can be used to restrict subsequent queries Use Search options/Dataset menu to select which dataset to be searched, select “Similarity search Tanimoto > 0.6” and click OK Note: this will not load any structures into Molecule browser!

QSAR Awareness Day, JRC, Ispra,Italy31 Exercise 2b. Pre-set physicochemical profile The objective is to extract compounds that have physicochemical properties, relevant for bioaccumulation from the set of structurally similar compounds found by previous query. The recommended descriptors and ranges are: LogP < 4.5 Molecular weight < 1100 Cross sectional diameter < 17.4 Å Maximum diameter < 43 Å

QSAR Awareness Day, JRC, Ispra,Italy32 Exercise 2b. Pre-set physicochemical profile Use Search/Structure search menu The window with options for structure, descriptors and experimental data queries appears. Click on Descriptors icon to obtain a list of descriptors available in the database

QSAR Awareness Day, JRC, Ispra,Italy33 Exercise 2b. Pre-set physicochemical profile Select XLogP descriptor (click on first column Click on Condition column and select “<” sign. Double click on the next column and enter 4.5 Repeat with descriptors: WeightDescriptor (Molecular weight) < 1100 CrosssectionalDiameterDescriptor (crossectional diameter) < 17.4 MaximumDiameterDescriptor (maximum diameter or maximum length) < 43 Click the Search button

QSAR Awareness Day, JRC, Ispra,Italy34 Exercise 2b. Pre-set physicochemical profile 123 out of the 156 structurally similar compounds have the predefined profile. The descriptor values can be inspected in the Descriptors tab

QSAR Awareness Day, JRC, Ispra,Italy35 Exercise 2c. Retrieve available toxicity data Use Search Options/Options menu to select he endpoint Select AQUIRE tab Select LC50 (Lethal concentration to 50% of test compounds) from the first list box

QSAR Awareness Day, JRC, Ispra,Italy36 Exercise 2c. Retrieve available toxicity data The next step is to tell the software we want to retrieve the data for all retrieved compounds (not only for the current structure). To do this: Select Molecule processing tab Select Molecule Browser: Current set of structures from the first list box

QSAR Awareness Day, JRC, Ispra,Italy37 Exercise 2c. Retrieve available toxicity data Use Molecule/AQUIRE menu to retrieve LC50 data for the current set of compounds Click Start button.

QSAR Awareness Day, JRC, Ispra,Italy38 Exercise 2c. Retrieve available toxicity data Browse the compounds to view AQUIRE data at the bottom panel Repeat the same procedure to retrieve BCF data from AQUIRE

QSAR Awareness Day, JRC, Ispra,Italy39 Exercise 2d. Retrieve available toxicity data (ER Binding) Structure/Search menu Click experiments Select DSSTox- ERBinding Select Endpoint=“ER RBA” Click Search

QSAR Awareness Day, JRC, Ispra,Italy40 Exercise 2d. Retrieve available toxicity data (ER Binding) Browse ER Binding data, save results into file

QSAR Awareness Day, JRC, Ispra,Italy41 More exercises Batch search Import structures into database Import descriptors and experimental data (e.g. bioconcentration factor dataset) Import QSAR models Database processing Descriptor calculation Atom environments, Fingerprint, SMILES generation Create new (empty) database. Create users for the new database Import compounds

QSAR Awareness Day, JRC, Ispra,Italy42 Ambit - Summary AMBIT software is a set of libraries and tools, providing various chemoinformatics functionalities for data management. The AMBIT system consists of a database and functional modules allowing a variety of flexible searches and mining of the data stored in the database. The unique feature of AMBIT is the ability to store multifaceted information about chemical structures and provide a searchable interface linking these diverse components.

Thank you! Questions?