Zagreb 30 June A Database of biological pathways David Croft
Zagreb 30 June What are pathways?
Zagreb 30 June What are pathways?
Zagreb 30 June What are pathways?
Zagreb 30 June Rationale – Journal information Nature 407(6805):770-6.The Biochemistry of Apoptosis. “Caspase-8 is the key initiator caspase in the death-receptor pathway. Upon ligand binding, death receptors such as CD95 (Apo-1/Fas) aggregate and form membrane- bound signalling complexes (Box 3). These complexes then recruit, through adapter proteins, several molecules of procaspase-8, resulting in a high local concentration of zymogen. The induced proximity model posits that under these crowded conditions, the low intrinsic protease activity of procaspase-8 (ref. 20) is sufficient to allow the various proenzyme molecules to mutually cleave and activate each other (Box 2). A similar mechanism of action has been proposed to mediate the activation of several other caspases, including caspase-2 and the nematode caspase CED-3 (ref. 21).” How can I access the pathway described here and reuse it?
Zagreb 30 June Nature Oct 12;407(6805): The biochemistry of apoptosis. Rationale - Figures A picture paints a thousand words… but…. Just pixels Omits key details Assumes Fact or Hypothesis?
Zagreb 30 June Reactome is… A free, online, open-source curated database of pathways and reactions in human biology Authored by expert biologists, maintained by Reactome editorial staff (curators) Mapped to cellular compartment (but not tissue or developmental stage)
Zagreb 30 June Reaction Example 1: Enzymatic
Zagreb 30 June Reaction Example 2: Transport REACT_1338.3
Zagreb 30 June Other Reaction Types Dimerization Phosphorylation
Zagreb 30 June Reactions Connect into Pathways OUTPUT INPUT CATALYST OUTPUT INPUT CATALYST INPUT OUTPUT CATALYST
Zagreb 30 June Reactome Data Model Events: Reactions Pathways Entities: Proteins Complexes Small compounds Modulation: Inhibition Activation
Zagreb 30 June Where Reactome’s Data Comes From Expert and curator create outline of new pathway Expert fills in details Curator enters information into database Reviewer (another expert) checks biological correctness Curator (staff) Expert (recruited) Reviewer (recruited)
Zagreb 30 June Data Expansion - Link-outs From Reactome GO Molecular Function Compartment Biological process KEGG, ChEBI – small molecules UniProt – proteins Sequence dbs – Ensembl, OMIM, Entrez Gene, RefSeq, HapMap, UCSC, KEGG Gene PubMed references – literature evidence for events
Zagreb 30 June Data Expansion – Projecting to Other Species A + ATP A + ADP -P B Human A + ATP A + ADP -P B Mouse B A Drosophila Reaction not inferred No orthologue - Protein not inferred + ATP
Zagreb 30 June Species Selection
Zagreb 30 June Exportable Protein-Protein Interactions Inferred from complexes and reactions Interactions between proteins in the same complex, reaction, or adjoining reaction Lists available from Downloads See Readme document for more details
Zagreb 30 June Coverage – Current TOC Scroll down for lots more...
Zagreb 30 June Planned Coverage – Editorial Calendar wiki.reactome.org/index.php/Reactome_Calendar
Zagreb 30 June Reactome Tools Interactive Pathway Browser Pathway Mapping and Over-representation Expression overlay onto pathways Molecular Interaction overlay Biomart
Zagreb 30 June BioMart Dataset PK FK Pathway Protein Full details of BioMart at: Gene Small molecule
Zagreb 30 June BioMart Advantages Query without needing to know data model Scales for big datasets Easy linking to other Marts (federation)
Zagreb 30 June BioMart and Reactome: Datasets Main tables: Complex Pathway Reaction Interaction Dimension tables: Gene Protein Small Molecule Literature reference
Zagreb 30 June Tutorial
Zagreb 30 June Front Page Main text Navigation bar
Zagreb 30 June Exercise 1 1.What’s the latest news item on the Reactome homepage? 2.How many human proteins are represented in Reactome? 3.What date is the next release? What is the first new topic that will be included?
Zagreb 30 June The Pathway Browser Species selector Search & Analyze bar Sidebar Pathway Diagram Panel Details Panel (hidden) Thumbnail
Zagreb 30 June Pathways tab – pathway hierarchy Pathway Reaction Black-box
Zagreb 30 June Exercise 2 From the homepage, search for ‘Signaling by Notch’. Click on the top pathway hit. This will open it in the Pathway Browser. Ignoring the diagram for now, look at the Pathways tab on the left. 1.How many sub-pathways does this pathway have? 2.How many reactions are in the first of these sub-pathways? 3.What reaction follows Notch 2 precursor transport to Golgi? Hint: If it’s not visible, open the Details pane at the bottom of the page by clicking on the blue triangle.
Zagreb 30 June The Pathway Browser - Pathway Diagrams Boxes are proteins, protein sets, mixed sets or complexes. Ovals are small molecules (or sets of) Green boxes are proteins or sets, blue are complexes. Catalyst Input Outputs Compartment Reaction node Transition Binding Dissociation Omitted Uncertain Regulation +ve -ve
Zagreb 30 June Exercise 3 From the Homepage, search for the pathway ‘Effects of PIP2 hydrolysis’ and open it in the Pathway Browser. 1.What symbol represents the reaction for ‘Binding of IP3 to the IP3 receptor’? 2.What symbol represents the reaction ‘Transport of Ca++ from platelet dense tubular system to cytoplasm’? What subtype of reaction is this? 3.What is the catalyst (descriptive name) for ‘2-AG hydrolysis to arachidonate by MAGL’? Can you find its UniProt ID and name the two outputs of this reaction?
Zagreb 30 June Navigating in the Pathway Browser I Click here Highlights Details here Click here to open pathway diagram... Home and Analyze buttons Search
Zagreb 30 June Highlights Details here Navigating in the Pathway Browser II Click here Zoom
Zagreb 30 June Exercise 4 Open the pathway Apoptosis in the Pathway Browser. Select the sub-pathway box ‘Intrinsic Pathway in Apoptosis’. 1.How do you open the pathway diagram for this sub-pathway? 2.With this pathway diagram open, what happens if you click on the sub-pathway ‘BH3-only proteins associate with and inactivate anti- apoptotic BCL-2 members’? 3.What happens if you click on the reaction ‘Sequestration of tBID by BCL-2’ in the in the pathway hierarchy?
Zagreb 30 June The Details Panel
Zagreb 30 June Exercise 5 1.Find the reaction 'Activated type I receptor phosphorylates R- SMAD directly'. What pathway does it belong to? 2.In which cellular compartment does this reaction take place? 3.What is the associated GO molecular function? 4.What references verify this reaction? 5.Is this reaction predicted to occur in Canis familiaris? In Saccharomyces cerevisiae?
Zagreb 30 June Pathway Analysis
Zagreb 30 June Pathway Analysis – Overrepresentation ‘Top-level’ Reveal next level P-val
Zagreb 30 June Exercise 6 Check this! 1.What is the most significantly over-represented top-level pathway for this dataset? 2.How many genes are in this pathway, and how many were represented in the dataset? 3.Why is the top-level pathway Chromosome Maintenance higher in the list than Signalling by Wnt when the latter has a more significant probability score? (Hint – use the Open All button) 4.Can you interpret these results in terms of the underlying biology? (Hint: good luck, there are many correct answers!)
Zagreb 30 June Species Comparison I
Zagreb 30 June Species Comparison II Yellow = human/rat Blue = human only Grey = not relevant Black = Complex
Zagreb 30 June Exercise 7 Launch the Species Comparison and select the species Rattus norvegicus. When the results are displayed, open the pathway Complement Cascade. 1.Find Complement factor B (bottom left in the diagram) - what colour is it? What does that mean? 2.What other species is this protein inferred to be present in? Hint: You can answer this question without rerunning Species comparison. 3.Find Complement factor 2 (top middle) – why is it blue? 4.Find C3b (top left corner) – Why is it black? How many proteins contribute to this object? Are they predicted to exist in Rattus? 5.Why is Calcium grey?
Zagreb 30 June Expression Analysis I
Zagreb 30 June Expression Analysis II ‘Hot’ = high ‘Cold’ = low Step through Data columns
Zagreb 30 June Exercise 8 Launch Expression Analysis and load the example dataset (The data was obtained from ArrayExpress and comes from the processed results of an experiment studying the differentiation of KG1 cells.). Click Analyse. When the results are displayed, find the pathway Nucleotide Excision Repair. 1.How many proteins are in this pathway? 2.How many had expression data? Click on the View button to see this pathway in the Pathway Browser. Use the Experiment Browser toolbar to cycle through the timepoints. 3.Which protein has the greatest change of expression? 4.Find the complex ‘Active Pol II complex with repaired DNA template:mRNA hybrid’ (top right of the diagram). Which component of the complex has the highest expression at 24h? 5.What was the probe ID used to measure expression of this component?
Zagreb 30 June Molecular Interaction Overlay
Zagreb 30 June Exercise 9 Open the pathway diagram for Netrin-1 Signaling. 1.Find the protein SHP2 (top left of the cytosol). Right click on it and select Display Interactors. How many are there? 2.How many times has the interaction between SHP2 Adapter protein GRB2 been documented? Hint: This detail is not in Reactome. 3.Find the protein NCK1 (to the right of SHP2). Display interactors for this protein. How many are there? Can you get a list of them? 4.Display interactors for UNC5B (bottom left of the cytosol). What happens and why? 5.What is the easiest way to remove interactors?
Zagreb 30 June BioMart – selecting your dataset
Zagreb 30 June BioMart – filters
Zagreb 30 June BioMart – attributes Check to get attribute
Zagreb 30 June BioMart – results
Zagreb 30 June Exercise 10 Go to the Reactome BioMart page. 1.How would you select the “pathway” dataset? 2.If you were interested in the protein “Nucleolar transcription factor 1” (UniProt ID P17480), how would you identify pathways in Reactome involving that protein? How many pathways does your query find? 3.How would you find the UniProt IDs of the other proteins in the first of the pathways that you discovered? 4.Save the results list from 3) in tab-delimited format. How many lines of output are there, excluding column headers?
Zagreb 30 June The End