
Analysis: Tools for directly examining sequence
Scenario 6
What follows is a simulation of the proposed sequence interface. A PC-based prototype exists, but the interface has not yet been ported to the web. As you go through the simulation, please consider what capabilities you would want to serve your research and annotation interests. A narrative to help you go through the simulation appears in a red-bordered box, such as the one below.
To begin:
1. Click on Slide Show (on the upper toolbar)
2. Click View Show
3. Click the Continue button

You’re intrigued by the motif you found in front of Anabaena PCC 7120 all4312 and its cyanobacterial orthologs (see Scenarios 1 and 5). You’d like to look more deeply into it by examining the sequence near the orf. You’re not sure what you’re looking for, and you’re open to anything.

Anabaena PCC 7120: all4312
Ortholog tabs: Anab7120:all4312, NostPunc:, TricEryt:, Syny6803:sll1330, TherElon:tlr1330
Menu: Options, Annotate, Main Menu, History
Replicon: Chromosome
Coordinates: (stop) < (start-GTG) [System]
Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator [System]
Syny6803:sll1330: Expression data (click to expand) [Experiment]
Mutant: None
Syny6803:sll1330: Failed to segregate [Experiment]
Cyanobacterial orthologs: NostPunc, TricEryt, Syny6803, TherElon
A Lawrence/Collier conserved motif set
Scenario 1 left us with the provocative finding that all five cyanobacterial orthologs of all4312 are preceded by the same motif. What is that motif and what might it mean? To answer that question, click on the coordinates of all4312 to get to the sequence interface.

GCTGAGTTAGGAGTAAAAATCATTATTTTTCCTCCCTCTGCCTCCTCTAC
ACCCTCTGCCCACTTAGAGTTGAGCGTTGGTTGCTAAATCTTTCTTTTGT
TAACTTTGCTTGTGTTTGTGGAGGATTAGCATTCAAAATTTCCATGTTAA
ATCGGTATCCAACATTGCGGATAGTCTGAATGAGGCTAGGTTGGCGGGGA
TCAAGTTCTACTTTTTTACGTAACGATAGAACATGAGTGTCAATGGTACG
CGGATTGTCGATAGCGTCAGGCCACGCACGACGTAGCAACTCTGATCGGC
TCAAAGGTACTCCACCAGCTTGCGCCAAAACGTACAACAAACTAAATTCC
TGTGGAGTCAGGTCGATAAACTCCCCTTGGAATCGTACACGGCGTTGGAC
TAAATCGATTTGCAAAGTACCATAATCCAAATAAGCAGGAGCAGTAGGTG
TGCGCTTGCGGCGGATTAATGCCTCTACCCTAGCCAAAAACTCCTGCATC
CCAAATGGTTTGCTCAAGTAATCATCAGCTCCCGCCTTCAACCCGGCAAC
GATATCAGCCTCATTAGTCCGAGCAGATAACATGAGAATTAGCGGCTGTT
GCTGACGATGCAGCCAACGGCAAAATTCAATACCGTCACCATCTGGCAAA
TCAGCATCCAGAATCACTAGAGTTGGCTGATGGCTCAAAAAGGCTTCCCT
TGCTTGATATATGCTGGCGGCTTGATGCACACGGTATTCCAATTGTTGCA
AGTGCCAACCCAGCAACGACCTCAGATGGGGATTCCCCTCAACGATTTCA
ATACAAACCGAACCCACGGCGGCAAGACCCCTTAGCGTCGATGAATTTCA
AAGTAACAAGCCCATGCCCAAGGTTTTGTAGTCTTTGTTACTCTAGCCAC
AATTACCTCTACGTTTTTTTATAACCTATGTTGGCAAAATGTTGAGTTTA
TGCCTTGTCCAGAGCGAAGTTGATAATATTATTAAATTTTTGTTATAGTT
Feature: all4312 two-component system <
Location: Anabaena Chromosome ( bp):
Menu: Contig, GoTo, Block, Find, Display, PgUp/PgDn, Help, Quit
The interface places you in the Anabaena chromosome in the region surrounding all4312, with the orf highlighted as a block. Clicking on all4312 would take us back to the annotation page. Our goal is to look at the motif preceding the orf, so click on Display.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Display menu: Alternate starts, Annotated features, Predicted features, Private features, Tandem repeats, Inverted repeats, Base symbols, Invert display
Selected: Predicted features
We want to display the motif predicted by Lawrence/Collier, so click on Predicted features.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
I was hoping to see sequences I recognized, but that’s made more difficult by the orf being on the complementary strand. I could invert the entire display, but instead I’ll just work on a segment. Click Block.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Block menu: Define, Invert, Translate, Save, Tools
Selected: Define
The highlighted orf sequence could now be downloaded, or first translated and then downloaded, but for now I’m interested only in the region preceding the gene. Click Define in order to highlight a new block of sequence.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Define the beginning of the block by clicking on base (4th line up). Then click on the last base on the page (lower right corner).

[Same sequence display, with the bottom four lines now highlighted as a block.]
Now that the bottom four lines are blocked, click on Block and then Invert.

[Same sequence display, with the bottom four lines highlighted as a block.]
Block menu: Define, Invert, Translate, Save, Tools
Selected: Invert

AACTATAACAAAAATTTAATAATATTATCAACTTCGCTCTGGACAAGGCA
TAAACTCAACATTTTGCCAACATAGGTTATAAAAAAACGTAGAGGTAATT
GTGGCTAGAGTAACAAAGACTACAAAACCTTGGGCATGGGCTTGTTACTT
TGAAATTCATCGACGCTAAGGGGTCTTGCCGCCGTGGGTTCGGTTTGTAT
Feature: all4312 two-component system >
Location: Anabaena Chromosome ( bp): (inverted)
Menu: Contig, GoTo, Block, Find, Display, PgUp/PgDn, Help, Quit
That’s more like it. Now a person attuned to such things can recognize the elements of a binding site for the transcriptional regulator NtcA, followed by the -10 region of a promoter, properly spaced. The gene comes shortly after that, now in the direct (blue) orientation. To get back to the full sequence, click on Block and then unInvert.
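The Invert step can be reproduced outside the interface. What follows is a minimal Python sketch, not the interface's own code: the function name reverse_complement is an assumption made for illustration, and only the four unambiguous bases are handled. It takes the last line of the original display and returns the first line of the inverted block.

```python
# Complement table for the four unambiguous DNA bases (upper and lower case).
# Ambiguity codes such as N are left unchanged by str.translate.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (hypothetical helper)."""
    return seq.translate(_COMPLEMENT)[::-1]


# Last line of the sequence display before inversion, copied from the slide.
displayed = "TGCCTTGTCCAGAGCGAAGTTGATAATATTATTAAATTTTTGTTATAGTT"

inverted = reverse_complement(displayed)
print(inverted)
```

Because the block is reversed as well as complemented, the last displayed line becomes the first line of the inverted block, which is why the gene now reads in the direct orientation.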

[Same inverted display of the four-line block.]
Block menu: Invert, Translate, Save, Tools
Selected: unInvert

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Had we suspected an NtcA site, we could have found this same site by a direct search for its consensus sequence (though there are better ways than this): click on Find, then Sequence, and type in the NtcA/promoter consensus sequence.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Find menu: Gene name, Description, Sequence

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Find menu: Gene name, Description, Sequence
Search string: GTA.{8}TAC.{20,24}TA...T
The NtcA binding sequence is flexible, like most sequences of biological interest. Search tools need to be similarly flexible. This search string says: Look for “GTA” followed by 8 nucleotides of any sort, followed by “TAC”, followed by 20 to 24 nucleotides, followed by “TA”, three nucleotides, then a final “T”. Press Enter to find a matching sequence.
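The search string is also a valid regular expression as written, so the search can be tried directly. What follows is a sketch using Python's standard re module on the inverted four-line block from the earlier slide. How the interface itself runs the search is not stated; find_sites and the variable names are assumptions for illustration.

```python
import re

# The four-line block upstream of all4312, in the inverted (gene-direct)
# orientation, copied from the inverted display.
upstream = (
    "AACTATAACAAAAATTTAATAATATTATCAACTTCGCTCTGGACAAGGCA"
    "TAAACTCAACATTTTGCCAACATAGGTTATAAAAAAACGTAGAGGTAATT"
    "GTGGCTAGAGTAACAAAGACTACAAAACCTTGGGCATGGGCTTGTTACTT"
    "TGAAATTCATCGACGCTAAGGGGTCTTGCCGCCGTGGGTTCGGTTTGTAT"
)

# Search string from the slide: GTA, 8 of anything, TAC,
# 20 to 24 of anything, TA, 3 of anything, T.
NTCA_PROMOTER = re.compile(r"GTA.{8}TAC.{20,24}TA...T")


def find_sites(seq: str):
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in NTCA_PROMOTER.finditer(seq)]


for start, site in find_sites(upstream):
    print(start, site)
```

The pattern only matches the strand it is given, so to search the displayed (uninverted) sequence you would also need to search its reverse complement.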

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
It is sometimes easier to see patterns in DNA sequences if we can engage our visual recognition abilities. Click Display and then Base symbols to try it out for yourself.

[Same sequence display: Anabaena chromosome, region surrounding all4312.]
Display menu: Alternate starts, Annotated features, Local features, Tandem repeats, Inverted repeats, Base symbols

[Display: same region with Base symbols switched on. Bases outside the all4312 orf are drawn as open (□) and filled (■) squares; the orf itself remains as letters.]
Feature: all4312 two-component system <
Location: Anabaena Chromosome ( bp):
Menu: Contig, GoTo, Block, Find, Display, PgUp/PgDn, Help, Quit
Purines are represented as open symbols and pyrimidines as filled-in symbols. A and T are purple, G and C are green. Fortunately, you don’t have to remember any of this to recognize patterns. Look at the top line. It’s immediately evident (as it probably was not before) that all4312 is followed by a string of... pyrimidines and then a string of purines. Possibly a termination region? Let’s look beyond. Press the right arrow key to move the display one line down.
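The base-symbol encoding is simple enough to sketch. The Python below follows the rules stated on the slide (purines open, pyrimidines filled; A and T purple, G and C green). The function names base_symbols and base_color are assumptions, and colors are returned as plain strings because a text console cannot show them.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")
OPEN, FILLED = "□", "■"


def base_symbols(seq: str) -> str:
    """Draw purines as open squares and pyrimidines as filled squares.

    Characters other than A, C, G, T are passed through unchanged.
    """
    out = []
    for base in seq.upper():
        if base in PURINES:
            out.append(OPEN)
        elif base in PYRIMIDINES:
            out.append(FILLED)
        else:
            out.append(base)
    return "".join(out)


def base_color(base: str):
    """Color rule from the slide: A/T purple, G/C green, anything else None."""
    b = base.upper()
    if b in {"A", "T"}:
        return "purple"
    if b in {"G", "C"}:
        return "green"
    return None


# Top line of the sequence display, copied from the slide.
top_line = "GCTGAGTTAGGAGTAAAAATCATTATTTTTCCTCCCTCTGCCTCCTCTAC"
print(base_symbols(top_line))
```

Runs of filled or open squares in the printed line correspond to the pyrimidine and purine stretches that the narrative points out.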

[Display: the same view moved one line down. The end of a second gene (...AACCAAGCCGATGAAGAATGGAACTAA) now appears on the top line, followed by base symbols; the all4312 orf remains as letters.]
Features: alr4311 ABC transporter >, all4312 two-component system <
Location: Anabaena Chromosome ( bp):
Menu: Contig, GoTo, Block, Find, Display, PgUp/PgDn, Help, Quit
From the change in color from yellow to blue, we’ve evidently run into a gene on the other strand, this one also ending in a string of pyrimidines. Let’s look further by clicking on PgUp.

[Display: the preceding page of sequence. The end of alr4310 and the whole of alr4311 appear as letters; the intergenic regions appear as base symbols.]
Features: alr4310 hypothetical protein >, alr4311 ABC transporter >
Location: Anabaena Chromosome ( bp):
Menu: Contig, GoTo, Block, Find, Display, PgUp/PgDn, Help, Quit
The intergenic region between alr4310 and alr4311 shows a remarkable pattern. I’ll give you a few seconds to try to find it yourself...
...a series of tandem repeats. Now that we see it by eye, we can ask the computer to find them in a more systematic fashion. Click on Display and then Tandem repeats.

[Same display: alr4310 and alr4311, with the intergenic regions as base symbols.]
Display menu: Alternate starts, Annotated features, Local features, Tandem repeats, Inverted repeats, Base symbols
Selected: Tandem repeats

[Same display: alr4310 and alr4311, now with the tandem repeats in the intergenic region marked.]
The machine saw more than we did! Not only are the repeats we saw more extensive, but there is also another set of repeats nearby. What do they mean? Hard to say, but certainly our chances of figuring them out are better if we can engage our visual imagination and if we can see them in a biological context.
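How the interface detects tandem repeats is not described. As one way to picture a systematic search, here is a small Python sketch that finds exact tandem repeats with a regular-expression backreference. The function name, its parameters, and the toy sequence are all hypothetical, and the intergenic sequence itself is not used because it is not legible in the transcript.

```python
import re


def tandem_repeats(seq: str, min_unit: int = 2, max_unit: int = 10,
                   min_copies: int = 3):
    """Return (1-based start, repeat unit, copy count) for exact tandem repeats.

    A unit of min_unit..max_unit bases must be followed immediately by at
    least min_copies - 1 further identical copies. Matches do not overlap,
    and the shortest unit that works at a given start is reported.
    """
    pattern = re.compile(
        r"(.{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_copies - 1)
    )
    return [
        (m.start() + 1, m.group(1), len(m.group(0)) // len(m.group(1)))
        for m in pattern.finditer(seq.upper())
    ]


# Toy sequence (made up for illustration): ATC repeated three times.
toy = "GGATCATCATCGG"
print(tandem_repeats(toy))
```

This sketch only finds perfect copies. Repeats in real genomes often carry mismatches, so a production tool would need to tolerate them.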

Analysis: Tools for directly examining sequence
Summary (article of faith)
The freshest insights and most fundamental discoveries require intimate contact with the basic phenomenon. In genomic analysis, the basic phenomenon is often the genome.
The sequence interface makes it possible to view DNA features within a biological context.
The interface provides tools to aid discovery of features within noncoding DNA.
Software that does most of what you saw already exists, but it would need to be rewritten before it could serve as a web interface.