3. genome analysis
The first DNA-based genome to be sequenced in its entirety was that of bacteriophage ΦX174 (5,386 bp), sequenced by Frederick Sanger in 1977.

There are several things to notice in this plot. First, the genome is circular. The densities of the four nucleotides are plotted in the four outermost circles. These densities are not evenly distributed: although all four scales range from 0% (minimum, no colour) to 40% (maximum colour intensity), it is easy to see that the sequence is dominated by T's (red circle), that there are relatively few G's (outermost turquoise circle) and C's (pink circle), and that there are a few A-rich regions (green second circle). Many genes overlap. The genes are shown in the "annotation circle", the fifth circle from the outside, where the blue bands represent genes in the forward direction.

GC skew = (G - C)/(G + C)
AT skew = (A - T)/(A + T)

where G, C, A and T are the counts of each nucleotide in the sequence window being examined.
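The skew formulas and per-nucleotide densities described for this plot can be sketched in a few lines of Python. This is a minimal illustration, not the method used to draw the plot: the function names (`skew`, `gc_skew`, `at_skew`, `base_density`, `windowed`) and the window and step sizes are assumptions chosen for the example, and wrap-around at the end of a circular genome is ignored for simplicity.

```python
from collections import Counter


def skew(seq, plus, minus):
    """Return (plus - minus) / (plus + minus) for two nucleotides in seq."""
    counts = Counter(seq.upper())
    total = counts[plus] + counts[minus]
    if total == 0:
        # Neither nucleotide occurs in this window; report no skew.
        return 0.0
    return (counts[plus] - counts[minus]) / total


def gc_skew(seq):
    """GC skew = (G - C) / (G + C)."""
    return skew(seq, "G", "C")


def at_skew(seq):
    """AT skew = (A - T) / (A + T)."""
    return skew(seq, "A", "T")


def base_density(seq, base):
    """Percentage of positions in seq occupied by the given nucleotide."""
    if not seq:
        return 0.0
    return 100.0 * seq.upper().count(base) / len(seq)


def windowed(seq, window, step, func):
    """Apply func to consecutive windows; return (start, value) pairs.

    Windows do not wrap around, so the last partial window is dropped.
    """
    return [
        (start, func(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    toy = "GGGCATTTTAGGCC"  # hypothetical sequence, not phage data
    print(windowed(toy, window=4, step=2, func=gc_skew))
    print(base_density(toy, "T"))
```

For a real circular genome, one option would be to append the first `window - 1` bases to the end of the sequence before calling `windowed`, so that windows spanning the origin are also covered.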
exploring genomes
Circular maps of the chromosome and plasmids of enteropathogenic E. coli (EPEC) strain E2348/69 (from Iguchi A et al., J. Bacteriol. 2009)

(A) EPEC strain E2348/69 chromosome. From the outside in:
- first circle: locations of PPs and IEs (purple, lambda-like PPs; light blue, other PPs; green, IEs and the LEE element)
- second circle: nucleotide sequence positions (in Mbp)
- third and fourth circles: CDSs transcribed clockwise and anticlockwise, respectively (gray, conserved in all eight other sequenced E. coli strains; red, conserved only in the B2 phylogroup; yellow, variable distribution; blue, E2348/69 specific)
- fifth circle: tRNA genes (red)
- sixth circle: rRNA operons (blue)
- seventh circle: G+C content
- eighth circle: GC skew

(B) EPEC strain E2348/69 plasmids. The boxes in the outer and inner circles represent CDSs transcribed clockwise and anticlockwise, respectively. Pseudogenes are indicated by black boxes, and other CDSs are indicated by the colors described above for panel A.

CDS = CoDing Sequence: the region of nucleotides that corresponds to the sequence of amino acids in the predicted protein
PP = prophage: a phage (viral) genome inserted and integrated into the circular bacterial DNA chromosome
IE = integrative element
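To make the definition of a CDS concrete, the sketch below translates a short coding sequence into its predicted amino-acid sequence using the standard genetic code, and reverse-complements a sequence, which is what is needed before translating a CDS annotated on the opposite (anticlockwise) strand. The function names and the toy sequence are hypothetical and do not come from the E2348/69 annotation; unknown codons (for example those containing N) are rendered as "X" by assumption.

```python
# Standard genetic code, written compactly in T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_cds(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    toy_cds = "ATGACCACTTGCCCATAA"  # hypothetical CDS: Met Thr Thr Cys Pro stop
    print(translate_cds(toy_cds))
    # A CDS on the opposite strand is reverse-complemented first.
    print(translate_cds(reverse_complement(reverse_complement(toy_cds))))
```

Real annotation pipelines also handle alternative start codons and frameshifts, which this sketch deliberately leaves out.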
Structural genomics
[Figure labels: DNA, Algorithm, Residue (THR THR CYS PRO), Protein Structure]
X-ray diffractometry, NMR, cryo-electron tomography