HIV Project -Matt Hagen. The Problem Are there any DNA sequences in common between HIV and human genomes? HIV-1, complete genome, chimeric clone AF033819.3HIV-1,

Slides:



Advertisements
Similar presentations
PV92 PCR/Informatics Kit
Advertisements

You start with a biologically relevant protein from a pathogen (Bacterium, virus, parasite…)
Statistics in Bioinformatics May 2, 2002 Quiz-15 min Learning objectives-Understand equally likely outcomes, Counting techniques (Example, genetic code,
Reference mapping and variant detection Peter Tsai Bioinformatics Institute, University of Auckland.
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Regulatory Motifs. Contents Biology of regulatory motifs Experimental discovery Computational discovery PSSM MEME.
The Sense of Sequense The Sense of Sequense Chris Evelo BiGCaT Bioinformatics Universiteit Maastricht.
Prepared By: Miguel Perez Joel Shepherd.  Build a Java Program to represent the Finite- Difference Method numerically and graphically for easy visualization.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Finding approximate palindromes in genomic sequences.
Whole Genome Assembly. WGA 1. Screener 2. Overlapper 3. Unitigger, 4. Scaffolder, 5. Repeat Resolver.
Genotyping of James Watson’s genome from Low-coverage Sequencing Data Sanjiv Dinakar and Yözen Hernández.
Computational Analysis of Transcript Identification Using GenBank Slides by Terry Clark.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
AP Biology. Chi-Square Purpose: To determine if a deviation from expected results is significant. (or was it just chance) Purpose: To determine if a deviation.
RNAseq analyses -- methods
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
5.3 – Advances in Genetics Trashketball!. Selecting organisms with desired traits to be parents of the next generation is… A. Inbreeding A. Inbreeding.
T-test - paired Testing for a difference. What does it do? Tests for a difference in means Compares two cases (eg soil moisture content north & south.
Positioning of DNA Uptake Sequences in the Pasteurellaceae family
% Shared DNA (expectations)
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
1 Q1-Q3 results Roderic Guigó' s lab April 11 th 2007 conference call.
Chapter 6: Continuous Probability Distributions A visual comparison.
Determine the sequence of genes along a chromosome based on the following recombination frequencies A-C 20% A-D 10% B-C 15% B-D 5%
Applications & Analysis DNA Gel Electrophoresis 1.
Chi square Test. Chi squared tests are used to determine whether the difference between an observed and expected frequency distribution is statistically.
Squaring of a number ending in 5 An approach to determine answers Quickly! Squaring of a number ending in 5.
Chapter 6: Continuous Probability Distributions A visual comparison.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Figure S1 (a) (b) Fig. S1. Hydroponics culture of Arabidopsis thaliana. (a) Illustration of the hydroponics system in the growth chamber. (b) close-up.
DNA Questions What makes up a DNA backbone? How would you describe how DNA looks? Name the 4 bases that make up DNA. “T” base can only match with? What.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Chi Square Test Dr. Asif Rehman.
ChiMerge Discretization
Lec. 38 – Data Through Time / Sequences:
Phylogeny - based on whole genome data
Testing for a difference
X Chromosome Inactivation
Data Analysis-Descriptive Statistics
BLAST Anders Gorm Pedersen & Rasmus Wernersson.
Categorical Data Aims Loglinear models Categorical data
Example of a common SNP in dogs
The Human Genome Project
The Human Genome Project
Warm-Up: Chi Square Practice!!!
Chi-Square Test.
Chi Square Review.
Quantitative Methods PSY302 Quiz Chapter 9
Chi-Square Test.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ruth B. McCole, Jelena Erceg, Wren Saylor, Chao-ting Wu  Cell Reports 
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Population Genetic Structure of the People of Qatar

Chi-Square Test.
Incorporating changing population size into the coalescent
Chi2 (A.K.A X2).
A Complete mtDNA Genome of an Early Modern Human from Kostenki, Russia
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 110, Issue 4, Pages (August 2002)
Chi Squared! Determine whether the difference between an observed and expected frequency distribution is statistically significant Complete your work sheet.
Quadrat sampling & the Chi-squared test
Biotechnology Mader 19.4.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Population Genetic Structure of the People of Qatar
Will use Fruit Flies for our example
Presentation transcript:

HIV Project -Matt Hagen

The Problem Are there any DNA sequences in common between HIV and human genomes? HIV-1, complete genome, chimeric clone AF HIV-1, complete genome UCSC Human genome, March 2006 AssemblyUCSC

What’s been done

Determine if the matches found are significant Are these matches due to chance?

Current Problem Calculate expected outcomes o Find all unique sequences in each chromosome of length 16 Obtain frequencies for all unique sequences Chi Squared Analysis?

Unique Sequences

Current Problem How are the matches distributed along the HIV genome? Distributed evenly Distribution is not equal Distribution is in clusters

– Distribution of sequence matches across the HIV genome, overlaps removed

Distribution of Matches Are areas on the HIV Genome more represented than others?

Future Work Why are some areas more represented than others? What is the biological significance of these areas? Where are these sequences found in the human genome? Nice graphic output of matches...