Presentation is loading. Please wait.

Presentation is loading. Please wait.

Motif discovery Tutorial 5. Motif discovery MEME Creates motif PSSM de-novo (unknown motif) MAST Searches for a PSSM in a DB TOMTOM Searches for a PSSM.

Similar presentations


Presentation on theme: "Motif discovery Tutorial 5. Motif discovery MEME Creates motif PSSM de-novo (unknown motif) MAST Searches for a PSSM in a DB TOMTOM Searches for a PSSM."— Presentation transcript:

1 Motif discovery Tutorial 5

2 Motif discovery MEME Creates motif PSSM de-novo (unknown motif) MAST Searches for a PSSM in a DB TOMTOM Searches for a PSSM in motif DBs Agenda Cool story of the day: How NOT to be a bioinformatician

3 Motif – definition Motif a widespread pattern with a biological significance. Sequence motif PTB (RNA binding protein) UCUU CAP (DNA binding protein) TGTGAXXXXXXTCACAXT

4 Sequence motif – definition 12345678910 A000003/61/62/600 D03/62/6001/65/61/60 E004/61000015/6 G01/60011/30000 H01/600000000 N0 00000000 Y1000003/6 00..YDEEGGDAEE....YGEEGADYED....YDEEGADYEE....YNDEGDDYEE....YHDEGAADEE.. Motif a nucleotide or amino-acid sequence pattern that is widespread and has a biological significance PSSM - position-specific scoring matrix

5 Can we find motifs using multiple sequence alignment (MSA)? YES! NO Local multiple sequence alignment is a hard problem to solve

6 Motif search: from de-novo motifs to motif annotation gapped motifs Large DNA data http://meme.sdsc.edu/

7 MEME

8 MEME – Multiple EM* for Motif finding Motif discovery from unaligned sequences - genomic or protein sequences Flexible model of motif presence (Motif can be absent in some sequences or appear several times in one sequence) *Expectation-maximization http://meme.sdsc.edu/

9 MEME - Input Input file (fasta file) How many times in each sequence? How many motifs? How many sites? Range of motif lengths

10 MEME - Output Motif e- value

11 MEME – Sequence logo Motif length Number of appearnces Motif e- value A graphical representation of the sequence motif

12 MEME – Sequence logo High information content = High confidence The relative sizes of the letters indicates their frequency in the sequences The total height of the letters depicts the information content of the position, in bits of information.

13 Multilevel Consensus MEME – Sequence logo

14 Patterns can be presented as regular expressions [AG]-x-V-x(2)-{YW} [] - Either residue x - Any residue x(2) - Any residue in the next 2 positions {} - Any residue except these Examples: AYVACM, GGVGAA

15 Sequence names Position in sequence Strength of match Motif within sequence MEME – motif alignment

16 Overall strength of motif matches Motif location in the input sequence MEME – motif locations Sequence names

17 What can we do with motifs? MAST - Search for them in non annotated sequence databases (protein and DNA). TOMTOM - Find the protein which binds the DNA motifs.

18 MAST

19 Searches for motifs (one or more) in sequence databases: – Like BLAST but motifs for input – Similar to iterations of PSI-BLAST Profile defines strength of match – Multiple motif matches per sequence MEME uses MAST to summarize results: – Each MEME result is accompanied by the MAST result for searching the discovered motifs on the given sequences. http://meme.sdsc.edu/meme4_4_0/cgi-bin/mast.cgi

20 MAST - Input Input file (motifs) Database

21 If you wish to use motifs discovered by MEME

22 MAST - Output Input motifs Presence of the motifs in a given database

23 MAST – Output (another example, global view)

24 MAST – Output (another example, global view)

25 TOMTOM

26 Searches one or more query DNA motifs against one or more databases of target motifs, and reports for each query a list of target motifs, ranked by p-value. The output contains results for each query, in the order that the queries appear in the input file. http://meme.sdsc.edu/meme/doc/tomtom.html

27 TOMTOM - Input Input motif Background frequencies Database

28 TOMTOM - Output Input motif Matching motifs

29 TOMTOM – Output Wrong input (RNA sequence of RNA binding protein NOVA1) “OK” results

30 MAST vs. TOMTOM MASTTOMTOM ComparisonProfile against DBProfile against Profile DBGeneral DBsKnown motif DBs

31 Cool Story of the day How NOT to be a bioinformatician

32

33

34

35


Download ppt "Motif discovery Tutorial 5. Motif discovery MEME Creates motif PSSM de-novo (unknown motif) MAST Searches for a PSSM in a DB TOMTOM Searches for a PSSM."

Similar presentations


Ads by Google