Biomathematics seminar Application of Fourier to Bioinformatics Girolamo Giudice.

Slides:



Advertisements
Similar presentations
Symmetry and the DTFT If we know a few things about the symmetry properties of the DTFT, it can make life simpler. First, for a real-valued sequence x(n),
Advertisements

Lecture 7: Basis Functions & Fourier Series
Signals The main function of the physical layer is moving information in the form of electromagnetic signals across a transmission media. Information can.
DFT/FFT and Wavelets ● Additive Synthesis demonstration (wave addition) ● Standard Definitions ● Computing the DFT and FFT ● Sine and cosine wave multiplication.
Fourier Transform – Chapter 13. Image space Cameras (regardless of wave lengths) create images in the spatial domain Pixels represent features (intensity,
 Signals  Analog and Digital  Analog and Digital Data & Signals  Periodic & Aperiodic Signals.
CMP206 – Introduction to Data Communication & Networks Lecture 2 – Signals.
TRANSMISSION FUNDAMENTALS Review
Properties of continuous Fourier Transforms
Image Enhancement in the Frequency Domain Part I Image Enhancement in the Frequency Domain Part I Dr. Samir H. Abdul-Jauwad Electrical Engineering Department.
Image Fourier Transform Faisal Farooq Q: How many signal processing engineers does it take to change a light bulb? A: Three. One to Fourier transform the.
Time and Frequency Representation
CH#3 Fourier Series and Transform
Chapter 25 Nonsinusoidal Waveforms. 2 Waveforms Used in electronics except for sinusoidal Any periodic waveform may be expressed as –Sum of a series of.
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
Lecture 1 Signals in the Time and Frequency Domains
ELECTRICAL CIRCUIT ET 201 Define and explain characteristics of sinusoidal wave, phase relationships and phase shifting.
Motivation Music as a combination of sounds at different frequencies
Signals & Systems Lecture 11: Chapter 3 Spectrum Representation (Book: Signal Processing First)
Fourier series. The frequency domain It is sometimes preferable to work in the frequency domain rather than time –Some mathematical operations are easier.
Module 2 SPECTRAL ANALYSIS OF COMMUNICATION SIGNAL.
1 Chapter 5 Image Transforms. 2 Image Processing for Pattern Recognition Feature Extraction Acquisition Preprocessing Classification Post Processing Scaling.
Fourier Series. Introduction Decompose a periodic input signal into primitive periodic components. A periodic sequence T2T3T t f(t)f(t)
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
Digital Image Processing Chapter 4 Image Enhancement in the Frequency Domain Part I.
ELECTRICAL TECHNOLOGY EET 103/4
CH#3 Fourier Series and Transform
ES97H Biomedical Signal Processing
Vibrationdata 1 Unit 6a The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
revision Transfer function. Frequency Response
Fourier series, Discrete Time Fourier Transform and Characteristic functions.
صدا و ارتعاش در صنعت جلسه دوم محمد رضا منظم
1 Signals. 2 Signals Introduction Introduction Analog and Digital Analog and Digital.
Chapter2 : SIGNALS 1st semester King Saud University
The Fourier series handles steady-state sinusoidal functions, but does not have the ability to model transients effectively. As we have seen, the Fourier.
2 nd semester nalhareqi King Saud University College of Applied studies and Community Service 1301CT By: Nour Alhariqi.
Digital Image Processing Lecture 8: Fourier Transform Prof. Charlene Tsai.
Lecture 19 Spectrogram: Spectral Analysis via DFT & DTFT
Ch 10.2: Fourier Series We will see that many important problems involving partial differential equations can be solved, provided a given function can.
Signal Fndamentals Analogue, Discrete and Digital Signals
Chapter 4 Dynamical Behavior of Processes Homework 6 Construct an s-Function model of the interacting tank-in-series system and compare its simulation.
Chapter 4 Dynamical Behavior of Processes Homework 6 Construct an s-Function model of the interacting tank-in-series system and compare its simulation.
Sinusoidal Excitation of Circuits
Integral Transform Method
COMPLEX NUMBERS and PHASORS
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
MECH 373 Instrumentation and Measurements
SIGNALS PROCESSING AND ANALYSIS
3.1 Introduction Why do we need also a frequency domain analysis (also we need time domain convolution):- 1) Sinusoidal and exponential signals occur.
Basics of Comparative Genomics
Unit 5 The Fourier Transform.
MECH 373 Instrumentation and Measurements
Fourier Analyses Time series Sampling interval Total period
CHAPTER 3 DATA AND SIGNAL
ECET 345 Competitive Success/snaptutorial.com
ECET 345 Education for Service-- snaptutorial.com.
ECET 345 Teaching Effectively-- snaptutorial.com.
Sinusoidal Waveform Phasor Method.
UNIT II Analysis of Continuous Time signal
Periodic Functions and Fourier Series
Lecture 2: SIGNALS 2nd semester By: Elham Sunbu.
Speech Pathologist #10.
Physical Layer Part 1 Lecture -3.
Fourier Analyses Time series Sampling interval Total period
Correlation, Energy Spectral Density and Power Spectral Density
Lecture 5: Phasor Addition & Spectral Representation
Basics of Comparative Genomics
Lec.6:Discrete Fourier Transform and Signal Spectrum
Presentation transcript:

Biomathematics seminar Application of Fourier to Bioinformatics Girolamo Giudice

Background −1 + 3i and −1 − 3i,

Background Z = −1 + 3i Complex plane

Background Euler’s Formula Polar Cartesian Exponential

Periodic and aperiodic function Periodic functionAperiodic function

Sine (or cosine) wave A the amplitude, is the peak deviation of the function from zero. F the frequency, is the number of oscillations (cycles) that occur each second of time. ω = 2πf, the angular frequency, how many cycles occur in a second φ the phase, specifies (in radians) where in its cycle the oscillation is at t = 0.

Harmonic analysis and Fouries series It is possible to express periodic function into the sum of a (possibly infinite) set sines and cosines (or, equivalently, complex exponentials). Complex Fourier series Quadrature Fourier Series Euler Formula

Harmonic analysis and Fouries series It is possible to express a periodic function into the sum of a (possibly infinite) set sines and cosines Fourier Series

Spectrum Fourier Series

Spectrum of a sinusoid Quadrature Fourier Series Complex Fourier Series Delta Function

Example A=5 0

Take home message

Spectrum of a sinusoid

Spectrum of a Swiched sinusoid

Fourier transform Discrete domain Inverse Fourier Transform Continuous Domain Complex Fourier series

Fourier’s Transform

Relationship between Fourier series and transform

Application of Fourier to Bioinformatics

Conversion of DNA Character Strings to Conversion of DNA Character Strings to Numbers Create four binary sequences, one for each character one for each character (base), which specify whether a character is present (1) or absent (0) at a specific location known as indicator sequences Assign meaningful real or complex numbers to the four A, T, G, and C characters. In this way, a single numerical sequence representing the entire character string is obtained.

Binary Indicator Sequences Dna Sequence ATTGCACCGTGA Sum Any three of the four indicator sequences completely characterize the full DNA character string. Indicator sequences can be analyzed to identify in the structure of a DNA string.

Period-3 property PS of a protein coding regionPS of a non-coding region

Identify Protein-Coding Regions

Resonant Recognition Model The energy of delocalized electrons in amino acids produce the strongest impact on the electronic distribution of the whole protein because produce electromagnetic irradiation or absorption with spectral characteristics corresponding to energy distribution along the protein

Resonant Recognition Model

This frequency is related to the biological function provided the following criteria are met: 1) One peak only exists for a group of protein sequences 2) No significant peak exists for biologically unrelated 3) Peak frequencies are different for different biological In our previous studies

Each frequency in the RRM characterizes one biological function. To grasp the meaning of characteristic frequency, it is important first to understand what is meant by the biological function of proteins. Each biological process involves a number of interactions between proteins and their targets (other protein, DNA regulatory segment or small molecule). Each of these processes involves energy transfer between interacting molecules. These interactions are highly selective and this selectivity is defined within the protein structure. Protein and their protein or DNA targets have been analyzed to find out whether RRM characteristic frequencies denote a parameter which describes this selectivity between interacting molecules. It has been shown that proteins and their DNA or protein targets share the same characteristic frequency [ 131, [151, [17], [22], but of opposite phase [17], [22] for each in a pair of interacting macromolecules. Thus, it can be postulated that RRM characteristic frequencies characterize not only general functions but also provide recognition between a particular protein and its target (receptor, ligand, etc.). As this recognition arises from the matching of periodicities within the distribution of energies of free electrons along the interacting proteins, it can be regarded as resonant recognition.

Repetita Permit to find periodicities hidden along the sequence.

Repetita

Alignment-MAFFT Uses FFT for the detection of homologous segments Substitutions between physico‐chemically similar amino acids tend to preserve the structure of proteins, and such neutral substitutions have been accumulated in molecules during evolution An amino acid a is assigned to a vector whose components are the volume value v(a) and the polarity value p(a) Calculation of the correlation between two amino acid sequences. We define the correlation c(k) between two sequences of such vectors as c(k) = c v (k) + c p (k),1 where c v (k) and c p (k) are, as defined below, the correlations of volume component and polarity component, respectively, between two amino acid sequences to be aligned. The correlation c(k) represents the degree of similarity of two sequences with the positional lag of k sites. The high value of c(k) indicates that the sequences may have homologous regions The correlation c v (k) of volume component between sequence 1 and sequence 2 with the positional lag of k sites is defined as where v̂ 1 (n) and v̂ 2 (n) are the volume component of the nth site of sequence 1 with the length of N and that of sequence 2 with the length ofM, respectively. If two sequences compared have homologous regions, the correlation c(k) has some peaks corresponding to these regions By the FFT analysis, however, we can know only the positional lag k of a homologous region in two sequences but not the position of the region. As shown in Figure 1B, to determine the positions of the homologous region in each sequence, a sliding window analysis with the window size of 30 sites is carried out, in which the degree of local homologies is calculated for each of the highest 20 peaks in the correlation c(k)1

MSA with Fourier - MAFFT

Homologous regions are quickly identified by converting amino acid residues to vectors of volume and polarity If is are the volume component of the nth site

Recursive Splicing

RNA Binding protein motifs Exon Intron Exon Pre-mRNA

Take home message Fourier transform and Fourier series have the same purpose: decompose a signal in sum of waves. It was possible: Detect hidden signal (period-3 property) Filtering noise (identify Protein-Coding Regions) Detect periodicity (Repetita) Detect common structure and sequence similarities (Resonant recognition model, MAFFT) Denoising and reconstructing signals( Recursive splicing) Reduce computational time ( MAFFT)

Thank you