S-SENCE Signal processing for chemical sensors Martin Holmberg S-SENCE Applied Physics, Department of Physics and Measurement Technology (IFM) Linköping.

Slides:



Advertisements
Similar presentations
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Advertisements

Review Of Statistical Mechanics
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
PCA for analysis of complex multivariate data. Interpretation of large data tables by PCA In industry, research and finance the amount of data is often.
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Ole Mathis Kruse, IMT Feature extraction techniques to use in cereal classification Department of Mathematical Sciences and Technology 1.
A PRESENTATION ON ELECTRONIC NOSE PRESENTED BY vivek gangwar ELECTRONICS & COMMUNICATION.
Signal Processing Institute Swiss Federal Institute of Technology, Lausanne 1 Feature selection for audio-visual speech recognition Mihai Gurban.
1 Fourth Lecture Static Characteristics of Measurement Systems (continued) Instrumentation and Product Testing.
Principal Component Analysis
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Introduction to Neural Networks Simon Durrant Quantitative Methods December 15th.
An Overview of Today’s Class
Methods and Measurement in Psychology. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
Basic Concepts and Definitions Vector and Function Space. A finite or an infinite dimensional linear vector/function space described with set of non-unique.
Scalable Training of Mixture Models via Coresets Daniel Feldman Matthew Faulkner Andreas Krause MIT.
A Quick Practical Guide to PCA and ICA Ted Brookings, UCSB Physics 11/13/06.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
Biomedical Image Analysis and Machine Learning BMI 731 Winter 2005 Kun Huang Department of Biomedical Informatics Ohio State University.
Database Construction for Speech to Lip-readable Animation Conversion Gyorgy Takacs, Attila Tihanyi, Tamas Bardi, Gergo Feldhoffer, Balint Srancsik Peter.
NIMIA October 2001, Crema, Italy - Vincenzo Piuri, University of Milan, Italy NEURAL NETWORKS FOR SENSORS AND MEASUREMENT SYSTEMS Part III Vincenzo.
Data Mining Techniques
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Principle Component Analysis (PCA) Networks (§ 5.8) PCA: a statistical procedure –Reduce dimensionality of input vectors Too many features, some of them.
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
This week: overview on pattern recognition (related to machine learning)
PPT 206 Instrumentation, Measurement and Control SEM 2 (2012/2013) Dr. Hayder Kh. Q. Ali 1.
Image recognition using analysis of the frequency domain features 1.
Presented by Tienwei Tsai July, 2005
IE 585 Introduction to Neural Networks. 2 Modeling Continuum Unarticulated Wisdom Articulated Qualitative Models Theoretic (First Principles) Models Empirical.
Element 2: Discuss basic computational intelligence methods.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
Data Mining Manufacturing Data Dave E. Stevens Eastman Chemical Company Kingsport, TN.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Classification Course web page: vision.cis.udel.edu/~cv May 12, 2003  Lecture 33.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 24 Nov 2, 2005 Nanjing University of Science & Technology.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
A B S T R A C T The study presents the application of selected chemometric techniques to the pollution monitoring dataset, namely, cluster analysis,
1. (1.3) (1.8) (1.11) (1.14) Fundamental equations for homogeneous closed system consisting of 1 mole:
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Introduction: Olfactory Physiology Organic Chemistry Signal Processing Pattern Recognition Computational Learning Electronic Nose Chemical Sensors.
Blind Information Processing: Microarray Data Hyejin Kim, Dukhee KimSeungjin Choi Department of Computer Science and Engineering, Department of Chemical.
CHEE320 Analysis of Process Data J. McLellan Fall, 2001.
PCA vs ICA vs LDA. How to represent images? Why representation methods are needed?? –Curse of dimensionality – width x height x channels –Noise reduction.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
Test strength = anomaly / normal range Summer temperatures of Switzerland (Schar et al., 2004)
Statistics Presentation Ch En 475 Unit Operations.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Physical Behavior of Matter Review. Matter is classified as a substance or a mixture of substances.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Kim HS Introduction considering that the amount of MRI data to analyze in present-day clinical trials is often on the order of hundreds or.
THERMAL ANALYSIS FOR PREFORMULATION TRIALS
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
CLASSIFICATION OF ECG SIGNAL USING WAVELET ANALYSIS
Data statistics and transformation revision Michael J. Watts
Artificial nose A case-study by Shirish Bharati BT11MEC073 Shivam Rathod BT11MEC074 Siddhant Goyal BT11MEC075 Sushant Somkuwar BT11MEC076 Measurement and.
Unsupervised Learning
LECTURE 11: Advanced Discriminant Analysis
Final Year Project Presentation --- Magic Paint Face
Machine Learning Dimensionality Reduction
Application of Independent Component Analysis (ICA) to Beam Diagnosis
Descriptive Statistics vs. Factor Analysis
Unsupervised Learning
Presentation transcript:

S-SENCE Signal processing for chemical sensors Martin Holmberg S-SENCE Applied Physics, Department of Physics and Measurement Technology (IFM) Linköping University

S-SENCE Human olfactory system n a large number of olfactory cells (  10 million) but only a small number of sensitivity classes (  30) n a mixture of volatile compounds results in a signal pattern to the brain via the mitral cells (  10 thousand) n the brain interprets the signal pattern as a specific odour

S-SENCE Electronic nose concept n sensitivity to many compounds using a limited number of chemical sensors with different and partly overlapping selectivity profiles n analyse the sensor signal pattern rather than the individual signals n gives a description of the total measurement state, i.e. the sum of all components, which can be more important than a detailed knowledge of each of the individual components odour sensor chamber(s) chemical sensors pre- processing computer information pattern recognition

S-SENCE Solid-state gas sensor principles

S-SENCE Electronic nose gas flow diagram

S-SENCE Signal processing: feature extraction

S-SENCE Background; The Tongue n Based on voltammetry n Currents are measured as a function of applied potential on electrode surfaces n The currents are caused by: n Redox reactions at the surface n Ionic movement in the sample

S-SENCE The currents are sampled once every [ms]  variables Background; Voltammetry

S-SENCE Information overflow n An electronic nose or tongue generates large data series (one measurement can consist of up to variabler  Linköping’s telephone book) n Impossible to get a good grip of these numbers n We need to decrease the information content n A lot of redundant information means that compression is possible

S-SENCE Other possible problems n Noise n Drift n Low sensitivity n Sampling problems n Measurements system n …

S-SENCE What is drift? n Definition: “a gradual change in any quantitative characteristic that is supposed to remain constant” (Webster’s Seventh New Collegiate Dictionary) n For chemical sensors: Measurements made under identical chemical conditions give different sensor responses at different times.

S-SENCE Causes of drift n Reactions on the sensor surface (poisoning) n changes in the physical properties of the sensing material (e.g. the size of the metal islands on a MOSFET) n adsorption of species on the sensor surface n layer formation of reaction products n Variations in the gas (composition, pressure, temperature,…) n Remaining gas in the measurement system

S-SENCE Example of drift n Measurements made during 60 days on nine different mixtures of four gases n Gradual change + jumps n Drift + noise n Drift in different directions for the different sensors n Below are shown the responses of three sensors as a function of time

S-SENCE Comparison noise and drift n Drift n low frequency n caused by changes in the measurement system n similar for similar sensors n Noise n high frequency n caused by randomness in the measurements n individual for each sensor

S-SENCE What do we do? n Pre-processing to compensate for some of the problems mentioned previously n Pattern recognition models to give the desired information n If possible, give feedback regarding the measurements odour sensor chamber(s) chemical sensors pre- processing computer information pattern recognition

S-SENCE Geometrical interpretation ? variables1,2,3-variables Easy to visualiseAbstract Reduce the number of dimensions Often made by choosing directions with a lot of variation

S-SENCE Principal Component Analysis Can be used for data compression, feature extraction, or visualisation

S-SENCE Examples of pre-processing n 1 st example uses PCA to find a direction where irrelevant information is dominant. This direction is then removed from the data set. n 2 nd example shows how wavelets are used to compress data, and how the choice of wavelets depends on the application.

S-SENCE Component correction n Uses a reference gas n Calculates the direction of drift by PCA for the reference gas n Removes this direction(s) from all other measurements n Example: Artursson et al. J. of Chemometr. 14, 5/6 (2000)

S-SENCE Component correction n 1 st (or several) component in a PCA analysis of the reference gas will describe the drift direction, p n Project the samples, X, on the first loading, p from the PCA on the reference gas, n t=Xp n Subtract the bilinear expression, tp T, from X n X corrected =X- tp T

S-SENCE Component correction n Also used for quantification beforecomponent correction

S-SENCE Wavelets n Data is described using wavelet base functions with different scales n The number of wavelets = the number of original variables

S-SENCE Wavelet selection n By selecting only the most relevant wavelets, a data reduction is obtained n Different criteria for selection in different applications (e.g. variance or discrimination) n Here approximately 100 wavelets are chosen from the original ca variables

S-SENCE Reconstruction n The original signal can be reconstructed from the compressed data n Different results from different wavelet selection criteria Variance selectedDiscrimination selected

S-SENCE Pattern recognition n After reducing initial obstacles with different pre- processing techniques, a model that gives the user something he/she can understand must be made odour sensor chamber(s) chemical sensors pre- processing computer information pattern recognition

S-SENCE Data models n Classification or quantification? n Statistical tools for class membership

S-SENCE Data models n Quantification of gases or liquids: n Regression models (e.g. PLS) for linear relationships n Artificial neural networks (ANN) for non-linear black-box modeling n Important always to validate data, especially when many parameters are used in the models!!!