Sentiment Analysis and The Fourth Paradigm MSE 2400 EaLiCaRA Spring 2014 Dr. Tom Way.

Slides:



Advertisements
Similar presentations
SCIENCE PROCESS SKILLS
Advertisements

Presentation at Society of The Query conference, Amsterdam November 13-14, 2009 (original title: Learning from Google: software design as a methodology.
1 of 18 Information Dissemination New Digital Opportunities IMARK Investing in Information for Development Information Dissemination New Digital Opportunities.
Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING An open discussion and exchange of ideas Introduced by Eric Atwell, Language.
© 2014 Systems and Proposal Engineering Company. All Rights Reserved Using Natural Language Parsing (NLP) for Automated Requirements Quality Analysis Chris.
LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
Establishing Performance Standards for PARCC Assessments Initial Discussion PARCC Governing Board Meeting April 3,
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Information Visualization for an Intrusion Detection System Ching-Lung Fu James Blustein Daniel Silver.
Introduction to Computational Linguistics
Intelligent Architectures for Electronic Commerce Part 1.5: Symbolic Reasoning Agents.
Introduction to Computational Linguistics
Does one size really fit all? Evaluating classifiers in Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.
Business Analytics BI applications to support human and automated decision making Business Analytics—predict future outcomes Decision Support Systems.
Information Extraction Lecture 7 – Linear Models (Basic Machine Learning) CIS, LMU München Winter Semester Dr. Alexander Fraser, CIS.
A Brief Overview. Contents Introduction to NLP Sentiment Analysis Subjectivity versus Objectivity Determining Polarity Statistical & Linguistic Approaches.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Science Inquiry Minds-on Hands-on.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Chapter 4 Principles of Quantitative Research. Answering Questions  Quantitative Research attempts to answer questions by ascribing importance (significance)
Author(s): August E. Evrard, PhD License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-Non-commercial-Share.
Artificial Intelligence (AI) Addition to the lecture 11.
Information Retrieval – and projects we have done. Group Members: Aditya Tiwari ( ) Harshit Mittal ( ) Rohit Kumar Saraf ( ) Vinay.
ELN – Natural Language Processing Giuseppe Attardi
Introduction to Machine Learning MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Gavin Brown, University of Manchester.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
Introduction to Text and Web Mining. I. Text Mining is part of our lives.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
Educational Objectives
1 Machine Learning 1.Where does machine learning fit in computer science? 2.What is machine learning? 3.Where can machine learning be applied? 4.Should.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Based on the concept by Ira Socol. What is Tool Belt Theory? Tool Belt theory is based on the concept: Humans are tool users Knowledge of and access to.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
Basics of Research and Development and Design STEM Education HON4013 ENGR1020 Learning and Action Cycles.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Some questions -What is metadata? -Data about data.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Introduction Chapter 1 Foundations of statistical natural language processing.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Natural Language Processing.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Welcome and Introduction to the Course MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Pattern Recognition NTUEE 高奕豪 2005/4/14. Outline Introduction Definition, Examples, Related Fields, System, and Design Approaches Bayesian, Hidden Markov.
Basics of Natural Language Processing Introduction to Computational Linguistics.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Introducing Precictive Analytics
How can speech technology be used to help people with disabilities?
Jonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C
Natural Language Processing with Qt
IB Assessments CRITERION!!!.
Purpose of Research Research may be broadly classified into two areas; basic and applied research. The primary purpose of basic research (as opposed to.
Section 2: Science as a Process
School of Information Management Nanjing University China
What is Pattern Recognition?
Basic Intro Tutorial on Machine Learning and Data Mining
Artificial Intelligence Lecture 2: Foundation of Artificial Intelligence By: Nur Uddin, Ph.D.
THE NATURE OF SCIENCE.
Artificial Intelligence 2004 Speech & Natural Language Processing
Presentation transcript:

Sentiment Analysis and The Fourth Paradigm MSE 2400 EaLiCaRA Spring 2014 Dr. Tom Way

Necessary Background Fourth Paradigm Natural Language Processing Machine Learning Sentiment Sentiment Analysis Sentiment Tracking Word Cloud Various Tools 2MSE 2400 Evolution & Learning

Fourth Paradigm Scientific breakthroughs powered by advanced computing capabilities that help researchers manipulate and explore massive datasets. 3MSE 2400 Evolution & Learning

Nth paradigm? MSE 2400 Evolution & Learning4 The historian of science Thomas Kuhn gave paradigm its contemporary meaning when he adopted the word to refer to the set of practices that define a scientific discipline at any particular period of time. Kuhn himself came to prefer the terms exemplar and normal science, which have more precise philosophical meanings. However in his book The Structure of Scientific Revolutions Kuhn defines a scientific paradigm as: * what is to be observed and scrutinized * the kind of questions that are supposed to be asked and probed for answers in relation to this subject * how these questions are to be structured * how the results of scientific investigations should be interpreted Alternatively, the Oxford English Dictionary defines paradigm as "a pattern or model, an exemplar." Thus an additional component of Kuhn's definition of paradigm is: * how is an experiment to be conducted, and what equipment is available to conduct the experiment.

Jim Grays four scientific paradigms MSE 2400 Evolution & Learning5 1. empiricism observe phenomenon and attempt to classify Ptolemys universe of concentric spheres 2. theory describe above classifications with mathematical models Newtonian/Einsteinian gravity 3. computation build `virtual physical systems via solution of math models Cosmic structure formation 4. data-driven synthesis (?) unite empirical, theoretical and computational branches with data (X- info and Comp-X) Matter/energy content of the universe

Natural Language Processing Natural language processing (NLP) is a field of computer science, artificial intelligence, and linguistics concerned with the interactions between computers and human (natural) languages. 6MSE 2400 Evolution & Learning

NLP Examples Automatic summarization – generate a summary from original Machine translation – Google Translate Morphological segmentation – break up words into phonemes Natural language generation – generate text automatically Natural language understanding – analyze and learn meaning Optical character recognition (OCR) – read the text Parts of speech tagging – identify nouns, verbs, adjectives, etc. Question answering – parse a question and generate a response Speech recognition – hear what is spoken, convert to text Word sense disambiguation – pick the correct definition of a word Readability – at what grade level is text written? And more… such as? 7MSE 2400 Evolution & Learning

Machine Learning Machine Learning (ML), a branch of artificial intelligence, is about the construction and study of systems that can learn from data. 8MSE 2400 Evolution & Learning

ML Examples Decision tree – maps observations to conclusions Association rule – discovering relationships between things Neural networks – learn to recognize patterns and apply that learning Support vector machines – given training data it creates categories, can classify new data into the correct category Baysian networks – fine-tunes probabilities of classifying data as it gets more data More… 9MSE 2400 Evolution & Learning

Sentiment What is sentiment? an attitude, thought, or judgment prompted by feeling a specific view or notion an idea colored by emotion 10MSE 2400 Evolution & Learning

Sentiment Analysis Sentiment analysis or opinion mining refers to the application of natural language processing, computational linguistics, and text analytics to identify and extract subjective information in source materials. It is an example of a Computational Agent 11MSE 2400 Evolution & Learning

Sentiment Analysis Example A basic task in sentiment analysis is classifying the polarity of a given text - is it positive, negative, or neutral for some desired measure? Advanced, "beyond polarity" sentiment classification looks, for instance, at emotional states such as "angry," "sad," and "happy." 12MSE 2400 Evolution & Learning

Sentiment Tracking Performing Sentiment Analysis over time It is a Computational Agent that learns Watching the sentiment (positive, neutral, negative) as it changes over time 13MSE 2400 Evolution & Learning

Word Cloud A word cloud (or tag cloud) is a visual representation for text data, typically used to depict keyword metadata (tags) on websites, or to visualize free form text. How does it fit into the Fourth Paradigm? 14MSE 2400 Evolution & Learning

Word Cloud example 15MSE 2400 Evolution & Learning Inaugural Address Jan. 21, 2013

Example of tools Machine translation: Readability: Sentiment analysis: Word cloud: 16MSE 2400 Evolution & Learning