Ping-Tsun Chang Intelligent Systems Laboratory Computer Science and Information Engineering National Taiwan University Text Mining with Machine Learning.

Slides:



Advertisements
Similar presentations
Statistical Machine Learning- The Basic Approach and Current Research Challenges Shai Ben-David CS497 February, 2007.
Advertisements

Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Chapter 5: Introduction to Information Retrieval
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Data Mining Classification: Alternative Techniques
Support Vector Machines
SVM—Support Vector Machines
A Survey on Text Categorization with Machine Learning Chikayama lab. Dai Saito.
Classification and Decision Boundaries
Discriminative and generative methods for bags of features
On feature distributional clustering for text categorization Bekkerman, El-Yaniv, Tishby and Winter The Technion. June, 27, 2001.
The Nature of Statistical Learning Theory by V. Vapnik
Chapter 7: Text mining UIC - CS 594 Bing Liu 1 1.
Text Classification With Support Vector Machines
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Chapter 2: Pattern Recognition
Support Vector Machines Based on Burges (1998), Scholkopf (1998), Cristianini and Shawe-Taylor (2000), and Hastie et al. (2001) David Madigan.
SVM Active Learning with Application to Image Retrieval
Active Learning with Support Vector Machines
SVMs Finalized. Where we are Last time Support vector machines in grungy detail The SVM objective function and QP Today Last details on SVMs Putting it.
Text Classification With Labeled and Unlabeled Data Presenter: Aleksandar Milisic Supervisor: Dr. David Albrecht.
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Introduction to Data Mining Engineering Group in ACL.
1 Text Categorization  Assigning documents to a fixed set of categories  Applications:  Web pages  Recommending pages  Yahoo-like classification hierarchies.
This week: overview on pattern recognition (related to machine learning)
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
Text mining.
Information Filtering LBSC 796/INFM 718R Douglas W. Oard Session 10, April 13, 2011.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Support Vector Machine (SVM) Based on Nello Cristianini presentation
No. 1 Classification and clustering methods by probabilistic latent semantic indexing model A Short Course at Tamkang University Taipei, Taiwan, R.O.C.,
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Universit at Dortmund, LS VIII
Special topics on text mining [ Part I: text classification ] Hugo Jair Escalante, Aurelio Lopez, Manuel Montes and Luis Villaseñor.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Text Feature Extraction. Text Classification Text classification has many applications –Spam detection –Automated tagging of streams of news articles,
Text mining. The Standard Data Mining process Text Mining Machine learning on text data Text Data mining Text analysis Part of Web mining Typical tasks.
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
Copyright R. Weber Machine Learning, Data Mining INFO 629 Dr. R. Weber.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Chapter Ⅳ. Categorization 2007 년 2 월 15 일 인공지능연구실 송승미 Text : THE TEXT MINING HANDBOOK Page. 64 ~ 81.
Active learning Haidong Shi, Nanyi Zeng Nov,12,2008.
Neural Text Categorizer for Exclusive Text Categorization Journal of Information Processing Systems, Vol.4, No.2, June 2008 Taeho Jo* 報告者 : 林昱志.
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Matwin Text classification: In Search of a Representation Stan Matwin School of Information Technology and Engineering University of Ottawa
Supervised Learning. CS583, Bing Liu, UIC 2 An example application An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc)
Text Categorization With Support Vector Machines: Learning With Many Relevant Features By Thornsten Joachims Presented By Meghneel Gore.
Ping-Tsun Chang Intelligent Systems Laboratory Computer Science and Information Engineering National Taiwan University Combining Unsupervised Feature Selection.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Chapter 6. Classification and Prediction Classification by decision tree induction Bayesian classification Rule-based classification Classification by.
Chapter 6. Classification and Prediction Classification by decision tree induction Bayesian classification Rule-based classification Classification by.
NTU & MSRA Ming-Feng Tsai
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
1 Text Categorization  Assigning documents to a fixed set of categories  Applications:  Web pages  Recommending pages  Yahoo-like classification hierarchies.
Ping-Tsun Chang Intelligent Systems Laboratory NTU/CSIE Using Support Vector Machine for Integrating Catalogs.
Roughly overview of Support vector machines Reference: 1.Support vector machines and machine learning on documents. Christopher D. Manning, Prabhakar Raghavan.
Support Vector Machines Reading: Textbook, Chapter 5 Ben-Hur and Weston, A User’s Guide to Support Vector Machines (linked from class web page)
School of Computer Science & Engineering
Prepared by: Mahmoud Rafeek Al-Farra
Presented by: Prof. Ali Jaoua
Other Classification Models: Support Vector Machine (SVM)
Extracting Why Text Segment from Web Based on Grammar-gram
Support Vector Machines 2
Presentation transcript:

Ping-Tsun Chang Intelligent Systems Laboratory Computer Science and Information Engineering National Taiwan University Text Mining with Machine Learning Techniques

Ping-Tsun Chang Language Identification Classification Clustering Summerization Feature Selection Text Analysis

Ping-Tsun Chang Text Mining Text mining is about looking for patterns in natural language text –Natural Language Processing May be defined as the process of analyzing text to extract information from it for particular purposes. –Information Extraction –Information Retrieval

Ping-Tsun Chang Text Mining and Knowledge Management a recent study indicated that 80% of a company's information is contained in text documents – s, memos, customer correspondence, and reports The ability to distil this untapped source of information provides substantial competitive advantages for a company to succeed in the era of a knowledge-based economy.

Ping-Tsun Chang Text Mining Applications Customer profile analysis –mining incoming s for customers' complaint and feedback. Patent analysis –analyzing patent databases for major technology players, trends, and opportunities. Information dissemination –organizing and summarizing trade news and reports for personalized information services. Company resource planning – mining a company's reports and correspondences for activities, status, and problems reported.

Ping-Tsun Chang Text Categorization Problem Definition Text categorization is the problem of automatically assigned predefined categories to free text documents –Document classification –Web page classification –News classification

Ping-Tsun Chang Information Retrieval Full text is hard to process, but is a complete representation to document Logical view of documents Models –Boolean Model –Vector Model –Probabilistic Model Think text as patterns?

Ping-Tsun ChangEvaluation Retrieved Relevant a b c d

Ping-Tsun Chang Pattern Recognization Sensing Segmentation Classification Post-Processing Feature ExtractionDecision

Ping-Tsun Chang Pattern Classification f 1 f 2 C 1 C 2

Ping-Tsun Chang Machine Learning Using Computer help us to induction from complex and large amount of pattern data Bayesian Learning Instance-Based Learning –K-Nearest Neighbors Neural Networks Support Vector Machine

Ping-Tsun Chang Feature Selection (I) Information Gain

Ping-Tsun Chang Feature Selection (II) Mutual Information CHI-Square

Ping-Tsun Chang Weighting Scheme TF ‧ IDF

Ping-Tsun Chang Simility Evaluation Cosine-Like schema didi djdj

Ping-Tsun Chang Machine Learning Approaches: Baysian Classifier

Ping-Tsun Chang Machine Learning Approaches: kNN Classifier d ?

Ping-Tsun Chang Machine Learning Approaches: Support Vector Machine Basic hypotheses : Consistent hypotheses of the Version Space Project the original training data in space X to a higher dimension feature space F via a Mercel operator K

Ping-Tsun Chang Compare: SVM and traditional Leaners Traditional Leaner SVM access the hypothesis space! P(h) hypothesis P(h|D 1 ) hypothesis P(h|D 1^ D 2 ) hypothesis

Ping-Tsun Chang SVM Learning in Feature Spaces Example: XF

Ping-Tsun Chang Support Vector Machine (cont’d) Nonlinear –Example: XOR Problem Natural Language is Nonlinear! f 1 f 2 f 1 f 1 f 2

Ping-Tsun Chang Support Vector Machine (cont’d) Consistent hypothses Maximum margin Support Vector

Ping-Tsun Chang Statistical Learning Theory P(X)P(y|x) F(x) y y* x x Generator Supervisor Leaner

Ping-Tsun Chang Support Vector Machine Linear Discriminant Functions Linear discriminant space Hyperplane g(y)>1 y2y2 y1y1 g(y)<1

Ping-Tsun Chang Learning of Support Vector Machine Maxmize Margin Minimize ||a|| Optimal hyperplane

Ping-Tsun Chang Version Space Hypothesis Space H Version Space V H V

Ping-Tsun Chang Support Vector Machine Active Learning Why Support Vector Machine? –Text Categorization have large amount of data –Traditional Learning cause Over-Fitting –Language is complex and nonlinear Why Active Learning? –Labeling instance is time-consuming and costly –Reduce the need for labeled training instances

Ping-Tsun Chang Active Learning: History Text Classification [Rochio, 71] [Dumais, 98] Support Vector Machine [Vapnik,82] Text Classification Support Vector Machine [Joachims,98] [Dumais,98] Pool-Based Active Learning [Lewis, Gale ‘94] [McCallum, Nigrm ‘98] The Nature of Statistical Learning Theory [Vapnik, 95] Automated Text Categorization Using Support Vector Machine [Kwok, 98]

Ping-Tsun Chang Active Learning UPool-Based active learning have a pool U of unlabeled instances Active Lerner l have three components (f,q,X) –f: classifier x->{-1, 1} –q: querying function q(X), given a training instance labeled set X, decide which instance in U to query next. –X: training data, labeled.

Ping-Tsun Chang Active Learning (cont’d) Main difference: querying component q. How to choose the next unlabeled instance to query? Resulting Version Space

Ping-Tsun Chang Active Learner Active learner l* always queries instances whose corresponding hyperplanes in parameter space W halves the area of the current version space

Ping-Tsun Chang Experienments Bayesian Classifier

Ping-Tsun Chang Comparsion of Learning Methods Precision Training Data Size SVM kNN NB NNet

Ping-Tsun Chang Conclusions Text-Mining extraction knowledge from text. Support Vector Machine is almost the best statistic-based machine learning method Natural Language Understanding is still a open problem Knowledge