Incremental Context Mining for Adaptive Document Classification

Slides:



Advertisements
Similar presentations
WG 2 \ WG 3 Model suitability for various user demands.
Advertisements

12/2006DSS Employee Recognition Survey THE FOLLOWING STATEMENTS ARE IMPORTANT TO ME SUMMARY OF ALL QUESTIONS (1 = Least Important and 5 = Most Important)
Chapter 7: Text mining UIC - CS 594 Bing Liu 1 1.
A Technique for Advanced Dynamic Integration of Multiple Classifiers Alexey Tsymbal*, Seppo Puuronen**, Vagan Terziyan* *Department of Artificial Intelligence.
1 Web Query Classification Query Classification Task: map queries to concepts Application: Paid advertisement 问题:百度 /Google 怎么赚钱?
Language Model. Major role: Language Models help a speech recognizer figure out how likely a word sequence is, independent of the acoustics. A lot of.
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
Examples of Ensemble Methods
KDD for Science Data Analysis Issues and Examples.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Yoonjung Choi.  The Knowledge Discovery in Databases (KDD) is concerned with the development of methods and techniques for making sense of data.  One.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Incremental Mining of Information Interest for Personalized Web Scanning Rey-Long Liu ( 劉瑞瓏 ) Dept. of Medical Informatics Tzu Chi University.
Processing of large document collections Part 3 (Evaluation of text classifiers, applications of text categorization) Helena Ahonen-Myka Spring 2005.
Automated Patent Classification By Yu Hu. Class 706 Subclass 12.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
Data Mining By Dave Maung.
Measurement error Roland Caulcutt Caulcutt Associates.
Computing & Information Sciences Kansas State University Paper Review Guidelines KDD Lab Course Supplement William H. Hsu Kansas State University Department.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Reference Collections: Collection Characteristics.
Summary „Data mining” Vietnam national university in Hanoi, College of technology, Feb.2006.
Software Engineering Issues Software Engineering Concepts System Specifications Procedural Design Object-Oriented Design System Testing.
Integrating Multiple Knowledge Sources For Improved Speech Understanding Sherif Abdou, Michael Scordilis Department of Electrical and Computer Engineering,
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Incremental Context Mining for Adaptive Document Classification Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Rey-Long Liu Yun-Ling Lu.
NP ⊆ PCP(n 3, 1) Theory of Computation. NP ⊆ PCP(n 3,1) What is that? NP ⊆ PCP(n 3,1) What is that?
IRU INTERNATIONAL STAR CLASSIFICATION SYSTEM IN ROMANIA.
ESSENTIAL WRITING SKILLS Why good writing skills are essential to career and business success.
The KDD Process for Extracting Useful Knowledge from Volumes of Data Fayyad, Piatetsky-Shapiro, and Smyth Ian Kim SWHIG Seminar.
Introduction to Machine Learning, its potential usage in network area,
Module 1: Overview of Information System in Organizations
(Disarikan dari berbagai sumber)
Chapter 14: Designing and Building Decision Support Systems
Information Organization: Overview
بسم الله الرحمن الرحيم.
Presentations Computer Vision Research Lab
Foundations of Information Systems in Business
Improving Health Question Classification by Word Location Weights
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
INFORMATION SYSTEM AUDIT
MIS 451 Building Business Intelligence Systems
Information Retrieval
Title Goal Method Result
Aspect-based sentiment analysis
Adrian Tuhtan CS157A Section1
What is Pattern Recognition?
Lecture 1 – The process of decision making
I don’t need a title slide for a lecture
Introduction to Data Mining, 2nd Edition
Ninja Trader: Introduction to data mining in financial applications
Text Categorization Rong Jin.
سيستم هاي اطلاعات مديريت
Decision Support Systems: An Overview
Citation-based Extraction of Core Contents from Biomedical Articles
MIS2502: Data Analytics Clustering and Segmentation
SVM Based Learning System for F-term Patent Classification
Chapter 11 user support.
MIS2502: Data Analytics Clustering and Segmentation
Understanding Customer Behaviors with Information Technologies
Barry Anderson OECD 1st MENA SBO Meeting Cairo November 24-25, 2008
Dynamic Category Profiling for Text Filtering and Classification
Topic 5: Cluster Analysis
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Lecture 4. Niching and Speciation (1)
Information Organization: Overview
© 2018 The Hartford. Classification: Internally Controlled
THE TOPICS AND TITLES OF RESEARCH
Presentation transcript:

Incremental Context Mining for Adaptive Document Classification Rey-Long Liu & Yun-Ling Lu Main goals: (1) Adapting the classifier to the evolution of a hierarchical document database (2) Recognizing the document’s context of discussion in document classification Why? (1) Both content and vocabulary of the document database may evolve over time (2) A document may be properly classified only if its context of discussion is recognized How? (1) Incrementally mining the contextual requirement of each category (2) Document classification by context recognition Organization (Finance, Manufacturing…) CBIS (Computer, …) ‧‧‧ C11 Antecedents of the target (x) recognize the context of the document d. Antecedents of the error category (y) do not strongly accept the context of d. (Context Recognition) ‧‧‧ ‧‧‧ F 1 F 2 F 3 F F n F F F n-1 n+1 n+2 n+p DSS (Decision, …) MIS (Management, …) ‧‧‧ ‧‧‧ ‧‧‧ New Doc. ‧‧‧ Correct Target F 1 F 2 F 3 F F F F F n-1 n+1 n+2 n n+p (Incremental Mining) Main contributions: (1) Incremental adaptation (2) Efficient and higher-precision classification 2002/7/24 ACM KDD-2002