Download presentation
Presentation is loading. Please wait.
Published byAbraham Porter Modified over 9 years ago
1
Author : Jochen Dijrre, Peter Gerstl, Roland Seiffert Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, California, August 15-18, 1999, 398-401. Presented by Xxxxxx
2
Outline Motivation Methodology Feature Extraction Clustering and Categorizing Data Mining VS Text Mining Conclusion
3
Motivation Problem: Most of data in a company is unstructured or semi-structured Examples: Letters Emails Phone transcripts Contracts
4
Definition and Application Text mining: The discovery by computer of new, previously unknown information, by automatically extracting information from different written resources. Applications: Summarizing documents Discovering/monitoring relations among people Customer profile analysis Trend analysis Documents summarization
5
Methodology Aspect 1: Knowledge Discovery Aspect 2: Information Distillation Approaches: Extraction Analysis
6
Feature Extraction Recognize and classify significant vocabulary items from the text Categories of vocabulary Proper names Multiword terms Abbreviations Relations Other useful things
7
Clustering Model
8
Categorization Model
9
Data Mining VS Text Mining Data MiningText Mining GoalDiscover hidden modelsDiscover hidden facts MethodTries to generalize all of data into a single model Tries to understand the details, cross reference between individual instances FieldsMarketing, medicine, health care Biosciences, customer profile analysis
10
Conclusion Introduction of text mining Differences between data mining and text mining Overview of IBM’s Intelligent Miner for Text The tools and methods used in the past
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.