Published by Widyawati Budiaman. Modified over 6 years ago.
1
Text Categorization: Document classification assigns documents to one or more classes, which is useful in Information Retrieval (IR). IR is the task of storing, organizing, and accessing information.
2
Methods: Machine Learning: pattern recognition and computational learning, from artificial intelligence, used to categorize documents. Evaluate, Cluster, Predict, Label.
3
Methods
4
Data: End = citation. Abstract words? (insert graph photo) Cut-off?
5
Not every paper is alike…
6
Cut-off!!!
7
Data: The 147 documents were evaluated by hand, searching for…
Abstract: report, Report, study, Study, studies, Studies, studied, Studied, updated, Updated, response, Response, analysis, Analysis, evaluated, Evaluated, background, Background, methods, Methods, results, Results, conclusions, Conclusions, abstract, Abstract, purpose, Purpose, experimental, Experimental, design, Design, patients and methods, Patients and Methods, Patients and methods, summary, Summary, findings, Findings, interpretation, Interpretation, objectives, Objectives, clusions, Clusions, aims, and Aims
Ending Words: Introduction, introduction, ©, Keywords, keywords, '/[0-9]{4}[;]/' ( ####; ), '/,\s?[0-9]{4}/' ( ,#### ), Classification of evidence, Classification of Evidence, Trial Registration, and Trial registration
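One way the word lists above could be used is sketched below: find the first abstract word, then cut the text at the first ending marker that follows it. This is only an illustration under stated assumptions, not the procedure used for the 147 documents. The function name `extract_abstract`, the choice of a small subset of start words, and the case-insensitive matching (instead of listing both capitalizations) are all assumptions; the two year patterns are the ones from the slide with their `/…/` delimiters removed.

```python
import re

# Assumption: a small subset of the slide's abstract words, matched
# case-insensitively rather than listing each capitalization.
START_WORDS = ["abstract", "background", "summary", "purpose", "objectives", "methods"]

# Ending markers from the slide. The last two are the slide's year patterns
# ( ####; ) and ( ,#### ) written without the /.../ delimiters.
END_PATTERNS = [
    r"introduction",
    r"keywords",
    r"©",
    r"classification of evidence",
    r"trial registration",
    r"[0-9]{4}[;]",
    r",\s?[0-9]{4}",
]

START_RE = re.compile(r"\b(?:" + "|".join(START_WORDS) + r")\b", re.IGNORECASE)
END_RE = re.compile("|".join(END_PATTERNS), re.IGNORECASE)


def extract_abstract(text):
    """Return the span from the first start word up to the first ending
    marker after it, or None if no start word is found (hypothetical helper)."""
    start = START_RE.search(text)
    if start is None:
        return None
    # Look for an ending marker only after the start word itself.
    end = END_RE.search(text, start.end())
    stop = end.start() if end else len(text)
    return text[start.start():stop].strip()


if __name__ == "__main__":
    sample = "Some Journal. Abstract Background: Drug X was studied. Keywords: drug"
    print(extract_abstract(sample))
```

If no ending marker follows the start word, the sketch keeps everything to the end of the text, which is where a length cut-off could be applied instead.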
8
Evaluation: Precision and recall
Precision: Of the documents the classifier assigned a label, what fraction were labeled correctly? Recall: Of the documents that truly carry that label, what fraction did the classifier find? That is, was all the relevant information extracted? F1 measure: Precision and recall are combined, as their harmonic mean, into an F1 measure. This gives a single value for judging the classifier.
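The three measures can be computed from counts of true positives, false positives, and false negatives for one class. The sketch below uses the standard definitions, with F1 as the harmonic mean of precision and recall; the function name `precision_recall_f1` and the example labels are illustrative assumptions, not data from the presentation.

```python
def precision_recall_f1(true_labels, predicted_labels, positive):
    """Compute precision, recall, and F1 for the class `positive`
    (hypothetical helper using the standard definitions)."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)
    fp = sum(1 for t, p in pairs if p == positive and t != positive)
    fn = sum(1 for t, p in pairs if p != positive and t == positive)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up labels: 1 = document belongs to the class, 0 = it does not.
    truth = [1, 1, 0, 0]
    predicted = [1, 0, 0, 0]
    print(precision_recall_f1(truth, predicted, positive=1))
```

In the made-up example, every document the classifier labeled is correct, but it found only one of the two documents in the class, so precision is high and recall is lower; F1 falls between the two.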