
Text Categorization


1 Text Categorization
Document classification assigns documents to one or more classes, which is useful in Information Retrieval (IR). IR is the task of storing, organizing, and accessing information.

2 Methods
Machine Learning: pattern recognition and computational learning in artificial intelligence, used to categorize documents.
Diagram labels: Evaluate, Cluster, Predict, Label
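The slide does not say which learning algorithm was used, so the following is only a minimal sketch of the general idea: learn a word profile per class from labeled documents, then predict the label of a new document. The word-overlap scoring, the function names, and the example texts are all assumptions made for illustration, not the presenters' method.

```python
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace; a real system would also strip punctuation.
    return text.lower().split()


def train(labeled_docs):
    """Build one word-count profile per class from (text, label) pairs."""
    profiles = {}
    for text, label in labeled_docs:
        profiles.setdefault(label, Counter()).update(tokenize(text))
    return profiles


def predict(profiles, text):
    """Return the class whose profile shares the most word occurrences with the text."""
    words = Counter(tokenize(text))

    def overlap(label):
        profile = profiles[label]
        return sum(min(count, profile[word]) for word, count in words.items())

    # sorted() makes ties resolve deterministically (alphabetically first label wins).
    return max(sorted(profiles), key=overlap)


# Hypothetical training data, invented for this example.
training = [
    ("randomized trial of aspirin in patients", "clinical"),
    ("patients received treatment dose", "clinical"),
    ("neural network training algorithm", "computing"),
    ("algorithm for sorting data", "computing"),
]
profiles = train(training)
label_a = predict(profiles, "new algorithm for network data")
label_b = predict(profiles, "trial in patients")
```

With this toy data the first query shares words only with the "computing" profile and the second only with the "clinical" profile, so each is labeled accordingly.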

3 Methods

4 Data
End = citation
Abstract words?
Cut-off?
(insert graph photo)

5 Not every paper is alike…

6 Cut-off!!!

7 Data
The 147 documents were evaluated by hand, searching for the following markers.
Abstract: report, Report, study, Study, studies, Studies, studied, Studied, updated, Updated, response, Response, analysis, Analysis, evaluated, Evaluated, background, Background, methods, Methods, results, Results, conclusions, Conclusions, abstract, Abstract, purpose, Purpose, experimental, Experimental, design, Design, patients and methods, Patients and Methods, Patients and methods, summary, Summary, findings, Findings, interpretation, Interpretation, objectives, Objectives, clusions, Clusions, aims, and Aims
Ending Words: Introduction, introduction, ©, Keywords, keywords, '/[0-9]{4}[;]/' ( ####; ), '/,\s?[0-9]{4}/' ( ,#### ), Classification of evidence, Classification of Evidence, Trial Registration, and Trial registration
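One way the marker lists above might be applied is sketched below: find the first abstract start word, then cut the text at the first ending marker that follows it. This is an assumption about how the lists were used, not the presenters' code. The function name `extract_abstract` is hypothetical, only a subset of the start words is included for brevity, and matching is case-insensitive rather than listing both capitalizations.

```python
import re

# Subset of the slide's abstract start words (matched case-insensitively).
START_WORDS = ["background", "abstract", "purpose", "objectives", "summary", "methods"]

# Ending markers from the slide: literal words plus the two year patterns.
END_PATTERNS = [
    r"\bIntroduction\b",
    r"\bKeywords\b",
    r"©",
    r"[0-9]{4}[;]",      # four digits followed by a semicolon, e.g. "2010;"
    r",\s?[0-9]{4}",     # comma, optional space, four digits, e.g. ", 2010"
    r"Classification of evidence",
    r"Trial registration",
]

START_RE = re.compile(r"\b(" + "|".join(START_WORDS) + r")\b", re.IGNORECASE)
END_RE = re.compile("|".join(END_PATTERNS), re.IGNORECASE)


def extract_abstract(text):
    """Return the text from the first start word up to the first ending marker."""
    start = START_RE.search(text)
    if start is None:
        return None  # no abstract marker found
    end = END_RE.search(text, start.end())
    stop = end.start() if end else len(text)
    return text[start.start():stop].strip()


# Hypothetical document text, invented for this example.
sample = (
    "Some Journal Title. Background: Aspirin was studied in adults. "
    "Results: Pain fell. Keywords: aspirin, pain. Introduction Pain is common."
)
abstract = extract_abstract(sample)
```

A cut-off this simple will stop early whenever an ending marker such as "introduction" or a year appears inside the abstract itself, which is presumably why the documents were also checked by hand.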

8 Evaluation: Precision and recall
Precision: Of the documents the classifier assigned to a class, how many actually belong to it?
Recall: Of the documents that actually belong to a class, how many did the classifier find? Was all the information extracted?
F1 measure: Precision and recall are combined into an F1 measure, their harmonic mean. This gives a single value by which to judge the classifier.
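The three measures can be computed from counts of true positives (TP), false positives (FP), and false negatives (FN): precision = TP / (TP + FP), recall = TP / (TP + FN), and F1 = 2 * precision * recall / (precision + recall). The sketch below applies these standard definitions to a single class; the function name and the label data are invented for illustration.

```python
def precision_recall_f1(true_labels, predicted_labels, positive):
    """Compute precision, recall, and F1 for one class treated as 'positive'."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)
    fp = sum(1 for t, p in pairs if p == positive and t != positive)
    fn = sum(1 for t, p in pairs if p != positive and t == positive)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return precision, recall, f1


# Hypothetical labels: five documents, hand labels versus classifier output.
truth = ["a", "a", "a", "b", "b"]
predicted = ["a", "a", "b", "a", "b"]
precision, recall, f1 = precision_recall_f1(truth, predicted, positive="a")
```

Here class "a" has 2 true positives, 1 false positive, and 1 false negative, so precision, recall, and F1 all come out to 2/3.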

