Artificial Intelligence Final Project Text document Classification with new type Rule-based PLM Chang, Jung Woo Shin, Dong In Jung, Hyun Joon School of.

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

SINAI-GIR A Multilingual Geographical IR System University of Jaén (Spain) José Manuel Perea Ortega CLEF 2008, 18 September, Aarhus (Denmark) Computer.
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 7 Technologies to Manage Knowledge: Artificial Intelligence.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Iterative Optimization of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial Intelligence.
Decision Tree Algorithm
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Iterative Optimization and Simplification of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial.
Scalable and Distributed Similarity Search in Metric Spaces Michal Batko Claudio Gennaro Pavel Zezula.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Text Classification Using Stochastic Keyword Generation Cong Li, Ji-Rong Wen and Hang Li Microsoft Research Asia August 22nd, 2003.
Text Classification With Labeled and Unlabeled Data Presenter: Aleksandar Milisic Supervisor: Dr. David Albrecht.
Introduction to Data Mining Engineering Group in ACL.
TransRank: A Novel Algorithm for Transfer of Rank Learning Depin Chen, Jun Yan, Gang Wang et al. University of Science and Technology of China, USTC Machine.
Web Usage Mining Sara Vahid. Agenda Introduction Web Usage Mining Procedure Preprocessing Stage Pattern Discovery Stage Data Mining Approaches Sample.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Engineering Applications of Artificial Intelligence,
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
High-Performance Packet Classification on GPU Author: Shijie Zhou, Shreyas G. Singapura and Viktor K. Prasanna Publisher: HPEC 2014 Presenter: Gang Chi.
Comparing the Parallel Automatic Composition of Inductive Applications with Stacking Methods Hidenao Abe & Takahira Yamaguchi Shizuoka University, JAPAN.
Language Identification of Search Engine Queries Hakan Ceylan Yookyung Kim Department of Computer Science Yahoo! Inc. University of North Texas 2821 Mission.
Short Introduction to Machine Learning Instructor: Rada Mihalcea.
Boosting Neural Networks Published by Holger Schwenk and Yoshua Benggio Neural Computation, 12(8): , Presented by Yong Li.
COMPUTER-ASSISTED PLAGIARISM DETECTION PRESENTER: CSCI 6530 STUDENT.
What is a neural network? Collection of interconnected neurons that compute and generate impulses. Components of a neural network include neurons, synapses,
1 LiveClassifier: Creating Hierarchical Text Classifiers through Web Corpora Chien-Chung Huang Shui-Lung Chuang Lee-Feng Chien Presented by: Vu LONG.
Sampletalk Technology Presentation Andrew Gleibman
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
The Perceptron. Perceptron Pattern Classification One of the purposes that neural networks are used for is pattern classification. Once the neural network.
CSC 196k Semester Project: Instance Based Learning
1 Machine Learning 1.Where does machine learning fit in computer science? 2.What is machine learning? 3.Where can machine learning be applied? 4.Should.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Jun Li, Peng Zhang, Yanan Cao, Ping Liu, Li Guo Chinese Academy of Sciences State Grid Energy Institute, China Efficient Behavior Targeting Using SVM Ensemble.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Protein motif extraction with neuro-fuzzy optimization Bill C. H. Chang and Author : Bill C. H. Chang and Saman K. Halgamuge Saman K. Halgamuge Adviser.
Feature selection with Neural Networks Dmitrij Lagutin, T Variable Selection for Regression
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Learning Classifier Systems (Introduction) Muhammad Iqbal Evolutionary Computation Research Group School of Engineering and Computer Science Victoria University.
School of Computer Science 1 Information Extraction with HMM Structures Learned by Stochastic Optimization Dayne Freitag and Andrew McCallum Presented.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining massive document collections by the WEBSOM method Presenter : Yu-hui Huang Authors :Krista Lagus,
The Inference via DNA Computing Piort Wasiewicz et al. Proceedings of the 1999 Congress on Evolutionary Computation, vol. 2, pp Cho, Dong-Yeon.
Post-Ranking query suggestion by diversifying search Chao Wang.
Information Retrieval and Organisation Chapter 14 Vector Space Classification Dell Zhang Birkbeck, University of London.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Binary-tree-based high speed packet classification system on FPGA Author: Jingjiao Li*, Yong Chen*, Cholman HO**, Zhenlin Lu* Publisher: 2013 ICOIN Presenter:
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Mining Tag Semantics for Social Tag Recommendation Hsin-Chang Yang Department of Information Management National University of Kaohsiung.
Chapter 4. Analysis of Brain-Like Structures and Dynamics (2/2) Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans 09/25.
TUMOR BURDEN ANALYSIS ON CT BY AUTOMATED LIVER AND TUMOR SEGMENTATION RAMSHEEJA.RR Roll : No 19 Guide SREERAJ.R ( Head Of Department, CSE)
Relevant Document Distribution Estimation Method for Resource Selection Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University
Pre-processing Tasks for Rule- Based English-Korean Machine Translation System Sung-Dong Kim, Dept. of Computer Engineering, Hansung University, Seoul,
Scalable Multi-match Packet Classification Using TCAM and SRAM Author: Yu-Chieh Cheng, Pi-Chung Wang Publisher: IEEE Transactions on Computers (2015) Presenter:
Artificial Intelligence DNA Hypernetworks Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.
Automatic Video Shot Detection from MPEG Bit Stream
Prepared by: Mahmoud Rafeek Al-Farra
RESEARCH APPROACH.
Efficient Ranking of Keyword Queries Using P-trees
Information Retrieval
Source: Procedia Computer Science(2015)70:
Waikato Environment for Knowledge Analysis
What is Pattern Recognition?
Example: Academic Search
Prepared by: Mahmoud Rafeek Al-Farra
DNA computing on surfaces
A Small and Fast IP Forwarding Table Using Hashing
Support vector machine-based text detection in digital video
Presentation transcript:

Artificial Intelligence Final Project Text document Classification with new type Rule-based PLM Chang, Jung Woo Shin, Dong In Jung, Hyun Joon School of Computer Science and Engineering Seoul National University Presented by Jung Hyun Joon

Artificial Intelligence Final Project Contents Introduction Architecture Wet design scheme Performance Evaluation Conclusion References

Artificial Intelligence Final Project Introduction Classification Problem –Decision Tree & Version space learning.. –Some shortcomings Not include all possible rule sets, only focus part.. Vulnerable to noisy data In this paper –Utilize massive parallelism of DNA computing –Define rules as a element with 1 / 0 / don’t care –Make noise-tolerant classification system

Artificial Intelligence Final Project Architecture Rule-based PLM –training and test Rule-based PLM 의 전체적인 구조

Artificial Intelligence Final Project Model structure Property of target data Target Data and Model Structure 1010 Can be involved in class A, B or C Document i Class Tag + n digit binary bit + history count in Training 1010A B C07

Artificial Intelligence Final Project Training … … 1010A Training query 0000A0 0000B 01010A01010B01010C0****C0 … … 0000A0 0000B A1 1010B0 1010C0 ****C0 1010B … … 0000A0 0000B A1 1010B1 1010C0 ****C0 0000B … … 0000A0 0000B A1 1010B1 1010C0 ****C0 1010A … … 0000A0 0000B A2 1010B1 1010C0 ****C0

Artificial Intelligence Final Project Test 0000A 0000B 1010A 1010B 1010C ****C … … A Test query Class A – 12 / 17 Class B – 2 / 17 Class C – 3 / 17 Class A

Artificial Intelligence Final Project Wet-design Scheme Initial DNA strand 생성 과정

Artificial Intelligence Final Project Wet design Scheme training example set 생성 과정

Artificial Intelligence Final Project Wet design Scheme classification 과정

Artificial Intelligence Final Project Forward and Backward scheme to untrained query Comparison of the forward and backward model scheme

Artificial Intelligence Final Project Performance Evaluation.

Artificial Intelligence Final Project Performance Evaluation Average Classification Success RateCISI classification Success Rate

Artificial Intelligence Final Project Performance Evaluation CRAN Classification Success RateMED Classification Success Rate Cause of MED classification success rate 1.Preprocessing ( all zero term document delete ) 2.Sparse vector of term

Artificial Intelligence Final Project Conclusion Present new type rule-based PLM –Support the flexibility with don’t care property –Forward and backward search scheme to untrained query –Showing the similar performances compared with WEKA –Possibility of wet-design

Artificial Intelligence Final Project References Version Space Learning with DNA Molecules, Lim, H.-W. et al, LNCS, vol. 2568, pp , 2003 DNA computing on surfaces, Liu et al., Nature, 2000 A Bayesian Algorithm for In Vitro Molecular Evolution of Pattern Classifiers, Zhang, B.-T. and Jang, H.-Y., Preliminary Proceedings of the Tenth International Meeting on DNA Computing, pp , more papers and many web-sites