Software Agents for Web Mining FYP Project by: Shuchi Mittal Quek Siew Guat Patricia Professor: Franklin Fu
Organisation Software Agent 1. Monitor/Retrieval 2. Classification 3. Matching/Dispatch
Monitor/Retrieval Crawling relevant URL’s Search for required data and links HTML parsing and cleaning Information Extraction Transfer raw data to the Database
Classification Data Pre-processing Training Data and data cleaning Multi-class Classification of data Inserting classified data into the database Training Data SVM Learner Classifier (Model) Testing or unknown Data SVM Learn SVM multi-class Classify Class A Class B……….. Class n
Matching/Dispatch Query Parsing Database Transaction Data Cleaning Information retrieval from the database Interface and Display
Overall Process