Kerim KORKMAZ A. Tolga KILINÇ H. Özgür BATUR Berkan KURTOĞLU
Outline What is MindGate? Motivation for MindGate Project Features of System Parts of The System User scenerios Review
What is MindGate? MindGate : Is an expert system Is a content filtering gateway Is a complete software. Categorizes requested web pages
MindGate : Controls web access of an organization Blocking harmful content Caching for fast access Keeping user actions’ statistics What is MindGate?
Demand for content filtering: Schools Business Organizations Governmental Organizations Motivation for The MindGate Project
MindGate IS NOT A security tool About server side filtering A script based admin tool A bandwith limiter A P2P blocker
Features of MindGate Inspecting all requests and responses on the web traffic Filtering connections according to predefined policies. User groups, web site categories and filter thresholds Smart filtering techniques for preventing users entering sites according to policies. Tracking of the user actions and generating satistical data
Key Features of MindGate Expert system for categorization Improves accuracy in time, with feedback mechanisms and statistic accumulated during a time period. Using simple but effective heuristics to categorize the web pages. Highly modularized categorization unit.
Key Features of MindGate Expanding website checklist Build by categorizer in time Corrected by user feedbacks Faster, because all lists and tables are not stored on an SQL server but stored as XML documents with a completely indexed manner on the RAM.
Key Features of MindGate Special Indexing Techniques For Fast Access Using Apache Lucene API Ram Index for Checklist Disk Indexing for Log Archive Complex queries can easily be applied to all these indexed files for searching or listing purposes.
Key Features of MindGate Threaded Structure Multi user environment Concurrency Synchronisation
Parts Of The System Core System Embedded proxy and web server: Connection Control & Admin Interface Archive File System SQL Server: User Account Data
Modules of The System Connection Control: requests and responses Database Control: static index checking, user authentication, and log archiving. Categorizer: dynamic categorization Interface Control: user interaction, page generation
Connection Control Handles requests and responses Interface to proxy Handles only HTTP Protocol by
Database Control Holds Data: (All indexed) Checklist (URL - category...) SessionTable (user - ip number...) GroupInfo Table (user - group...)
Database Control Interface to: SQL Database Archive File System(Logs)
Archive Files
Categorizer Invoked when a page not found in checklist Evaluation by a heuristic equation –Structural Criteria (Colors,Scripts,Links) –Content Criteria (Meta Data)
Interface Control User interaction Administrator interaction Page generation Interface to embedded web server
User (Administrator) Interface
MindGate Status Screen
Architecture Manager Screen
Add New User
Add New Group
Add New Category Screen
Check Lists Screen
Internet Activities Screen
Tolga’s Categries result
Tolga’s URL List
MindGate Status Screen
What is next? More sophisticated categorization, new heuristics Categorized Data Pool System, to share categorized data Extending MindGate to handle other protocols (ie. ftp,pop3,…) Please! Stay Tuned :)