Seher Acer, Başak Çakar, Elif Demirli, Şadiye Kaptanoğlu.

Slides:



Advertisements
Similar presentations
kareRCIserIsviFIsaRsþGnuvtþKMerag
Advertisements

CROSSmeeting UM, Fev T5: Slicing Techniques Applied to OSS Pedro Rangel Henriques Daniela da Cruz Dep. de Informática/CCTC, Universidade do Minho,
FOR PROFESSIONAL OR ACADEMIC PURPOSES September 2007 L. Codina. UPF Interdisciplinary CSIM Master Online Searching 1.
DISCOVERING EVENT EVOLUTION GRAPHS FROM NEWSWIRES Christopher C. Yang and Xiaodong Shi Event Evolution and Event Evolution Graph: We define event evolution.
WIPO Patent Information Services
1 centre dÉtudes de la navigation aérienne COMMON METRICS FRAMEWORK FOR ATM PERFORMANCE ASSESSMENT AND MONITORING Almira Williams, CSSI Inc.
Teacher Name Class / Subject Date A:B: Write an answer here #1 Write your question Here C:D: Write an answer here.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 4.1 Chapter 4 : Searching the Web The mechanics.
Introduction Lesson 1 Microsoft Office 2010 and the Internet
An Online News Recommender System for Social Networks Department of Computer Science University of Illinois at Urbana-Champaign Manish Agrawal, Maryam.
LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
An Intelligent System for Dynamic Online TV Programming Allocation from TV Internet Broadcasting Thamar E. Mora, Rene V. Mayorga Faculty of Engineering,
26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
Business Development Suit Presented by Thomas Mathews.
M obile U ser I nterface Survey March 11, 2011 Hyojin Song.
Clustering k-mean clustering Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Document Clustering Carl Staelin. Lecture 7Information Retrieval and Digital LibrariesPage 2 Motivation It is hard to rapidly understand a big bucket.
Enhancing Spotfire with the Power of R
® Microsoft Office 2010 Browser and Basics.
Tae-Shick Wang; Kang-Sun Choi; Hyung-Seok Jang; Morales, A.W.; Sung-Jea Ko; IEEE Transactions on Consumer Electronics, Vol. 56, No. 2, May 2010 ENHANCED.
WEB MINING. Why IR ? Research & Fun
Chapter 13 The Data Warehouse
Personalized Navigation in the Semantic Web: An Enhanced Faceted Browser Michal Tvarožek FIIT STU BA.
Information Retrieval Lecture 7 Introduction to Information Retrieval (Manning et al. 2007) Chapter 17 For the MSc Computer Science Programme Dell Zhang.
Clustering Categorical Data The Case of Quran Verses
Multi-Phase Reasoning of temporal semantic knowledge Sakirulai O. Isiaq and Taha Osman School of Computer and Informatics Nottingham Trent University Nottingham.
Presenters: Başak Çakar Şadiye Kaptanoğlu.  Typical output of an IR system – static predefined summary ◦ Title ◦ First few sentences  Not a clear view.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
ADVISE: Advanced Digital Video Information Segmentation Engine
Creating Concept Hierarchies in a Customer Self-Help System Bob Wall CS /29/05.
Interfaces for Selecting and Understanding Collections.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Clustering Specific Issues related to Project 2. Reducing dimensionality –Lowering the number of dimensions makes the problem more manageable Less memory.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
1 Today  Tools (Yves)  Efficient Web Browsing on Hand Held Devices (Shrenik)  Web Page Summarization using Click- through Data (Kathy)  On the Summarization.
Searching and Researching the World Wide: Emphasis on Christian Websites Developed from the book: Searching and Researching on the Internet and World Wide.
Stuff I’ve Seen: A System for Personal Information Retrieval and Re-use by Seher Acer Elif Demirli Susan Dumais, Edward Cutrell, JJ Cadiz, Gavin Jancke,
Overview of Search Engines
1/16 Final project: Web Page Classification By: Xiaodong Wang Yanhua Wang Haitang Wang University of Cincinnati.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Presented by Tienwei Tsai July, 2005
Searching Computing Reviews Connie Ury
Presented by: Apeksha Khabia Guided by: Dr. M. B. Chandak
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
Fourth Edition Discovering the Internet Discovering the Internet Complete Concepts and Techniques, Second Edition Chapter 3 Searching the Web.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
1 Motivation Web query is usually two or three words long. –Prone to ambiguity –Example “keyboard” –Input device of computer –Musical instruments How can.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
An Introduction to the Study Centre’s One Stop Search Tool for all your resources.
Information Retrieval Lecture 6 Introduction to Information Retrieval (Manning et al. 2007) Chapter 16 For the MSc Computer Science Programme Dell Zhang.
1 Modeling Long Distance Dependence in Language: Topic Mixtures Versus Dynamic Cache Models Rukmini.M Iyer, Mari Ostendorf.
Summarizing Encyclopedic Term Descriptions on the Web from Coling 2004 Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
CP3024 Lecture 12 Search Engines. What is the main WWW problem?  With an estimated 800 million web pages finding the one you want is difficult!
Clustering Algorithms Sunida Ratanothayanon. What is Clustering?
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Welcome to the ACM Portal. the ACM Digital Library (ACM DL) Gateway to: Access to The Guide to Computing Literature (citations-only database) is not included.
Investigate Plan Design Create Evaluate (Test it to objective evaluation at each stage of the design cycle) state – describe - explain the problem some.
Data Mining and Text Mining. The Standard Data Mining process.
Visual Information Retrieval
A Deep Learning Technical Paper Recommender System
Data Clustering Michael J. Watts
Martin Rajman, EPFL Switzerland & Martin Vesely, CERN Switzerland
Text Categorization Berlin Chen 2003 Reference:
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presentation transcript:

Seher Acer, Başak Çakar, Elif Demirli, Şadiye Kaptanoğlu

Introduction Motivation Aspect Based Clustering Modeling Aspects Aspect Extraction Framing Cycle-Aware Clustering User Interface & Demo Conclusion References 2/14

News are produced in multiple stages: Gathering, writing, editing, etc. Subjective opinion of producers, owners, advertisers – biased environment Effort needed for a comprehensive and balanced understanding of a news event A system that guides and encourages reader to read news from different perspectives 3/14

Current systems provide limited presentation of news Listing news arbitrarily or according to date A system that helps users reach news from different viewpoints via a single portal Capture the difference of aspects within articles reporting a common news story Use of advanced computational techniques of information retrieval 4/14

5/14

Aspect: keyword-weight pairs Keywords are extracted from Head, sub-head, lead GATE (General Architecture for Text Engineering) Person, organization, location Event extraction (Zemberek) Frequently used action words/phrases 6/14

7/14

Set of articles on a news shows head-tail characteristics Head – common aspects Tail – uncommon aspects Separation of head and tail provides effective classification Two steps: Head-tail partitioning Tail-side clustering 8/14

Generate common-uncommon keyword sets HgP: head group proportion Calculate keyword commonness & uncommonness Commonness – an article with many common keywords with high weight values Uncommonness - an article with many uncommon keywords with high weight values 9/14

Agglomerative hierarchical clustering Similarity measure – Cosine similarity During Agglomerative Clustering Each object forms a cluster of its own as a singleton Pairs of clusters are merged iteratively until a certain stopping criterion is met In the merging process - the similarity between two clusters is measured by the similarity of the most similar pair of sequences belonging to these two clusters (the single-link approach) 10/14

Simple & user-friendly Present news from different aspects fairly Motivate reader to read news from different aspects 11/14

Existing systems: Google news, Yahoo News Limited presentation News listed arbitrarily Proposed system: Gathers same news with existing systems Clusters news according to aspects Simple user interface Easy to track news stories The approach is suitable for Turkish news 12/14

[1] Park, S., Kang, S., Lee, S., Chung, S., Song, J. Mitigating Media Bias: A Computational Approach. ACM, 2008, pp [2] Park, S., Kang, S., Chung, S., Song, J. NewsCube: Delivering Multiple Aspects of News to Mitigate Media Bias. ACM, [3] Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V. GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics. ACL'02, [4] Park, S., Lee, S., Song, J. Aspect-level News Browsing: Understanding News Events from Multiple Viewpoints. ACM, 2010, pp /14