S YMPOSIUM ON B IAS AND D IVERSITY IN IR (aka L IVING K NOWLEDGE S UMMER S CHOOL ) LivingKnowledge Consortium ESSIR Summer School 2011 August 31, 2011,

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Web Search Results Visualization: Evaluation of Two Semantic Search Engines Kalliopi Kontiza, Antonis Bikakis,
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
Copyright © 2005, SAS Institute Inc. All rights reserved. Creating Dashboards with the SAS ® Information Delivery Portal Don Chapman Development Manager.
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Rutgers Components Phase 2 Principal investigators –Paul Kantor, PI; Design, modelling and analysis –Kwong Bor Ng, Co-PI - Fusion; Experimental design.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Web Mining Research: A Survey
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
An Introduction to Software Visualization Dr. Jonathan I. Maletic Software DevelopMent Laboratory Department of Computer Science Kent State University.
Introduction Web Development II 5 th February. Introduction to Web Development Search engines Discussion boards, bulletin boards, other online collaboration.
Overview of Search Engines
Yuri de Lugt Collexis Karin Clavel TU Delft Library.
Text Analytics And Text Mining Best of Text and Data
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Institute of Informatics and Telecommunications – NCSR “Demokritos” Bootstrapping ontology evolution with multimedia information extraction C.D. Spyropoulos,
Web Science and Web Archive L3S Wolfgang Nejdl L3S Research Center Hannover, Germany.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Personalized Interaction with Web Resources First Sino-German Symposium on KNOWLEDGE HANDLING: REPRESENTATION, MANAGEMENT AND PERSONALIZED APPLICATION.
Tag Data and Personalized Information Retrieval 1.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
21/11/2002 The Integration of Lexical Knowledge and External Resources for QA Hui YANG, Tat-Seng Chua Pris, School of Computing.
® GRDC Hydrologic Metadata - core concepts - 5 th, WMO/OGC Hydrology DWG New York, CCNY, August 11 – 15, 2014 Irina Dornblut, GRDC of WMO at BfG Copyright.
VIKEF – Take the VIKEF train towards smart services …
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Visualizing Ontology Components through Self-Organizing.
An MPEG-7 Based Content- aware Album System for Consumer Photographs 2003/12/18 Chen-Hsiu Huang, Chih-Hao Shen, Chun-Hsiang Huang and Ja-Ling Wu Communication.
DemocracyApps, Inc. Community Budget Explorer A Technical Overview.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Search Engine Architecture
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Vidispine Data Model Vidispine Bootcamp. Overview Collection Storage File Item Shape Item Component Shape Component Metadata abstract entity physical.
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
ARD Prasad Indian Statistical Institute, Bangalore.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Information Dynamics & Interoperability Presented at: NIT 2001 Global Digital Library Development in the New Millennium Beijing, China, May 2001, and DELOS.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
A Generalized Architecture for Bookmark and Replay Techniques Thesis Proposal By Napassaporn Likhitsajjakul.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Colby Smart, E-Learning Specialist Humboldt County Office of Education
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Ricardo EIto Brun Strasbourg, 5 Nov 2015
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Webtech has 3 main pages: WebTech Home, Students’ Page and Module’s Home. Using the top buttons, you can quickly navigate different levels. Each module.
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Semantic Visualization
Search Engine Architecture
Embedding Knowledge in HTML
(VIP-EDC) Point 6 of the agenda
Web Mining Department of Computer Science and Engg.
Web Mining Research: A Survey
Information Retrieval and Web Design
Internet Skills ELEC135 Alan Noble Room 504 Tel:
Presentation transcript:

S YMPOSIUM ON B IAS AND D IVERSITY IN IR (aka L IVING K NOWLEDGE S UMMER S CHOOL ) LivingKnowledge Consortium ESSIR Summer School 2011 August 31, 2011, Koblenz, Germany

Bias and Diversity in IR, August 31, What are the different perspectives? What are the different perspectives? Are some sources biased? Are some sources biased? What types of images are used

VISION Diversity and bias in the Web today: High diversity of content and content creators But: no systematic support to explore the diversity Vision of Diversity-related research: Make diversity, bias and evolution traceable, understandable and exploitable Aim and scope of this symposium: Give an overview of leading-edge technology for dealing with diversity and bias in IR 3Bias and Diversity in IR, August 31, 2011

CONTEXT: LIVING KNOWLEDGE PROJECT Part of the presented material are relevant results from the LivingKnowledge project EU FET Project since February 2009 Project Goal: Improve navigation and search in large cross-media datasets and the Web by considering diversity, time, opinions and bias Relevant project results: – Testbed with more than 40 components (will be shown and used today) – Innovative methods and algorithms, e.g sentiment extraction from images, image forensics, contextualization of facts – First applications of diversity-aware technology, e.g. Time explorer Bias and Diversity in IR, August 31, 20114

CONTEXT: LIVING KNOWLEDGE PROJECT Relevant project results: – Testbed with more than 40 components (will be shown and used today) – Many innovative methods and algorithms, e.g. sentiment extraction from images, image forensics, contextualization of facts; – First applications of diversity-aware technology, e.g. Time explorer Bias and Diversity in IR, August 31, ModuleStatusDocumentation HTML Content AnalyserIn testbedYes Codebook AggregatorIn testbedYes URL ExtractorIn testbedYes Document Layout ExtractorIn testbedYes Places AnnotatorIn testbedYes Entity DisambiguatorIn testbedYes EXIF Image metadata ExtractorIn testbedYes Image ColourfulnessIn testbedYes Image NaturalnessIn testbedYes image Sentiment AnnotatorIn testbedYes Quote ExtractorIn testbedYes Phrase ExtractorIn testbedYes TimeML AnnotatorIn testbedYes Dictionary AnnotatorIn testbedYes Fact AnnotatorIn testbedYes ‘Best Image’ AnnotatorIn testbedNo Contrast adjustment impacts on opinion perceived by the users. I mpact of Contrast adjustment on opinion perceived by the users

SYMPOSIUM STRUCTURE & OVERVIEW Structured into 3 parts Part I: Discovering Facts and Opinions and their Evolution over Time Part II: Discovering Dias and Diversity in Multimedia Content Part III: A Testbed for Diversification in Search – Demonstration of selected technology – Time for further in depth discussions Bias and Diversity in IR, August 31, 20116

Part I: EXTRACTION OF OPINIONS FROM THE WEB Introduces basic concepts in opinion extraction Overview of standard coarse- and fine-grained opinion extraction methods Introduction to domain adaptation Overview of opinion resources Shows some recent research results in the area 7Bias and Diversity in IR, August 31, 2011

PART I: TEMPORAL FACT EXTRACTION, DISAMBIGUATION, AND EVOLUTION Factual knowledge evolves over time Bias and Diversity in IR, August 31, Bodybuilder Politician Actor Goal: Exploring the temporality of entities  Extract entities from Web documents  Disambiguate them to canonical entities  Aggregate facts to investigate their dynamics 8

Why look at images? Image representation and understanding Information extraction techniques PART II: A BRIEF INTRODUCTION TO EXTRACTING INFORMATION FROM IMAGES INFORMATION EXTRACTION FROM MULTIMEDIA CONTENT ON THE SOCIAL WEB How to exploit combined information from visual data and meta data Photos on the social web: attractiveness, maps and sentiment Videos on the social web: tagging 9Bias and Diversity in IR, August 31, 2011

FROM THEORY TO PRACTICE Part III: A BRIEF INTRODUCTION TO THE DIVERSITY ENGINE TESTBED Learn how to run these tools and visualize the results Add your own tools Build diversity-aware applications 10Bias and Diversity in IR, August 31, 2011