Presented by: Dardan Xhymshiti Fall 2015.  Authors: Eli Cortez, Philip A.Bernstein, Yeye He, Lev Novik (Microsoft Corporation)  Conference: VLDB  Type:

Slides:



Advertisements
Similar presentations
PRIMIS Third National Conference Tuesday 1 April 2003 Birmingham Rush for Practices A new GP Data Analysis Tool from PRIMIS Dr John Williams Dr John Robinson.
Advertisements

Discovering Queries based on Example Tuples
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
© 2009 Oracle Corporation – Proprietary and Confidential 1.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Automatic Data Ramon Lawrence University of Manitoba
Important Task in Patents Retrieval Recall is an Important Factor Given Query Patent -> the Task is to Search all Related Patents Patents have Complex.
©2012 Microsoft Corporation. All rights reserved..
Alonso Robles Solutions Architect speakTECH
Databases & Data Warehouses Chapter 3 Database Processing.
Case Study Complete and integrated BI and Performance Management offering Agile products that adapt to how your need the.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Databases and LINQ Visual Basic 2010 How to Program 1.
Search Engines and Information Retrieval Chapter 1.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Automated Creation of a Forms- based Database Query Interface Magesh Jayapandian H.V. Jagadish Univ. of Michigan VLDB
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Michael Cafarella Alon HalevyNodira Khoussainova University of Washington Google, incUniversity of Washington Data Integration for Relational Web.
Using SAS® Information Map Studio
Join Synopses for Approximate Query Answering Swarup Achrya Philip B. Gibbons Viswanath Poosala Sridhar Ramaswamy Presented by Bhushan Pachpande.
Text Mining In InQuery Vasant Kumar, Peter Richards August 25th, 1999.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
© 2015 Ascendum Solutions. All rights reserved. Welcome To Create Dazzling End-user applications using SharePoint Search Speaker: Bill Crider #sharepointcincy2015.
Instructor: Dema Alorini Database Fundamentals IS 422 Section: 7|1.
Microsoft Access Database Software.
Using Special Features to Solve Projects Robert Posener.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Question Answering over Implicitly Structured Web Content
1 Why Learn About Database Systems? Database systems process and organize large amounts of data Examples –Marketing manager can access customer data –Corporate.
Database Systems Basic Data Management Concepts
Facilitating Document Annotation using Content and Querying Value.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
OFC291 Microsoft® Office Word XML (part 1 of 3): Introduction Martin Sawicki Lead Program Manager.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
1 CS 430: Information Discovery Sample Midterm Examination Notes on the Solutions.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
3/18: Microsoft Access Refresher: What is a relational database? Why use a database? Sample database in MS access. –Fields, records, attributes. –Tables,
Presented by: Ashkan Malekloo Fall  Type: Demonstration paper  Authors:  VLDB 15 Daniel Haas, Sanjay Krishnan, Jiannan Wang, Michael J. Franklin,
SimDB Implementation & Browser IVOA InterOp 2008 Meeting, Theory Session 1. Baltimore, 26/10/2008 Laurent Bourgès This work makes use of EURO-VO software,
EMEA Beat Schwegler Architect Microsoft EMEA HQ Ingo Rammer Principal Consultant thinktecture
How Can I Use This Method? 2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING HOW.
Declarative Languages and Model Based Development of Web Applications Besnik Selimi South East European University DAAD: 15 th Workshop “Software Engineering.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Presented by: Dardan Xhymshiti Fall  Paper type: Demo  Authors: Ashoke S., Jayant R. Harits (Indian Institute of Science, Bangalore)  Conference:
Jennifer Widom Relational Databases The Relational Model.
Presented by: Dardan Xhymshiti Fall  Authors: Dinesh Das, Jiaqi Yan, Mohamed Zait, Satyanarayana R Valluri, Nirav Vyas, Ramarajan Krishnamachari,
EMEA Beat Schwegler Architect Microsoft EMEA HQ Ingo Rammer Principal Consultant thinktecture
Title of Your Project Team Member 1 Team Member 2.
Facilitating Document Annotation Using Content and Querying Value.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
©NIIT BCP and DTS Implementing Stored Procedures Lesson 2A / Slide 1 of 23 Objectives In this lesson, you will learn to: Perform bulk copy using the BCP.
From Frequency to Meaning: Vector Space Models of Semantics
Metadata.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Lisa Ruff Business Productivity/Accessibility TS Microsoft Federal
Relational Databases The Relational Model.
Relational Databases The Relational Model.
DocuMelt 2.0 An EmpFinesseTM Fundamentals Solution.
Declarative Creation of Enterprise Applications
KDD Reviews 周天烁 2018年5月9日.
Data Integration for Relational Web
Module P4 Identify Data Products and Views So Their Requirements and Attributes Can Be Controlled Learning Objectives: Understand the value of data. Understand.
Spreadsheets, Modelling & Databases
Microsoft Access Date.
Contexualized Data In Document Authoring
Presentation transcript:

Presented by: Dardan Xhymshiti Fall 2015

 Authors: Eli Cortez, Philip A.Bernstein, Yeye He, Lev Novik (Microsoft Corporation)  Conference: VLDB  Type: Demonstration

 Data discovery of relevant information in relational databases.  Problem of generating reports.  To find relevant information, users have to find the database tables that are relevant to the task, for each of them understand its content to determine whether it is truly relevant etc.  The schema’s table and column names are often not very descriptive of the content.  Example: In 412 data columns in 639 tables from 29 databases used by Microsoft’s IT organization, 28% of all columns were very generic as: name, it, description, field, code

 A typical corporate database table with generic column names

 Such non-descriptive column names make it difficult to search and understand the table.  One solution: using data stewards to enrich the database tables and columns with textual description.  Time consuming  Ignoring databases that are less frequent.

 Barcelos: automatically annotate columns of database tables.  Annotate even those tables that are not frequent.  How it works?  It works by mining spreadsheets.  Many of these spreadsheets are generated by queries.  It uses spreadsheet’s column names as candidate annotations for the corresponding database columns.  For the table above Barcelos produces annotations:  TeamID and Team for the first column,  Delivery Team and Team for the second column.  Line of Business and Business for the third.

 The authors have provided a method to for extracting relevant tables from an enterprise database.  A method for identifying and ranking relevant column annotations.  An implementation of Barcelos and an experimental evaluation that shows its efficiency and effectiveness.