Usability and Integration H. V. Jagadish. Many Sources of Data Text XML/semi-structured Experimental measurements Public databases Some data may have.

Slides:



Advertisements
Similar presentations
College of Information Technology & Design
Advertisements

Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Relational Algebra Chapter 4, Part A.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Learning Goal Understand the evolution of complexity.
Management Information Systems, Sixth Edition
© 2005 Prentice Hall12-1 Stumpf and Teague Object-Oriented Systems Analysis and Design with UML.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
Graphical User Interface (GUI) Nelson Padua-Perez Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
Attribute databases. GIS Definition Diagram Output Query Results.
Topics in Computational Biology (COSI 230a) Pengyu Hong 09/02/2005.
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Mehnaaz Asad IB2-3. Data logging is the collection of data over a period of time, and is something often used in scientific experiments. Data logging.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
Graph Algebra with Pattern Matching and Aggregation Support 1.
Microsoft Access Ervin Ha.
Microsoft Access Database software. What is a database? … a database is an organized collection of data. A collection of data of similar information compiled.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 5 Data Resource Management.
Microsoft Visual Basic 2012 CHAPTER ONE Introduction to Visual Basic 2012 Programming.
Microsoft Visual Basic 2005 CHAPTER 1 Introduction to Visual Basic 2005 Programming.
CRIUS: User-Friendly Database Design Li (Eric) Qian, Kristen LeFevre, H. V. Jagadish University of Michigan, Ann Arbor.
Chapter 4: Organizing and Manipulating the Data in Databases
Team Member: Dakuo Wang, Li Zhang, Xuejie Sun, Yang Liu NETWORK INFORMATION BASE (NIB) VISUALIZATION SYSTEM.
Chapter 11 Databases. 11 Chapter 11: Databases2 Chapter Contents  Section A: File and Database Concepts  Section B: Data Management Tools  Section.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
1 Mining surprising patterns using temporal description length Soumen Chakrabarti (IIT Bombay) Sunita Sarawagi (IIT Bombay) Byron Dom (IBM Almaden)
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 8: Quantitative.
Chapter 9 Designing Databases Modern Systems Analysis and Design Sixth Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich.
Chapter 4: Organizing and Manipulating the Data in Databases
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Unit 1 – Lesson 3 (Representing Data)
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Visualization and analysis of microarray and gene ontology data with treemaps Eric H Baehrecke, Niem Dang, Ketan Babaria and Ben Shneiderman Presenter:
Domain and Persistence Patterns. Fundamental Pattern Types Design Patterns Business Logic Patterns.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Making Database Systems Usable H.V. Jagadish Adriane Chapman Aaron Elkiss Magesh Jayapandian Yanyao Li Arnab Nandi Cong Yu By Shahana Shamim.
1:2 The Scientific Method. Step 1: State the Problem  Based on observations (gathering information using your senses)  Written in the form of a question:
Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,
1 Relational Algebra Chapter 4, Sections 4.1 – 4.2.
Database Management Systems (DBMS)
Project Estimation techniques Estimation of various project parameters is a basic project planning activity. The important project parameters that are.
CIS/SUSL1 Fundamentals of DBMS S.V. Priyan Head/Department of Computing & Information Systems.
Steven Seida How Does an RDF Knowledge Store Compare to an RDBMS?
11 Introduction to Neo4j. 2 We all have our own graphs...
Prepared by : Mazhar javed Classification of Software Prepared By Prepared By Mazhar Javed Awan Mazhar Javed Awan (Computer Instructor) (Computer Instructor)
 Students have the opportunity to gain information through different forms of representation.
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
1 LM 7 Data Warehouse Dr. Lei Li. Learning Objectives Describe the needs for data warehouse Describe the three levels of a data warehouse Explain the.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Data mining in web applications
NoSQL: Graph Databases
CS4222 Principles of Database System
Decision Support Systems
Introduction to Visual Basic 2008 Programming
Tomáš Jurníček, Jakub Jůza, Lenka Kmeťová
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
机器感知与智能教育部重点实验室学术报告 Key Laboratory of Machine Perception (Minister of Education) Peking University Scalable, Robust and Integrative Algorithms for Analyzing.
CIS16 Application Development – Programming with Visual Basic
Relational Algebra Chapter 4, Sections 4.1 – 4.2
Exploratory search: New name for an old hat?
Fading Schemas… Alon Y. Halevy.
Presentation transcript:

Usability and Integration H. V. Jagadish

Many Sources of Data Text XML/semi-structured Experimental measurements Public databases Some data may have time/space variation Need to make sense of this big mess

Find Patterns in Data Conventional data mining seeks patterns that can be mathematically specified over (usually) global extents. Typically assume simple data structure. Need new approaches to find patterns in messy data.

Human in the Loop Hard for a machine to tell an interesting pattern apart from one that is not. Problem exacerbated when we seek smaller/localized patterns, or work with large vocabularies of possible patterns. Need human in the loop to make this judgment.

Computer-Assisted (Human) Analytics Patterns found by human and not by computer. Job of computer is to make patterns easy to find. So computer system must effectively support queries and display results. Eg.Visual Analytics

Organize Data for Analysis Join multiple complex temporal data streams into a “windowed” model suitable for efficient analysis. [Manish Singh] Permit organic change to schema as information needs evolve. [Eric Qian] Provide a spreadsheet interface for direct manipulation of complex and large data. Choose small sets of representatives effectively. [Ben Liu]

Access Data for Analysis Under-specified queries, particularly keyword queries. Derive “qunit” as response unit, mined from observed query logs. [Arnab Nandi] Visual manipulation algebra for analyzing large time-varying graphs with data on nodes and edges. [Anna Shaverdian]

Scientific Data Analysis Explain analysis results in terms of source data, even when the source may have been updated since. [Jing Zhang] Analyze gene expression microarray data, and electronic health record data, in light of known biomedical knowledge. [Fernando Farfan]