Pixel Visualization of keyword search results in large email databases. Jay Koven Fall 2013.

Slides:



Advertisements
Similar presentations
Using CLUUZ. © 2008 Sprylogics International Corp. Enter your search term/terms. By default, CLUUZ will extract and display people, companies, phone numbers,
Advertisements

Seeing and Organizing Identity Online thoughts on digital context, perception of self and identity management.
XML DOCUMENTS AND DATABASES
Accessing and Using the e-Book Collection from EBSCOhost ® When an arrow appears, click to proceed to the next slide at your own pace. To go back, click.
Technical BI Project Lifecycle
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
Universal Search and Social Networking Exploiting the features of each to enhance the other and the tools that make it possible Peter Wallqvist Ravn Systems.
INFORMATION MURAL A technique for displaying and navigating large information spaces Dean F. Jerding and John T. Stasko Graphics, Visualization, and Usability.
Learning similarity measure for natural image retrieval with relevance feedback Reporter: Francis 2005/3/3.
Small Displays Nicole Arksey Information Visualization December 5, 2005 My new kitty, Erwin.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Information Retrieval in Practice
ClearEye: An Visualization System for Document Revision CPSC 533C Project Update Qiang Kong Qixing Zheng.
1 © 2005 Nokia V1-MSI-SL.ppt / / SL Finding Communication Hot Spots of Location Based Postings Mobile Spatial Interaction Workshop ,
© Anselm Spoerri Lecture 13 Housekeeping –Term Projects Evaluations –Morse, E., Lewis, M., and Olsen, K. (2002) Testing Visual Information Retrieval Methodologies.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Systems Analysis & Design Sixth Edition Systems Analysis & Design Sixth Edition Toolkit Part 5.
Methodology Conceptual Database Design
Overview of Search Engines
CORE 2: Information systems and Databases STORAGE & RETRIEVAL 2 : SEARCHING, SELECTING & SORTING.
Some extra tips on using LexisNexis® Butterworths.
Data Mining Techniques
NWU: Helpdesk Call handling ITC Training: Session 1 -Call Logging and Remedy -Campus Helpdesks and Escalation -Remedy Solution Database -Remedy Mechanisms.
Inspiration InspireData Kidspiration Available in Tech 204 Tech 210 Success Connection.
Solutions Summit 2014 Discrepancy Processing & Resolution Terri Sullivan.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
An Online Knowledge Base for Sustainable Military Facilities & Infrastructure Dr. Annie R. Pearce, Branch Head Sustainable Facilities & Infrastructure.
Sharad Oberoi and Susan Finger Carnegie Mellon University DesignWebs: Towards the Creation of an Interactive Navigational Tool to assist and support Engineering.
Nobody’s Unpredictable Ipsos Portals. © 2009 Ipsos Agenda 2 Knowledge Manager Archway Summary Portal Definition & Benefits.
Support.ebsco.com Basic Searching for K-12 School Libraries Tutorial.
Software Engineering Chapter 16 User Interface Design Ku-Yaw Chang Assistant Professor Department of Computer Science and Information.
Software Life Cycle Requirements and problem analysis. –What exactly is this system supposed to do? Design –How will the system solve the problem? Coding.
Introduction of Geoprocessing Topic 7a 4/10/2007.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
MSF Design Example Conceptual Design Logical Design Physical Design.
GISMO/GEBndPlan Overview Geographic Information System Mapping Object.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Systems Analysis and Design in a Changing World, 6th Edition
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
Google Image Search, Code, Fusion Tables Audrey and Chris.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Daniel A. Keim, Hans-Peter Kriegel Institute for Computer Science, University of Munich 3/23/ VisDB: Database exploration using Multidimensional.
Intelligent Agent Framework1 From Chapter 7 of Constructing Intelligent Agents with Java.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Pumpkin Math. Mathematical Investigations Has multidimensional content Is open-ended, with several acceptable solutions Is an exploration requiring a.
© 2013 IBM Corporation IBM Tivoli Composite Application Manager for Transactions Transaction Tracking Best Practice for Workspace Navigation.
MBAT User Workflows View an Atlas Open Data Upload Data Run a Query –Search Data Further Examination Microarray Data Further Examination of 2D Data –Search.
Uncovering Clusters in Crowded Parallel Coordinates Visualizations Alimir Olivettr Artero, Maria Cristina Ferreiara de Oliveira, Haim levkowitz Information.
111 Subsystems CS 4311 Wirfs Brock et al., Designing Object-Oriented Software, Prentice Hall, (Chapter 7)
NSF DUE ; Wen M. Andrews J. Sargeant Reynolds Community College Richmond, Virginia.
Dashboarding with IBM Cognos Cognos User Group August 5th 2011.
THE UNIVERSITY OF TEXAS AT AUSTIN School of Information Marie Hwang INF 385T: Knowledge Management Systems February 18, 2003 Week 6: .
On Using SIFT Descriptors for Image Parameter Evaluation Authors: Patrick M. McInerney 1, Juan M. Banda 1, and Rafal A. Angryk 2 1 Montana State University,
Introduction of Geoprocessing Lecture 9 3/24/2008.
We looked at these two presentations and talked about the structure of setting up the table.
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Friday, 10/4 Objective: In what ways do humans adorn their bodies? How is the way we perceive a person affected by their physical appearance and adornment.
Motivation Conclusion Effective Access Over Public Conversations William Lee, Hui Fang and Yifan Li University of Illinois at Urbana-Champaign Clustering.
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
Visualization Design Principles cs5984: Information Visualization Chris North.
PBL Project Based Learning. What is PBL? PBL is a model for classrooms that emphasizes long- term, interdisciplinary and student-centered activities.
Entity Relationship Diagram
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
Personalized Social Image Recommendation
Multi-Dimensional Data Visualization
Lesson Title: Famous “Techie” Research Project Grade Level: 6-8
Presentation transcript:

Pixel Visualization of keyword search results in large databases. Jay Koven Fall 2013

Research Overview ● The problem: Both criminal and Civil investigations are being over with with information in the cyber age. ● New techniques are needed to handle the overload ● Visualization of data can provide solutions

The Investigative Problem ● Datasets are rapidly growing in size for all types of investigation ○ National Security ○ Criminal ○ Civil ● The Datasets ○ Most investigations focus on communications ○ s are the largest portion of these communication ○ Chats, IM, Phone logs and other social communication channels are also becoming important.

Related Research ● Jigsaw ○ Open Source Investigative tool kit being developed at Georgia Tech. ○ Focus on entity relationships and time relationships ○ Views are traditional

Related Research continued ● Daniel Keim ○ Pixel oriented display visualization ○ Large amounts of data can be viewed at once ○ Alternative display methodologies ○ Personal mailbox analysis

Related Research continued ●Other Visual analysis Techniques ○ Time SFU Vancouver ■ Plots relationships overtime by sender or by threads ● Run on Enron dataset ● Not sure why ○ Thread arcs - IBM ■ Traces a single thread using arcs to show trends ● Interactive, highlights individuals, can highlight attributes ● Used to analyze trends ○ Graphs and maps ■ Show relationships but not very useful for Ultra large datasets

Related Research continued ●Chris North - Use of Large Displays ○ Not specific to but useful thoughts ●W. Bradford Paley - Textarc ○ Relationships of words in a concordance ○ Images behind my proposal

My proposed research ●Pixel Visualization of Large Datasets ○ Search by Keywords ○ Multiple displays of returned sets ■ Entity - Entity ■ Entity - Keyword ■ Keyword - Time ■ Entity - Time ○ Interaction to Refine Search ■ Add / Remove Keywords ■ Add / Remove Entities ■ Limit time frame ○ Interaction to Drill Down to actual messages ■ By Subject ■ By Message Content

Key issues to be solved for investigative visualization of s ●Relative weights of s must be calculated against some standard ●Visualizations should minimize the distance of related s between points to show important clusters around entities, keywords and time.

My proposal - “Document Galaxy” ●Basic idea is to treat documents as stars in a circular galaxy ○ Place relevant data points, such as entities, around outside with associated weights. ○ Place documents inside galaxy based on relative “attraction” to outside points. ●Possible to have multiple outside rings to add additional attributes to calculations ●User interacts with outside rings to add / remove / move attraction points. ●User can explore contents of inner points and clusters to derive information about document content. ●Colors of documents can used to show additional attributes

Might look something like this

What use is this? ●Might make a good lead in tool to add to jigsaw as a lead in to reduce size of document set to be explored ●Separate tool for exploring e-discovery datasets