Big Data Using Big Data for Cultures and Communities Jeremy Reffin Simon Wibberley CASM, University of Sussex Carl Miller CASM, Demos July 2014.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

AS ICT Finding your way round MS-Access The Home Ribbon This ribbon is automatically displayed when MS-Access is started and when existing tables.
Visualise | communicate | ENGAGE Instant Atlas™ is a registered trademark of GeoWise Limited ©Copyright 2008 | Geowise Limited IA Desktop to LIS Solution.
©2011 1www.id-book.com Data analysis, interpretation and presentation Chapter 8.
User Interface Design Yonsei University 2 nd Semester, 2013 Sanghyun Park.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
1 Knowledge Management Session 4. 2 Objectives 1.What is knowledge management? Why do businesses today need knowledge management programs and systems.
Creativity Design and Cognition Gopal Kaushik – Rohit Sureka.
CS 5764 Information Visualization Dr. Chris North.
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
ETT 429 Spring 2007 Inspiration. Left Brain vs. Right Brain  Left Brain Logical Logical Sequential Sequential Rational Rational Analytical Analytical.
Version 4 for Windows NEX T. Welcome to SphinxSurvey Version 4,4, the integrated solution for all your survey needs... Question list Questionnaire Design.
Chapter 2: Business Intelligence Capabilities
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
XP Information Information is everywhere in an organization Employees must be able to obtain and analyze the many different levels, formats, and granularities.
Databases & Data Warehouses Chapter 3 Database Processing.
UKOLN is supported by: Delivering (e-)services to the community : who, what, where, when, how? Dr Liz Lyon, UKOLN, University of Bath, UK SWMLAC ICT Masterclass.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Free e-Sources for English Language Teachers by Wallace Barboza Carolina TESOL December 6th, 2008 Charleston, SC.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
System Analysis & Design Chapter VII: User Interface Design Providing interactive and easy to use interfaces is an important task of system designer using.
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
Introduction to Information Retrieval CS 5604: Information Storage and Retrieval ProjCINETViz by Maksudul Alam, S M Arifuzzaman, and Md Hasanuzzaman Bhuiyan.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
Data analysis, interpretation and presentation
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
LAB CVP 2009 ‘Leveraging the LIMS Investment’. Invested in a Laboratory Information Management System (LIMS) Solution is limited to Storing and Reporting.
FAMILY AND CHILDREN’S TRUST FUND (FACT) RESEARCH AND DATA MATERIALS.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
MIS – 3030 Business Technologies Social Media & Conversation Big Data.
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Information Systems & Enhancing Decision Making for the Digital Firm
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
2006 Census of Population and Dwellings Proposed Products and Services.
ITGS Databases.
Analyzing Data with Advanced Visualizations
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
Creating A Worksheet and Embedded Chart Chapter 1.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Sketches and prototypes for the Orlando Six Degrees of Separation Project.
Our simulation is based on Chris Starnes. original work by Reynolds [8] on the simulation of flocks of birds (or ‘Boids‘) in a manner not subject to the.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Knowledge Management in Theory and Practice
DATA OUTPUT  maps  tables. DATA OUTPUT output from GIS does not have to be a map many GIS are designed with poor map output capabilities types of output:
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Data Visualization Data visualization is the presentation of data in a pictorial or graphical format. For centuries, people have depended on visual representations.
WORDLE AND NATIONAL PARK PRESENTATION BY:. WORDLE Wordle is a toy for generating ‘ word clouds ’ from text that you provide. The clouds give greater prominence.
introductionwhyexamples What is a Web site? A web site is: a presentation tool; a way to communicate; a learning tool; a teaching tool; a marketing important.
Twitter Community Discovery & Analysis Using Topologies Andrew McClain Karen Aguar.
UNEP Live. What is UNEP Live? - An on-line knowledge management platform - Focuses on open access to global, regional and national data and knowledge.
1 INTRODUCTION TO COMPUTER GRAPHICS. Computer Graphics The computer is an information processing machine. It is a tool for storing, manipulating and correlating.
Data mining in web applications
Unit 2: Lesson 11 & 12 Making Data Visualizations
Data Visualizer.
Quantitative and qualitative
Inquiry, Pedagogy, & Technology: Automated Textual Analysis of 30 Refereed Journal Articles David A. Thomas Mathematics Center, University of Great Falls,
Data analysis, interpretation and presentation
Data analysis, interpretation and presentation
Text Mining with JMP Pro 13: A Case Study
(VIP-EDC) Point 6 of the agenda
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Application Software EIT, © Author Gay Robertson, 2016.
Data analysis, interpretation and presentation
CHAPTER 7: Information Visualization
Presentation transcript:

Big Data Using Big Data for Cultures and Communities Jeremy Reffin Simon Wibberley CASM, University of Sussex Carl Miller CASM, Demos July 2014

Overview Emergence of “Big Data” “Open Data” initiatives Open source tools and processor power Analytics culture in decision making

Overview Emergence of “Big Data” Environment becoming saturated with digital devices that record data Social lives becoming increasingly digital (e.g. 500 million tweets / day) Access to big data now widespread through internet

Overview “Open Data” initiatives Key indices / measures have long been collected but sequestered Access to key data now widespread through internet Culture of “what gets measured gets managed” led to collection of more indices / measures Political initiatives to make data more widely available (“open”) as a market tool

Overview Open source tools and processor power Sophisticated data processing and visualisation tools now in the hands of most users

Overview Analytics culture in decision making 1980s: rise of data-driven approach to decision-making in business management Trend increasingly influenced formation of public policy Now potentially useful data accessible to community-level initiatives

Overview Emergence of “Big Data” “Open Data” initiatives Open source tools and processor power Analytics culture in decision making SOURCES TOOLS

Sources Social MediaCommunity Information Local Government Central Government Corporate

Sources Social MediaCommunity Information Local Government Central Government Corporate Availability Subject Matter UbiquitousIsolated SocialFactual Focus GeneralSpecific

Sources Social MediaCommunity Information Local Government Central Government Corporate Availability Subject Matter Focus Degree of Structure Degree of Oversight UbiquitousIsolated SocialFactual GeneralSpecific UnstructuredStructured Not curatedOften curated

Sources Social MediaCommunity Information Local Government Central Government Corporate Availability Subject Matter Focus Degree of Structure Degree of Oversight High Volume Open Access UbiquitousIsolated SocialFactual GeneralSpecific UnstructuredStructured Not curatedOften curated DependsNoSometimes NoSometimes Rarely

Sources Social MediaCommunity Information Local Government Central Government Corporate Availability Subject Matter Focus Degree of Structure Degree of Oversight UbiquitousIsolated SocialFactual GeneralSpecific UnstructuredStructured Not curatedOften curated DependsNoSometimes NoSometimes Rarely High Volume Open Access

Tools Extract & process Analyse Visualise

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise APIs / Downloadable data sets Analytics tools (Excel, Google Analytics)

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

Wordle Wordle is a tool for generating “word clouds” from text that you provide. Word clouds give greater prominence to words that appear more frequently in the source text The word clouds can be customised with different fonts, layouts, and color schemes See: wordle.net

Text is Beautiful This resource creates interactive word clouds, concept webs, and correlation wheels -In a concept web, the position of concepts matter; those more related appear near each other. Related concepts are grouped into themes, denoted by colour -A correlation wheel visualises concepts that are correlated with each other. Two concepts are correlated if they appear together in the text often and appear apart rarely See: textisbeautiful.net

Circos Circos extends the “correlation wheel” concept into much more sophisticated data visualization using a cicular layout. Originally designed for genomic data, it produces attractive graphics that are particularly effective for displaying pairwise interactions in general and ”flows”in particular See: circos.ca

Circos (continued) Circos is often used to convey complex data in a condensed format Circos is relatively simple to use but is not ‘point and click’ It requires comfort / familiarity with ‘scripting’ approaches See: circos.ca

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

gephi Gephi visualises graphs. A graph is a representation of a set of objects where there are pairwise relationships between some of these objects. Any network can be usefully represented as a graph – as can most data with structural relationships between elements Gephi sits on the interface between data visualisation and data analysis. It has sophisticated tools for exploring data and analysing, filtering, clustering, manipulating and exporting it See: gephi.github.io

gephi: example output

tableau Tableau provides sophisticated tools for analysis and visualisation of data stored in a spreadsheet or similar data format. It can create interactive See: tableausoftware.com

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

Integrated analysis: DataShine Provides interactive mapping visualisation of UK Census data Source: datashine.org.uk

Sources x Tools Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise

Social MediaCommunity Information Local Government Central Government Corporate Extract & process Analyse Visualise Unstructured Not curated High Volume Open Access General Method51

Twitter Boolean term scraper Relevancy Classifier Pattern Classifier (1) Pattern Classifier (2) Output 1 Output 2 Extract & process Analyse Visualise Method51 is a framework for collecting, analysing, and understanding Twitter data sets It helps users to locate tweets relevant to a precise topic of interest (i.e. separate the ‘wheat from the chaff’) and to gain the best possible insight into what is being said about that topic This is achieved using chains of classifiers – devices for placing tweets into different categories based on the words that they contain