United Nations Economic Commission for Europe Statistical Division NTTS 2015 – Satellite Workshop on Big Data March 9, 2015 Computing Energy Consumption.

Slides:



Advertisements
Similar presentations
The UK Census: future directions Peter J Fullerton Administrative Sources and Integration Division.
Advertisements

Attitude survey Maria Rugina ICEMENERG-ROMANIA. Good practice An initiative which has already proved successful and which has the potential to be transferred.
Introduction Build and impact metric data provided by the SGIG recipients convey the type and extent of technology deployment, as well as its effect on.
TRUST Fall Meeting November , 2010 │Stanford, California A Privacy-Aware Architecture For Demand Response Systems Steve Wicker, Bob Thomas School.
Daisuke Mashima and Arnab Roy Fujitsu Laboratories of America, Inc. Privacy Preserving Disclosure of Authenticated Energy Usage Data.
| March 26, 2013 National Summit on Integrating Energy Efficiency & Smart Grid HEMS and Smart Meter Integration.
Implementing the SET-plan proposed Energy Efficiency Directive The proposed Directive establishes a common framework for promoting energy efficiency in.
Smart Grid, Data and Behaviour – Privacy and Security Issues - Potential for Secure Computation Lexpert Seminar December 9, 2013David Young, Partner.
A Practical Smart Metering System Supporting Privacy Preserving Billing and Load Monitoring Hsiao-Ying Lin National Chiao Tung University Joint work with.
System Design and Analysis
SINTEF Energy Research 1 Remodece meeting January 2007 Nicolai Feilberg.
United Nations Economic Commission for Europe Statistical Division NTTS 2015 – Satellite Workshop on Big Data March 9, 2015 The Big Data Project – The.
ONS Big Data Project. Plan for today Introduce the ONS Big Data Project Provide a overview of our work to date Provide information about our future plans.
Meeting of the Management of Statistical Information System (MSIS 2014) Innovation – Open Data Initiative of Government of India – Fostering Innovations,
Smart grids and intelligent homes: eBay electricity. Stuart Williams. Global Lead Consultant – Sustainability Logica.
Patricia de Suzzoni, Chair of ERGEG Customer Working Group Citizens’ Energy Forum, London, September 2009 Regulatory aspects of smart metering in.
Balance of Payments Collection and Compilation 23 Feb 2012 Central Statistics Office Ireland.
ESTAT International Seminar on Modernizing Official Statistics: Meeting Productivity and New Data Challenges Tianjin, People’s Republic of China
CAMP Med Mapping HIPAA to the Middleware Layer Sandra Senti Biological Sciences Division University of Chicago C opyright Sandra Senti,
FUTURE IN ENERGY. The biggest co-generation high efficiency power plant in Romania, built in Suceava Investment value: over EUR 90 mil Used fuels: natural.
Introduction to Systems Analysis and Design Trisha Cummings.
CPSC203 Introduction to Computers Lab 69 By Jie Gao.
Prepared By :.  Introduction  Techniques Used  Case Study  Advantages  Application  Conclusion OUTLINE.
Energy Management System Industrial Controls & Drives (India) Private Limited Chennai
Big Data Quality, Partnerships and Privacy Teams.
SMART METER TEXAS Status Update June 3, AGENDA Release 1 Smart Meter Texas Online Portal Update – SMT Solution Update – Registration Statistics.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
OAP Assessment Thursday, March 6, 2008 Team: Giant Slayers Team Members: Drew Bennett, Christopher Hain, Nick Liao, Jeffrey Spehar, Eric Wan.
Privacy Engineering for Digital Rights Management Systems By XiaoYu Chen.
Monitoring the acquisition process by web widgets Leonardo Tininini and Antonino Virgillito ISTAT Meeting on the Management of Statistical Information.
Brussels Workshop Use case 3 11/09/2015 Mario Sisinni.
United Nations Economic Commission for Europe Statistical Division UNECE Workshop on Consumer Price Indices Istanbul, Turkey,10-13 October 2011 Session.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
Joint UNECE / Eurostat meeting on Population and Housing Censuses 7-9 July 2010, Geneva Disseminating Census information to maximise use and value Keith.
1 Demand Response: An international perspective March 3, 2008 California Energy Commission Richard Schomberg VP research EDF North America Gridwise Arch.
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
Tier 2 Power Supply Planning Workshop Advanced AMI Benefits Overview.
Innovation Work Circle: Big Data Presented By: Innovation Work Circle Group.
Unit 5 Advanced Databases The Purpose and features of a relational database.
IT Architectures for Handling Big Data in Official Statistics: the Case of Scanner Data in Istat Gianluca D’Amato, Annunziata Fiore, Domenico Infante,
United Nations Economic Commission for Europe Statistical Division UNECE Big Data Work Steven Vale UNECE
Prepared by Andrew Robinson, Assistant Statistician Presented by Melinda Williams, Director, at 8 th Regional Statistical Research Seminar October 30,
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Firmware - 1 CMS Upgrade Workshop October SLHC CMS Firmware SLHC CMS Firmware Organization, Validation, and Commissioning M. Schulte, University.
Workshop on Big Data. Agenda  Introduction  Results of 2014 Big Data Project  Plans of international organisations  Parallel discussions (Soapbox!)
United Nations Economic Commission for Europe Statistical Division Big Data Sandbox Antonino Virgillito Project manager “Big Data Project” UNECE Head of.
Modelling sample data from smart-type meter electricity usage Susan Williams NTTS Conference, March 2015, Brussels.
Building a contactless university examination system using NFC Speaker : Chih-Ching Chen Advisor : Dr. Ho-Ting Wu 2013/12/2 1.
United Nations Economic Commission for Europe Statistical Division International Collaboration to Modernise Official Statistics Steven Vale UNECE
Modernising Statistical Production: Modernising Statistical Production: Main recommendations from global assessments 7 th SPECA PWG on Statistics
MyHealth Journal: a User-Customizable Diary Software for Health Soufiane Berouel, Undergraduate Student Supervised by Prof. Lily Liang Department of Computer.
Globus Data Storage Interface (DSI) - Enabling Easy Access to Grid Datasets Raj Kettimuthu, ANL and U. Chicago DIALOGUE Workshop August 2, 2005.
Workshop on Big Data – 9 March 2015 Parallel discussions – Technology Focus.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Achievements and Plans of the High-Level Group for the Modernisation of Official Statistics.
Expanding the Role of Synthetic Data at the U.S. Census Bureau 59 th ISI World Statistics Congress August 28 th, 2013 By Ron S. Jarmin U.S. Census Bureau.
Web Scraping for Collecting Price Data: Are We Doing It Right?
Elhub - Electricity data hub
Presentation to the Energy Data Summit Part 1
current PicUp capabilities and expected performance from SPIS
The Sandbox Task Team Antonino Virgillito Project Consultant, UNECE.
SpatialHadoop: A MapReduce Framework for Spatial Data
Scalable Policy-awarE Linked Data arChitecture for prIvacy, trAnsparency and compLiance H2020-ICT Big Data PPP: privacy-preserving Big Data technologies.
Use Cases CS/SWE 421 Introduction to Software Engineering Dan Fleck
File Management.
Smart Meter Data Privacy: A Survey
ELEC-E Smart Grid Smart Meters and Security Issues
Presentation transcript:

United Nations Economic Commission for Europe Statistical Division NTTS 2015 – Satellite Workshop on Big Data March 9, 2015 Computing Energy Consumption from Smart Meters Data Antonino Virgillito Project Consultant, UNECE Istat

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Introduction Smart meters are electronic meters that enable automated collection of electricity consumption data of households and small businesses Smart meter data is a good example of Big Data – High volume and high velocity – Low variety: highly structured Good potential for statistics – Instead of surveying individual utilities or households, we could collect the data directly from the smart meter entity, which would greatly reduce response burden. 2

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Objectives Showing the feasibility of computing statistics on energy consumptions starting from smart meters data Testing how to handle privacy-sensitive data in a shared environment Testing how to aggregate data using Big Data tools available in the Sandbox 3

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 The Team 4 Lily Ma Andrew Murray Antonino Virgillito Marco Puts Stephen Ball

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 The Datasets Irish Data – Real data, privacy sensitive – Power consumption of 6500 households over 1 year – 160 million records, 2.5Gb Canadian Data – Synthetic data generated from real counterpart – Power consumption of households over 1 month – 16 million records, 1Gb 5

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Handling Privacy Issues – Irish data Irish dataset could not be released freely under the terms of Irish legislation Proper precautions had to be taken to move and store the data to the Sandbox – Datasets were stored on a USB key (encrypted and protected with a password) only handled by the Irish institute representative (Andrew) – A directory was created on the Sandbox and access permission was granted only to team members – Andrew and Toni transferred the data from Toni’s computer to the Sandbox via FTP – The data was removed from the computer right after the completion of the operations 6

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Handling Privacy Issues – Canadian Data There were no possibilities of moving the Canadian datasets outside the boundaries of StatsCan Lily implemented a method to alter the data in order to remove each reference to the real data and change the values of the measurements Resulting statistics were detached from the real numbers although maintaining a realistic distribution 7

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Experiment Details Data aggregation was carried out in the Sandbox environment using the Pig tool The full datasets were aggregated in a single pass, computing the power consumption at hourly level – Script was only 4 lines long Aggregation performance was satisfactory – 2.5Gb aggregated in less than 2 minutes Since the Pig language does not natively define statistical functions a third-party extension (developed and freely made available by LinkedIn) was loaded and used – User Defined Function can indefinitely extend the power of the Pig language – Several useful functions freely available. Aggregated data was processed again in R in order to produce visualizations – Tools used: Processing and Pentaho 8

NTTS 2015 March 9, 2015 Visualizations Weekly consumption per hour of day over a year (IE) winter summer mid-seasons 9

NTTS 2015 March 9, 2015 Visualizations 10 Hourly consumption per day (CAN)

The Role of Big Data in the Modernisation of Statistical Production and Services NTTS 2015 March 9, 2015 Conclusions and Findings We proved that data from smart meters could potentially be used to compute statistics on energy consumption easily and at a very detailed level Key issue is data availability and privacy – Is this approach feasible in production? Technology findings: – Test of big data tools with positive results – Reuse of methods: quickly wrote aggregation scripts that could be used on both datasets Privacy findings: – two ways of overcoming privacy issues The use of synthetic data sets can enable working on common environments and sharing methods and techniques 11