Big and Open Data: Challenges and Issues

Slides:



Advertisements
Similar presentations
Stories from the field: studying urban, poor micro-entrepreneurs Helani Galpaya Dhaka, 30 April 2014 This work was carried out with the aid of a grant.
Advertisements

UN workshop for South Asian countries on collection & dissemination of socio-economic data from population & housing censuses New Delhi,28-31 may 2012.
Office of Rural Affairs High Speed Communications Cris Fulford Office of Lieutenant Governor Rebecca Skillman One North Capitol, Suite 600 Indianapolis,
Baseline knowledge about ICT in Nepal Rohan Samarajiva Nagarkot, March 2015 This work was carried out with the aid of a grant from the International.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Frank Yu Australian Bureau of Statistics Unstructured Data 1.
Workshop on Energy Statistics, China September 2012 Electricity and Heat Statistics 1.
Current and future, business and consumer applications of LBS technology Andrew Grill, Mobile Advertising Evangelist British.
Integrating ICT questions in the work of National Statistical Organizations Harsha de Silva Lead Economist, LIRNEasia WDR Expert Forum.
Traffic Characteristics and Communication Patterns in Blogosphere A brilliant and insightful analysis of the access methods of the blogosphere community.
Origin-Destination matrix estimation in Sri Lanka using mobile network big data Danaja Maldeniya, Sriganesh Lokanathan and Amal Kumarage (Phd) 13th International.
25 Need-to-Know Facts. Fact 1 Every 2 days we create as much information as we did from the beginning of time until 2003 [Source]Source © 2014 Bernard.
ONS Big Data Project. Plan for today Introduce the ONS Big Data Project Provide a overview of our work to date Provide information about our future plans.
 Digital marketing: Uses digital media to develop communications and exchanges with customers  Electronic media (E-marketing): Refers to the strategic.
Big Data A big step towards innovation, competition and productivity.
Welcome to CMPE003 Personal Computer Concepts: Hardware and Software Winter 2003 UC Santa Cruz Instructor: Guy Cox.
SMARTCARDS. What we’ll cover: How does the Smart Card work (layout and operating system)? Security issues for the card holder The present and future of.
8/14/2015 SMS Exchange: Mobile credits as an electronic payment channel. Case of RWANDA Jean Pierre NSHIMIYIMANA Rwanda Utilities Regulatory Agency & Independent.
Basic Marketing Research Customer Insights and Managerial Action
AS Level ICT Selection and use of input devices and input media: Capturing transaction data.
Smart Cities & Smart Utility
Internet Usage in Pakistan & E-Marketing Potential Instructor: Hanniya Abid Lecture 2 E-Marketing.
The Information System Opportunity
What is the significance of ICTs to legislators? Rohan Samarajiva Yangon, 26 July 2014 This work was carried out with the aid of a grant from the International.
© 2012 IBM Corporation IBM Corporate Service Corps Developing Global Leaders for the 21st Century.
Charles Tappert Seidenberg School of CSIS, Pace University
Interactive Market
E-SCM Developments in Hong Kong Opening address by Mrs. Carrie Yau Secretary for Information Technology and Broadcasting.
Mobile Apps For Small Businesses Your customers are mobile. Is your business?
Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan,
Globalization. Hmmm…. How do you think technology effects globalization?
Task 1 Research on any 2 of the following: Online shopping Online banking Web broadcasting Social networking sites Discuss the disadvantages and advantages.
BRIDGING THE DIGITAL DIVIDE A Basic Understanding.
MIS – 3030 Business Technologies Social Media & Conversation Big Data.
Information Society Innovation Fund (ISIF) Grants Program Paul Wilson APNIC 27.
A Digital Vision for Scotland Dr Trudy Nicolson Head of Broadband Policy Scottish Government 27 March 2013.
European Board of National Archivists 18 November 2010 Business archives in the UK Nick Kingsley.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Big Data for Development: New opportunities for emerging markets Presentation to Access to Information Unit, Bangladesh Prime Minister’s Office, Dhaka.
Systems that support electronically executed business transactions.
Innovation Work Circle: Big Data Presented By: Innovation Work Circle Group.
Review 2 Chapters 4, 5, 6. What is the Internet? Global network, a network of networks.
Industry Outlook November Manufacturing Matters in Canada  A $620 billion industry  12% of GDP (18% in 2004)  1.7.
Internet Studies. Faculty Members The specialty has now 2 faculty members Prof. Ronen Feldman: Text Mining, Data Mining, Social Media Analysis, Information.
Distribution Plan Week-8 Tutorial 12/19/2015Dr. Yuvaraj 1.
Defining Social Media Social Media Marketing Communications Digital Marketing Characteristics Types of Internet Advertising Mobile Marketing Social Behavior.
1 Unstructured Data (UD) What is unstructured data? How is it statistically valuable? Challenges of turning UD into information.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Geographic Information & Society: Some things to think about GEOG 370 Christine Erlien.
Using mobile network big data for land use classification Kaushalya Madhawa, Sriganesh Lokanathan, Danaja Maldeniya, Rohan Samarajiva CPRsouth 2015 Taipei.
Questions When have you used GPS? GPS technology uses satellites to pinpoint position on Earth with the aid of a GPS device or unit Have you ever used.
Nico Heerschap, Luxembourg, 2015 Mobile positioning and other ‘big’ data for tourism statistics Experience Statistics Netherlands.
Positioning geospatial information to address global challenges Positioning Geospatial Information to Address Global Challenges Greg Scott Inter-Regional.
Big Data Javad Azimi May First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.
Ayubowan. Statistical Overview of the Telecommunications Sector in Sri Lanka A S W Bandusiri Statistical Officer TRCSL Tel: Fax:
MarketsandMarkets Presents Cross-Platform and Mobile Advertising Market worth $76.57 Billion by 2018
Unlock your Big Data with Analytics and BI on Office365 Brian Culver ● SharePoint Fest Seattle● BI102 ● August 18-20, 2015.
Big Data for Measuring the Information Society INTERNATIONAL TELECOMMUNICATION UNION BIG DATA PROJECT - INNOVATIVE WAYS TO UTILIZE BIG DATA AS A NEW DATA.
Big Data and Official Statistics: Philippine Context Erniel B. Barrios.
MarketsandMarkets Presents Heavy Construction Equipment Market worth $195.0 Billion by 2018
BaiRong Financial Information Services, Ltd. (100Credit.com)
Big Data: Analytics for public purposes
Data Partnerships: LIRNEasia’s experience in Sri Lanka
BIG Data 25 Need-to-Know Facts.
Challenges in making India a cashless economy.
Published: Aug 2017 Single User PDF: US$ 2500 No. of Pages: 499
Information Society Innovation Fund (ISIF) Grants Program
United Nations Development Account 10th Tranche Statistics and Data
Sample Analytics Categories
Big Data in Official Statistics: Generalities
Presentation transcript:

Big and Open Data: Challenges and Issues Sriganesh Lokanathan Team Leader – Big Data Research, LIRNEasia This work was carried out with the aid of a grant from the International Development Research Centre, Canada and the Department for International Development UK..

Big data An all-encompassing term for any collection of data sets so large or complex that it becomes difficult to process using traditional data processing applications. Challenges include: analysis, capture, curation, search, sharing, storage, transfer, visualization, and privacy violations. Examples: 100 million Call Detail Records per day generated by Sri Lanka companies 45 Terabytes of data from Hubble Telescope

Why big data? Why now? Proximate causes Increased “datafication”: Very large sets of schema-less (unstructured, but processable) data now available Advances in memory technology: No longer is it necessary to archive most data and work with small subset Advances in software: MapReduce, Hadoop

There are many potential sources of big data in an economy….. Administrative data E.g., digitized medical records, insurance records, tax records Commercial transactions (transaction-generated data) E.g., Stock exchange data, bank transactions, credit card records, supermarket transactions connected by loyalty card number Sensors and tracking devices E.g., road and traffic sensors, climate sensors, equipment & infrastructure sensors, mobile phones communicating with base stations, satellite/ GPS devices Online activities/ social media E.g., online search activity, online page views, blogs/ FB/ twitter posts

….but currently only mobile network big data has broad population coverage Mobile SIMs/100 Internet users/100 Facebook users/100 Myanmar 13 1 4 Bangladesh 67 7 6 Pakistan 70 11 8 India 71 15 9 Sri Lanka 96 22 12 Philippines 105 39 41 Indonesia 122 16 29 Thailand 138 46 Source: ITU Measuring Information Society 2014; Facebook advantage portal

Construct Behavioral Variables Mobile network big data + other data  rich, timely insights that serve private as well as public purposes Mobile network big data (CDRs, Internet access usage, airtime recharge records) Other Data Sources 1. Data from Dept. of Census & Statistics 2. Transportation data 3. Health data 4. Financial data 5. Etc. Dual purpose insights Private purposes 1. Mobility & location based services 2. Financial services 3. Richer customer profiles 4. Targeted marketing 5. New VAS Public purposes 1. Transportation & Urban planning 2. Crises response + DRR 3. Health services 4. Poverty mapping 5. Financial inclusion Construct Behavioral Variables 1. Mobility variables 2. Social variables 3. Consumption variables

What can we do with such data? Since 2012, LIRNEasia has been working with mobile network big data, having obtained historical and pseudonymized data from multiple operators in Sri Lanka Covering nearly 50% of population

Population density changes in Colombo region: weekday/ weekend Pictures depict the change in population density at a particular time relative to midnight Weekday Time 18:30 Time 12:30 Time 06:30 Sunday Decrease in Density Increase in Density

%age of Colombo’s daytime population 46.9% of Colombo City’s daytime population comes from the surrounding regions Colombo city is made up of Colombo and Thimbirigasyaya DSDs Home DSD %age of Colombo’s daytime population Colombo city 53.1 Maharagama 3.7 Kolonnawa 3.5 Kaduwela 3.3 Sri Jayawardanapura Kotte 2.9 Dehiwala 2.6 Kesbewa 2.5 Wattala Kelaniya 2.1 Ratmalana 2.0 Moratuwa 1.8

We can exploit the diurnal base station signatures to understand land use patterns Highly commercial Highly residential Mixed-use

We can develop new proxy measures of economic activity Estimated log(wage) High Estimated Wage Low Estimated Wage

Understanding the geo-spatial extent of communities The 11 detected communities The 9 provinces

So what are the challenges and issues?

Low levels of ‘datafication’ Big AND OPEN data doesn’t really exist in Sri Lanka But even if there were large open data sets easily available there are several issues Diagram source: Joel Gurin in http://www.theguardian.com/public-leaders-network/2014/apr/15/big-data-open-data-transform-government

Issues Standardization Accountability & liability Data and analytical literacy Private sector versus public sector data Competitive industries versus monopolies Privacy