Presentation is loading. Please wait.

Presentation is loading. Please wait.

Big Data Gulriz Kurban.

Similar presentations


Presentation on theme: "Big Data Gulriz Kurban."— Presentation transcript:

1 Big Data Gulriz Kurban

2 Four Vs of Big Data Volume Variety Velocity Veracity

3

4 Volume IRS: (Economic Mobility) During fiscal year 2017, the IRS processed more than 245 million tax returns and other forms and issued more than 121 million individual income tax refunds.

5

6 Velocity

7 Variety Structured Data: Databases
Unstructured Data: Text, Images, Audio and Videos

8 Structured Data: Databases
Sale Data: Netflix (118M subscribers), Amazon (Recommender systems) Electronic Health Data: Demographic information, diagnosis codes. Genotype data: SNPs

9 Unstructured Data: Text (NLP)
Social Networks: Twitter, Facebook (Topic modeling) News feeds Health Data: Doctor’s notes Scientific Publications (drug toxicity) Books (digital humanities)

10

11 Unstructured Data: Images
Instagram images Companies: Shutterstock Satellite Images

12

13 Kevin Matzen, Kavita Bala, and Noah Snavely at Cornell University in Ithaca, New York

14 Unstructured Data: Audio and Videos
Enhanced organization and search using the video content: process videos frame-by-frame analyze gestures transcribe spoken language use facial recognition (YouTube, IBM Cloud Video, thuuz.com – personalized sports highlights, Alexa)

15 Veracity: Biases and noise in data
Bias: Data mining social networks for election and referendum predictions Noise: Electronic health records data entry

16 Big Data and Poverty

17 Big Data and Poverty Low-income communities are among the most surveilled communities in America: public-benefits programs child-welfare systems monitoring programs for domestic-abuse offenders


Download ppt "Big Data Gulriz Kurban."

Similar presentations


Ads by Google