Ungraded quiz Unit 2.

Slides:



Advertisements
Similar presentations
Business Intelligence Systems
Advertisements

Nokia Technology Institute Natural Partner for Innovation.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Exploratory Research & Secondary Data
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Charles Tappert Seidenberg School of CSIS, Pace University
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Data Mining By Dave Maung.
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
Vector and symbolic processors
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Practical Lessons of Data Mining at Yahoo! Presenter: Jun-Yi Wu Authors: Ye Chen, Dmitry Pavlov, Pavel.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Mining of Massive Datasets Edited based on Leskovec’s from
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
Book web site:
Week 3: Ungraded review questions. Can you explain your answer?  Use your fingers to indicate your answer: 1=A, 2=B, 3=C, 4=D. For “check all that apply,”
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Accelerated B.S./M.S An approved Accelerated BS/MS program allows an undergraduate student to take up to 6 graduate level credits as an undergraduate.
Archival research: Ungraded review questions
Week 1: Ungraded review questions
Survey research: Ungraded review questions
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
Data Mining, Data Science, Big Data
Classification of Research
Text Based Information Retrieval
Ungraded quiz Unit 6.
Ungraded quiz Unit 3.
Introduction to big data
Introduction Data Mining for Business Analytics.
Big Data - in Performance Engineering
Ungraded quiz Unit 1.
Information from Samples
Ungraded quiz Unit 1.
Ungraded quiz Unit 7.
Ungraded quiz Unit 5.
Quantitative design: Ungraded review questions
Quantitative design: Ungraded review questions
A Unifying View on Instance Selection
Ungraded quiz Unit 4.
Pre-Activity: 1. Recap? 2. Research Says?
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
Big Data Young Lee BUS 550.
Dep. of Information Technology By: Raz Dara Mohammad Amin
Big DATA.
Best in Class Reporting to What’s Next: A/I Driven Insights
Quantitative design: Ungraded review questions
Ungraded quiz Unit 8.
Ungraded quiz Unit 2.
Ungraded quiz Unit 7.
Ungraded quiz Unit 4.
Ungraded quiz Unit 3.
Ungraded quiz Unit 6.
Ungraded quiz Unit 6.
Ungraded quiz Unit 5.
Ungraded quiz Unit 7.
Ungraded quiz Unit 1.
Ungraded quiz Unit 3.
Ungraded quiz Unit 1.
Ungraded quiz Unit 4.
Ungraded quiz Unit 6.
Ungraded quiz Unit 11.
Ungraded quiz Unit 1.
An Introduction to Data Science using Python
Ungraded quiz Unit 3.
Ungraded quiz Unit 9.
Ungraded quiz Unit 8.
Presentation transcript:

Ungraded quiz Unit 2

Show me your fingers Do not shout out the answer, or your classmates will follow what you said. Use your fingers One finger (the right finger) = A Two fingers = B Three fingers = C Four fingers = D No finger = I don’t know. I didn’t study

Which of the following is untrue? Business intelligence and data mining are specific applications of data science. Data science necessitates big data analytics. Machine learning is a specific application of artificial intelligence. Data science requires domain knowledge, statistics, computer programming, and database design.

Which of the following is not an advantage of data mining? Most data mining methods are non-parametric It is data-driven and exploratory, and thus it reduces confirmation bias It uses resampling-based ensemble methods rather than counting on one single analysis Unlike the alpha level, the absolute cut-off of data mining methods (e.g. AIC, BIC) is more objective.

What are the shortcomings of conventional data? Sampling bias (e.g. WEIRD) In experimental setting the participants might behave according to social desirability (e.g. the dictator game) In survey the respondents might not be able to recall the exact information. All of the above

Which of the following statement is untrue? By utilizing Google search-term data, the author of “Everybody lies” found that many Americans lied about their criteria for voting By utilizing Google search-term data, Google researchers developed a highly accurate model for predicting the outbreak of influenza. Two researchers claimed that a personality model based on Facebook data can accurately predict what people like. FTC investigated Facebook because Facebook handed over data to Cambridge Analytica without notifying Facebook users.

Which of the following is not a component of data science? Data architecture Data acquisition Data analysis Data ethics

Which of the following is not a characteristic of big data? High volume High velocity High variety High vector

Which of the following about data mining is untrue? Data mining includes mining structured numeric data and unstructured textual data Data mining utilizes machine learning Data mining necessitates pattern seeking Data mining is so named because the insight is buried and thus it takes filtering and extraction to learn about the data.

When the data size is extremely huge, it is inefficient to transfer the data from the database server to the analytical system. Which of the following is NOT a solution? Hodoop MapReduce Distributed file system In-memory analytics