Download presentation
Presentation is loading. Please wait.
Published byDorthy Garrison Modified over 6 years ago
2
TABLE OF CONTENTS
6
Not Possible in single computer and DB
Serialised solution not possible Large data backup difficult so data loss prevention difficult Above 3 points causes large cost
14
SAMPLE QUESTIONS Q1. All of the following accurately describe Hadoop, EXCEPT: a) Open source b) Real-time c) Java-based d) Distributed computing approach Q2. Which one is not one of the big data feature? a) Velocity b) Veracity c) Variety d) Volume
20
SAMPLE QUESTIONS Q1. As compared to RDBMS, Hadoop
a) Has higher data Integrity. b) Does ACID transactions c) Is suitable for read and write many times d) Works better on unstructured and semi-structured data. Q2. The hdfs command put is used to a) Copy files from local file system to HDFS. b) Copy files or directories from local file system to HDFS. c) Copy files from from HDFS to local filesystem. d) Copy files or directories from HDFS to local filesystem.
27
SAMPLE QUESTIONS Q1. If the IP address or hostname of a datanode changes a) The namenode updates the mapping between file name and block name b) The namenode need not update mapping between file name and block name c) The data in that data node is lost forever d) There namenode has to be restarted Q2. For a HDFS directory the replication factor(RF) is a) same as the RF of the files in that directory b) 0 c) 3 d) Does not apply
41
SAMPLE QUESTIONS Q1. Point out the incorrect statement :
a) Applications can use the Reporter to report progress b) The Hadoop MapReduce framework spawns one map task for each InputSplit generated by the InputFormat for the job c) The intermediate, sorted outputs are always stored in a simple (key- len, key, value-len, value) format d) None of the mentioned Q2. _________ is the default Partitioner for partitioning key space. a) HashPar b) Partitioner c) HashPartitioner d) None of the mentioned
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.