Presentation is loading. Please wait.

Presentation is loading. Please wait.

TABLE OF CONTENTS. TABLE OF CONTENTS Not Possible in single computer and DB Serialised solution not possible Large data backup difficult so data.

Similar presentations


Presentation on theme: "TABLE OF CONTENTS. TABLE OF CONTENTS Not Possible in single computer and DB Serialised solution not possible Large data backup difficult so data."— Presentation transcript:

1

2 TABLE OF CONTENTS

3

4

5

6 Not Possible in single computer and DB
Serialised solution not possible Large data backup difficult so data loss prevention difficult Above 3 points causes large cost

7

8

9

10

11

12

13

14 SAMPLE QUESTIONS Q1. All of the following accurately describe Hadoop, EXCEPT: a) Open source b) Real-time c) Java-based d) Distributed computing approach Q2. Which one is not one of the big data feature? a) Velocity b) Veracity c) Variety d) Volume

15

16

17

18

19

20 SAMPLE QUESTIONS Q1. As compared to RDBMS, Hadoop
a) Has higher data Integrity. b) Does ACID transactions c) Is suitable for read and write many times d) Works better on unstructured and semi-structured data. Q2. The hdfs command put is used to a) Copy files from local file system to HDFS. b) Copy files or directories from local file system to HDFS. c) Copy files from from HDFS to local filesystem. d) Copy files or directories from HDFS to local filesystem.

21

22

23

24

25

26

27 SAMPLE QUESTIONS Q1. If the IP address or hostname of a datanode changes a) The namenode updates the mapping between file name and block name b) The namenode need not update mapping between file name and block name c) The data in that data node is lost forever d) There namenode has to be restarted Q2. For a HDFS directory the replication factor(RF) is a) same as the RF of the files in that directory b) 0 c) 3 d) Does not apply

28

29

30

31

32

33

34

35

36

37

38

39

40

41 SAMPLE QUESTIONS Q1. Point out the incorrect statement :
a) Applications can use the Reporter to report progress b) The Hadoop MapReduce framework spawns one map task for each InputSplit generated by the InputFormat for the job c) The intermediate, sorted outputs are always stored in a simple (key- len, key, value-len, value) format d) None of the mentioned Q2. _________ is the default Partitioner for partitioning key space. a) HashPar b) Partitioner c) HashPartitioner d) None of the mentioned

42


Download ppt "TABLE OF CONTENTS. TABLE OF CONTENTS Not Possible in single computer and DB Serialised solution not possible Large data backup difficult so data."

Similar presentations


Ads by Google