3) Consider a Hadoop job that processes an input data file of size equal to 45 disk blocks (45 different blocks, you can assume that HDFS replication factor is set to 1). The mapper in this job requires 1 minute to read and fully process a single block of data. For the purposes of this assignment, you can assume that the reduce part of this job takes zero time.

a) Approximately how long will it take to process the file if you only had one Hadoop worker node? You can assume that only one mapper is created on every node.

b) 20 Hadoop worker nodes?

c) 50 Hadoop worker nodes?

d) 75 Hadoop worker nodes?

e) Now suppose you were told that the replication factor has been changed to 3. That is, each block is stored in triplicate, but the file size is still 45 blocks. Which of the answers (if any) in a)-c) above will have to change?
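
One way to reason about parts a) through d) is to treat the map phase as a series of "waves": with one mapper per node, each wave processes at most one block per node, so the job needs roughly ceil(blocks / nodes) waves of 1 minute each. The sketch below works through that arithmetic. The function name `map_phase_minutes` and its parameters are made up for illustration, and the model assumes that any node can process any block with no scheduling, startup, or network overhead, which a real cluster would not guarantee.

```python
import math


def map_phase_minutes(num_blocks: int, num_nodes: int,
                      minutes_per_block: float = 1.0) -> float:
    """Estimate map-phase time under a simplified wave model.

    Assumptions (not stated in the question): every node runs exactly one
    mapper at a time, any node can process any block, and scheduling and
    data-transfer overhead are ignored.
    """
    if num_blocks < 0 or num_nodes <= 0:
        raise ValueError("num_blocks must be >= 0 and num_nodes must be > 0")
    # Each wave handles at most one block per node.
    waves = math.ceil(num_blocks / num_nodes)
    return waves * minutes_per_block


# Node counts taken from parts a) through d) of the question.
for nodes in (1, 20, 50, 75):
    print(f"{nodes} node(s): about {map_phase_minutes(45, nodes):.0f} minute(s)")
```

Under these assumptions, adding nodes beyond the number of blocks gives no further speedup, since each block is handled by a single mapper. The replication factor does not appear in this model at all, so it would leave these estimates unchanged; whether that holds on a real cluster depends on how the scheduler handles data locality.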
