Which of the following phases occur simultaneously?
Shuffle and Sort
Excellent ! Your Answer is Correct. Explanation: The shuffle and sort phases occur simultaneously; while map-outputs are being fetched they are merged.
Hadoop achieves reliability by replicating the data across multiple hosts, and hence does not require ________ storage on hosts.
RAID
Excellent ! Your Answer is Correct. With the default replication value, 3, data is stored on three nodes: two on the same rack, and one on a different rack.
Excellent ! Your Answer is Correct. The Hadoop framework itself is mostly written in the Java programming language, with some native code in C and command line utilities written as shell-scripts.
Which of the following genres does Hadoop produce ?
Distributed file system
Excellent ! Your Answer is Correct. The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user.