HDFS, by default, replicates each data block _____ times on different nodes and on at least ____ racks.
3, 2
Excellent ! Your Answer is Correct. Explanation: HDFS has a simple yet robust architecture that was explicitly designed for data reliability in the face of faults and failures in disks, nodes and networks.
__________ support storing a copy of data at a particular instant of time.
Snapshots
Excellent ! Your Answer is Correct. Explanation: One usage of the snapshot feature may be to roll back a corrupted HDFS instance to a previously known good point in time.
Which of the following phases occur simultaneously?
Shuffle and Sort
Excellent ! Your Answer is Correct. Explanation: The shuffle and sort phases occur simultaneously; while map-outputs are being fetched they are merged.
Hadoop achieves reliability by replicating the data across multiple hosts, and hence does not require ________ storage on hosts.
RAID
Excellent ! Your Answer is Correct. With the default replication value, 3, data is stored on three nodes: two on the same rack, and one on a different rack.