Spark powers a stack of high-level tools including Spark SQL, MLlib for _________

machine learning
Excellent! Your answer is correct. Explanation: Spark is used at a wide range of organizations to process large datasets.
statistics
regression models
reproductive research

Spark is engineered from the bottom up for performance, running up to ___________ faster than Hadoop MapReduce by exploiting in-memory computing and other optimizations.

100x
Excellent! Your answer is correct. Explanation: Spark is fast on disk too; it set a world record for large-scale on-disk sorting in the 2014 Daytona GraySort benchmark.
150x
200x
None of the options is correct
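
The in-memory idea behind this question can be sketched without Spark at all. The toy below is plain Python, not Spark's API: the `Dataset` class, its `cache()` method, and the `loads` counter are hypothetical names invented for illustration. It only shows why keeping data in memory helps an iterative job, which would otherwise re-read its input on every pass.

```python
# Toy illustration only: plain Python, not Spark. All names are hypothetical.

class Dataset:
    """Pretends to be a dataset that is expensive to load from disk."""

    def __init__(self, values):
        self._values = list(values)
        self.loads = 0           # how many simulated disk reads happened
        self._cache = None       # in-memory copy, filled on first load if enabled
        self._use_cache = False

    def cache(self):
        """Ask for the data to be kept in memory after the first load."""
        self._use_cache = True
        return self

    def _load(self):
        # Serve from memory when caching is enabled and the copy exists.
        if self._use_cache and self._cache is not None:
            return self._cache
        self.loads += 1          # simulate one full read from disk
        data = list(self._values)
        if self._use_cache:
            self._cache = data
        return data

    def sum_of_squares(self):
        return sum(v * v for v in self._load())


def run_iterations(dataset, iterations):
    """Run the same computation several times, as an iterative job would."""
    return [dataset.sum_of_squares() for _ in range(iterations)]


uncached = Dataset(range(5))
cached = Dataset(range(5)).cache()

uncached_results = run_iterations(uncached, 3)
cached_results = run_iterations(cached, 3)

print("disk reads without cache:", uncached.loads)
print("disk reads with cache:", cached.loads)
```

Both runs produce the same results; only the number of simulated disk reads differs. Real speedups depend on the workload, so the figure in the question is best read as an upper bound rather than a guarantee.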

Spark includes a collection of over ________ operators for transforming data and familiar data frame APIs for manipulating semi-structured data.

80
Excellent! Your answer is correct. Explanation: Spark provides easy-to-use APIs for operating on large datasets.
70
60
50
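
To make "operators for transforming data" concrete, here is a minimal sketch of operator chaining in plain Python. `MiniRDD` is a hypothetical stand-in written for this example, not Spark code; its method names are borrowed from Spark's RDD API (`map`, `flatMap`, `filter`, `reduceByKey`, `collect`), but it runs eagerly on a single machine, whereas Spark evaluates transformations lazily across a cluster.

```python
# Toy illustration only: a single-machine stand-in, not Spark itself.
# MiniRDD is a hypothetical class; only its method names mirror Spark's RDD API.

class MiniRDD:
    def __init__(self, items):
        self._items = list(items)

    def map(self, fn):
        """Apply fn to every element."""
        return MiniRDD(fn(x) for x in self._items)

    def flatMap(self, fn):
        """Apply fn to every element and flatten the resulting iterables."""
        return MiniRDD(y for x in self._items for y in fn(x))

    def filter(self, predicate):
        """Keep only the elements for which predicate is true."""
        return MiniRDD(x for x in self._items if predicate(x))

    def reduceByKey(self, fn):
        """Merge the values of (key, value) pairs that share a key."""
        merged = {}
        for key, value in self._items:
            merged[key] = fn(merged[key], value) if key in merged else value
        return MiniRDD(merged.items())

    def collect(self):
        """Return the elements as an ordinary list."""
        return list(self._items)


lines = ["spark is fast", "spark is general"]

counts = (
    MiniRDD(lines)
    .flatMap(lambda line: line.split())   # split lines into words
    .filter(lambda word: word != "is")    # drop a stop word
    .map(lambda word: (word, 1))          # pair each word with a count of 1
    .reduceByKey(lambda a, b: a + b)      # sum the counts per word
    .collect()
)

print(sorted(counts))
```

Each operator returns a new collection, so a whole pipeline reads as one chained expression. That composability is what the question's "collection of operators" refers to.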

Spark is packaged with higher level libraries, including support for _________ queries.

SQL
Excellent! Your answer is correct. Explanation: Standard libraries increase developer productivity and can be seamlessly combined to create complex workflows.
C
C++
None of the options is correct

Which of the following languages is not supported by Spark?

Pascal
Excellent! Your answer is correct. Explanation: The Spark engine runs in a variety of environments, from cloud services to Hadoop or Mesos clusters.
Java
Scala
Python

Which of the following can be used to launch Spark jobs inside MapReduce?

SIMR
Excellent! Your answer is correct. Explanation: With SIMR (Spark In MapReduce), users can start experimenting with Spark and use its shell within a couple of minutes of downloading it.
SIM
SIR
RIS

Spark runs on top of ___________, a cluster manager that provides efficient resource isolation across distributed applications.

Mesos
Excellent! Your answer is correct. Explanation: Mesos enables fine-grained sharing, which allows a Spark job to dynamically take advantage of idle resources in the cluster during its execution.
Mesjs
Mesus
All of the options are correct

Point out the correct statement.

Spark enables Apache Hive users to run their unmodified queries much faster
Excellent! Your answer is correct. Explanation: Shark, the Hive-compatible SQL engine built on Spark, can accelerate Hive queries by as much as 100x when the input data fits into memory, and by up to 10x when the input data is stored on disk.
Spark interoperates only with Hadoop
Spark is a popular data warehouse solution running on top of Hadoop
None of the options is correct

Users can easily run Spark on top of Amazon’s __________

EC2
Excellent! Your answer is correct. Explanation: Users can easily run Spark (and Shark) on top of Amazon’s EC2 using the scripts that come with Spark.
Infosphere
EMR
None of the options is correct

Spark's architecture is ___________ times as fast as the Hadoop disk-based Apache Mahout and even scales better than Vowpal Wabbit.

10
Excellent! Your answer is correct. Explanation: The Spark architecture has proven scalability to over 8,000 nodes in production.
20
50
100