Spark For Big Data Set 2
Free Online Best Spark For Big Data MCQ Questions for improve your basic knowledge of Spark For Big Data. This Spark For Big Data set 2 test that contains 25 Multiple Choice Questions with 4 options. You have to select the right answer to a question.
Start
Congratulations - you have completed Spark For Big Data Set 2.
You scored %%SCORE%% out of %%TOTAL%%.
Your performance has been rated as %%RATING%%
Your answers are highlighted below.
Question 1 |
In Spark Streaming the data can be from what all sources?
A | Kafka |
B | Flume |
C | Kinesis |
D | All of the above |
Question 2 |
Apache Spark was made open-source in which year?
A | 2010 |
B | 2011 |
C | 2008 |
D | 2009 |
Question 3 |
Which is not a component on the top of Spark Core?
A | Spark RDD |
B | Spark Streaming |
C | MLlib |
D | None of the above |
Question 4 |
What are the parameters defined to specify window operation?
A | Window length, sliding interval |
B | State size, window length |
C | State size, sliding interval |
D | None of the above |
Question 5 |
Dataset was introduced in which Spark release?
A | Spark 1.6 |
B | Spark 1.4.0 |
C | Spark 2.1.0 |
D | Spark 1.1 |
Question 6 |
Which of the following is good for low-level transformation and actions?
A | RDD |
B | DataFrame |
C | Dataset |
D | None of the above |
Question 7 |
Which of the following language is not supported by Spark?
A | Python |
B | Scala |
C | Java |
D | Pascal |
Question 8 |
Which is the abstraction of Apache Spark?
A | Shared Variable |
B | RDD |
C | Both a and b |
D | none of the above |
Question 9 |
Dstream internally is____
A | Continuous Stream of RDD |
B | Continuous Stream of DataFrame |
C | Continuous Stream of DataSet |
D | None of the above |
Question 10 |
Which of the following algorithm is not present in MLlib?
A | Streaming Linear Regression |
B | Streaming KMeans |
C | Tanimoto distance |
D | None of the above |
Question 11 |
Which of the following is not the feature of Spark?
A | Supports in-memory computation |
B | Fault-tolerance |
C | It is cost efficient |
D | Compatible with other file storage system |
Question 12 |
Which of the following make use of an encoder for serialization?
A | RDD |
B | DataFrame |
C | Dataset |
D | None of the above |
Question 13 |
Which of the following can be used to launch Spark jobs inside MapReduce?
A | SIM |
B | SIMR |
C | SIR |
D | RIS |
Question 14 |
Which of the following is slow to perform simple grouping and aggregation operations?
A | RDD |
B | DataFrame |
C | Dataset |
D | None of the above |
Question 15 |
Spark is developed in which language?
A | Java |
B | Scala |
C | Python |
D | R |
Question 16 |
Apache Spark has API's in____
A | Java |
B | Scala |
C | Python |
D | All of the above |
Question 17 |
The basic abstraction of Spark Streaming is____
A | Dstream |
B | RDD |
C | Shared Variable |
D | None of the above |
Question 18 |
Which Cluster Manager do Spark Support?
A | Standalone Cluster Manager |
B | MESOS |
C | YARN |
D | All of the above |
Question 19 |
Which of the following provide the object-oriented programming interface?
A | RDD |
B | DataFrame |
C | Dataset |
D | None of the above |
Question 20 |
Which of the following is not output operation on DStream?
A | SaveAsTextFiles |
B | ForeachRDD |
C | SaveAsHadoopFiles |
D | ReduceByKeyAndWindow |
Question 21 |
The default storage level of cache() is?
A | MEMORY_ONLY |
B | MEMORY_AND_DISK |
C | DISK_ONLY |
D | MEMORY_ONLY_SER |
Question 22 |
Point out the correct statement______
A | Spark enables Apache Hive users to run their unmodified queries much faster |
B | Spark interoperates only with Hadoop |
C | Spark is a popular data warehouse solution running on top of Hadoop |
D | All of the above |
Question 23 |
________ is a distributed graph processing framework on top of Spark
A | MLlib |
B | Spark Streaming |
C | GraphX |
D | None of the above |
Question 24 |
Which of the following is not a component of Spark Ecosystem?
A | Sqoop |
B | GraphX |
C | MLlib |
D | BlinkDB |
Question 25 |
Which of the following organized a data into a named column?
a. RDD
b. DataFrame
c. Dataset
A | Both a and b |
B | Both b and c |
C | Both a and c |
D | None of the above |
Once you are finished, click the button below. Any items you have not completed will be marked incorrect.
Get Results
There are 25 questions to complete.
← |
List |
→ |
Return
Shaded items are complete.
1 | 2 | 3 | 4 | 5 |
6 | 7 | 8 | 9 | 10 |
11 | 12 | 13 | 14 | 15 |
16 | 17 | 18 | 19 | 20 |
21 | 22 | 23 | 24 | 25 |
End |
Return
You have completed
questions
question
Your score is
Correct
Wrong
Partial-Credit
You have not finished your quiz. If you leave this page, your progress will be lost.
Correct Answer
You Selected
Not Attempted
Final Score on Quiz
Attempted Questions Correct
Attempted Questions Wrong
Questions Not Attempted
Total Questions on Quiz
Question Details
Results
Date
Score
Hint
Time allowed
minutes
seconds
Time used
Answer Choice(s) Selected
Question Text
All done
Need more practice!
Keep trying!
Not bad!
Good work!
Perfect!