1. Which of the four components of Spark is not ()
A.spark streaming B Mlib C Graphx D Spark R
2. Which port below is not the port on which the spark comes in service ()
a.8080 b.4040 c.8090 d.18080
Maximum changes in version 3.spark (1.4)
A Spark SQL Release version B introduces Spark R C DataFrame D to support dynamic resource allocation
4. Spark Job default scheduling mode ()
A FIFO B FAIR C no D run-time designation
5. Which is not a condition for local mode operation ()
A spark.localexecution.enabled=true B explicitly specifies local run C finalstage no parent Stage D partition default value
6. Which of the following is not an RDD feature ()
A. Can be partitioned B serializable C modifiable D can be persisted
7. About the broadcast variable, which one of the following is wrong ()
A any function call B is read-only C stored on each node D stored on disk or HDFS
8. About the accumulator, which one of the following is wrong ()
A supports addition B supports numeric type C can be parallel D does not support custom types
Which of the 9.Spark supported distributed deployment methods is wrong ()
A standalone B Spark on Mesos C Spark on YARN D Spark on Local
The number of Task 10.Stage is determined by what ()
A Partition B Job C Stage D TaskScheduler
11. Which of the following operations is narrow-dependent ()
A Join B Filter C Group D sort
12. Which of the following operations must be wide-dependent ()
A map B flatMap C reducebykey D sample
How the master and worker of 13.spark communicate in any way. ( )
A http B nio C netty D Akka
14 default storage level ()
A memory_only B Memory_only_ser
C Memory_and_disk D Memory_and_disk_ser
The Spark.deploy.recoveryMode does not support that kind of ()
A.zookeeper B. FileSystem d NONE D Hadoop
16. Which of the following is not the RDD cache method ()
A persist () B Cache () C Memory ()
17.Task run down where the options in the Executor work unit ()
A Driver program b. Spark master c.worker node D Cluster Manager
What is the difference between 18.hive metadata storage in Derby and MySQL ()
A. No difference b. Multi-session C. Support for network environment D database differences
The biggest difference between 19.DataFrame and RDD ()
A. Scientific statistics support B. More schema C. Different storage methods D. External data source support
What to do after the 20.Master Electedleader event ()
A. Notice driver B. Notify Worker C. Register application D. Direct ALIVE
Answer:
Dcbad Cddda
Bcdad CCBBD