Spark Test Questions

Source: Internet
Author: User

1. Which of the following is not one of Spark's four major components? ()
A. Spark Streaming B. MLlib C. GraphX D. SparkR


2. Which of the following ports is not one that a Spark service listens on? ()
A. 8080 B. 4040 C. 8090 D. 18080


3. What was the biggest change in Spark 1.4? ()
A. Spark SQL release version B. Introduction of SparkR C. DataFrame D. Support for dynamic resource allocation


4. What is the default scheduling mode for Spark jobs? ()
A. FIFO B. FAIR C. None D. Specified at run time


5. Which of the following is not a condition for running a job in local mode? ()
A. spark.localExecution.enabled=true B. Local execution is explicitly specified C. The final stage has no parent stage D. The number of partitions is the default value


6. Which of the following is not a feature of an RDD? ()
A. Partitionable B. Serializable C. Modifiable D. Persistable


7. Which of the following statements about broadcast variables is wrong? ()
A. Can be used in any function call B. Read-only C. Stored on each node D. Stored on disk or HDFS


8. Which of the following statements about accumulators is wrong? ()
A. Supports addition B. Supports numeric types C. Can be updated in parallel D. Does not support custom types


9. Which of the following is not a distributed deployment mode supported by Spark? ()
A. Standalone B. Spark on Mesos C. Spark on YARN D. Spark on Local


10. What determines the number of tasks in a stage? ()
A. Partition B. Job C. Stage D. TaskScheduler


11. Which of the following operations is a narrow dependency? ()
A. join B. filter C. group D. sort


12. Which of the following operations must be a wide dependency? ()
A. map B. flatMap C. reduceByKey D. sample


13. How do Spark's master and worker communicate? ()
A. HTTP B. NIO C. Netty D. Akka


14. What is the default storage level? ()
A. MEMORY_ONLY B. MEMORY_ONLY_SER C. MEMORY_AND_DISK D. MEMORY_AND_DISK_SER


15. Which of the following does spark.deploy.recoveryMode not support? ()
A. ZOOKEEPER B. FILESYSTEM C. NONE D. Hadoop


16. Which of the following is not an RDD caching method? ()
A. persist() B. cache() C. memory()


17. A task is a unit of work on an executor. In which of the following does it run? ()
A. Driver program B. Spark master C. Worker node D. Cluster manager


18. What is the difference between storing Hive metadata in Derby and in MySQL? ()
A. No difference B. Multiple sessions C. Support for network environments D. Database differences


19. What is the biggest difference between a DataFrame and an RDD? ()
A. Scientific statistics support B. Addition of a schema C. Different storage methods D. External data source support


20. What does the Master do after the ElectedLeader event? ()
A. Notify the driver B. Notify the workers C. Register the application D. Go directly to ALIVE


Answer:

DCBAD CDDDA

BCDAD CCBBD
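
Note on questions 10 to 12: the following is a minimal plain-Python sketch, not Spark code. It models a dataset as a list of partitions to suggest why a filter can work on each partition independently (narrow dependency), why a reduce-by-key has to regroup records across partitions (wide dependency), and why a stage needs one task per partition. The names narrow_filter and wide_reduce_by_key and the hash-based placement of keys are assumptions made for this illustration only.

```python
from collections import defaultdict
from functools import reduce


def narrow_filter(partitions, predicate):
    """Filter every partition on its own: output partition i depends only on
    input partition i, so no records move between partitions."""
    return [[rec for rec in part if predicate(rec)] for part in partitions]


def wide_reduce_by_key(partitions, func, num_partitions):
    """Regroup (key, value) records by key across ALL input partitions
    (a stand-in for a shuffle), then reduce the values of each key."""
    buckets = [defaultdict(list) for _ in range(num_partitions)]
    for part in partitions:
        for key, value in part:
            # Hypothetical placement rule: the key's hash picks the output partition.
            buckets[hash(key) % num_partitions][key].append(value)
    return [
        [(key, reduce(func, values)) for key, values in bucket.items()]
        for bucket in buckets
    ]


# Three input partitions, so a stage over this data would need three tasks.
partitions = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)], [("b", 5)]]
num_tasks = len(partitions)

filtered = narrow_filter(partitions, lambda kv: kv[1] > 1)
reduced = wide_reduce_by_key(partitions, lambda x, y: x + y, 2)
```

In this sketch the filter keeps the three-partition layout unchanged, while the reduce-by-key reads every input partition to build each output partition, which is the property that forces a stage boundary in the real system.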











