"Winning the cloud computing Big Data era"
Spark Asia Pacific Research Institute Stage 1 Public Welfare lecture hall [Stage 1 interactive Q & A sharing]
Q1: Are there many large companies using the tachyon + spark framework?
Yahoo! It has been widely used for a long time;
Some companies in China are also using it;
Q2: How can Impala and spark SQL be selected?
Impala has been officially announced as "Euthanasia" and has been gently abandoned by the official team;
Spark SQL is the core sub-framework of spark. It can be seamlessly integrated with graph computing and machine learning frameworks. It is strongly recommended!
Q3: What if a program uses streaming to write data to the tachyon cluster but the tachyon memory is insufficient?
Tachyon data has lineage;
You can configure the storage policy in tachyon.
[Interactive Q & A sharing] Stage 1 wins the public welfare lecture hall of spark Asia Pacific Research Institute in the cloud computing Big Data age