"Winning the Cloud Computing and Big Data Era" Spark Asia-Pacific Research Institute 100-Session Public Lecture Series [Session 6 Interactive Q&A Highlights]
Q1: Can Spark Streaming join different data streams?
Yes, different data streams (DStreams) in Spark Streaming can be joined.
Spark Streaming is an extension of the core Spark API that enables high-throughput, fault-tolerant stream processing of live data streams. Data can be ingested from many sources like Kafka, Flume, Twitter, ZeroMQ or plain old TCP sockets and be processed using complex algorithms expressed with high-level functions like map, reduce, join and window.
join(otherStream, [numTasks]): When called on two DStreams of (K, V) and (K, W) pairs, return a new DStream of (K, (V, W)) pairs with all pairs of elements for each key.
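As a minimal sketch of the join operation described above: the two socket sources, host names, ports and the comma-separated record layout below are all hypothetical placeholders, not part of the original Q&A.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
// Needed for pair-DStream operations (join, reduceByKey, ...) on older Spark versions.
import org.apache.spark.streaming.StreamingContext._

object StreamJoinExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("StreamJoinExample")
    val ssc = new StreamingContext(conf, Seconds(10))

    // Two hypothetical text sources; hosts/ports are placeholders.
    val clicks = ssc.socketTextStream("host1", 9999)
      .map(line => (line.split(",")(0), line))   // (userId, clickRecord)
    val purchases = ssc.socketTextStream("host2", 9998)
      .map(line => (line.split(",")(0), line))   // (userId, purchaseRecord)

    // Joining two (K, V) / (K, W) DStreams yields (userId, (clickRecord, purchaseRecord)).
    val joined = clicks.join(purchases)
    joined.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```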
Q2: Are Flume and Spark Streaming suitable for cluster mode?
Flume and Spark Streaming were built for cluster deployment.
For input streams that receive data over the network (such as Kafka, Flume, sockets, etc.), the default persistence level is set to replicate the data to two nodes for fault-tolerance.
Using any input source that receives data through a network - For network-based data sources like Kafka and Flume, the received input data is replicated in memory between nodes of the cluster (default replication factor is 2).
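A minimal sketch of a Flume receiver running on a cluster, assuming the spark-streaming-flume external module is on the classpath; the receiver host and port are placeholders. The "_2" storage level explicitly replicates received blocks to two nodes, which is the fault-tolerance behaviour the quoted docs describe.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.flume.FlumeUtils

object FlumeClusterExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("FlumeClusterExample")
    val ssc = new StreamingContext(conf, Seconds(5))

    // Receive events pushed by a Flume Avro sink; host/port are placeholders.
    // MEMORY_AND_DISK_SER_2 replicates each received block to 2 nodes.
    val flumeStream = FlumeUtils.createStream(
      ssc, "receiver-host", 4141, StorageLevel.MEMORY_AND_DISK_SER_2)

    // Print the payload size of each event as a simple sanity check.
    flumeStream.map(event => event.event.getBody.remaining()).print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```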
Q3: Does Spark have any drawbacks?
Spark's main drawback is its relatively heavy memory consumption.
In earlier versions, Spark's handling of resources was mostly coarse-grained, which made fine-grained control difficult;
after the FAIR mode was added, finer-grained control became possible.
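Interpreting the "Fair mode" above as Spark's fair job scheduler, here is a minimal sketch of how it is enabled; the pool name "interactive" and the trivial job are illustrative assumptions only.

```scala
import org.apache.spark.{SparkConf, SparkContext}

object FairSchedulingExample {
  def main(args: Array[String]): Unit = {
    // Switch the scheduler from the default FIFO mode to FAIR mode,
    // so concurrent jobs share cluster resources more evenly.
    val conf = new SparkConf()
      .setAppName("FairSchedulingExample")
      .set("spark.scheduler.mode", "FAIR")
    val sc = new SparkContext(conf)

    // Optionally assign jobs from this thread to a named pool
    // (pool weights/minShares come from a fairscheduler.xml file).
    sc.setLocalProperty("spark.scheduler.pool", "interactive")

    // A trivial job just to have something scheduled.
    sc.parallelize(1 to 1000).map(_ * 2).count()
    sc.stop()
  }
}
```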
Q4: Is Spark Streaming used in production today?
Spark Streaming is very easy to use in production environments.
No separate deployment is needed: once Spark itself is installed, Spark Streaming is installed along with it.
In China, companies such as 皮皮網 are already using Spark Streaming in production.
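To illustrate the point that a plain Spark install is enough, here is a minimal, self-contained streaming word count that can be submitted with spark-submit against a stock Spark distribution; the localhost socket source on port 9999 is a placeholder assumption.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.StreamingContext._

object MinimalStreamingApp {
  def main(args: Array[String]): Unit = {
    // Spark Streaming ships inside the Spark distribution,
    // so no extra deployment step is required beyond installing Spark.
    val conf = new SparkConf().setAppName("MinimalStreamingApp")
    val ssc = new StreamingContext(conf, Seconds(1))

    // Hypothetical TCP text source; host and port are placeholders.
    val lines = ssc.socketTextStream("localhost", 9999)
    lines.flatMap(_.split(" "))
      .map(word => (word, 1))
      .reduceByKey(_ + _)
      .print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```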