An FPGA-based low-latency network processing for spark streaming

Kohei Nakamura, Ami Hayashi, Hiroki Matsutani

研究成果: Conference contribution

6 被引用数 (Scopus)

抄録

Low-latency stream data processing is a key enabler for on-line data analysis applications, such as detecting anomaly conditions and change points from stream data continuously generated from sensors and networking services. Existing stream processing frameworks are classified into micro-batch and one-at-a-time processing methodology. Apache Spark Streaming employs the micro-batch methodology, where data analysis is repeatedly performed for a series of data arrived during a short time period, called a micro batch. A rich set of data analysis libraries provided by Spark, such as machine learning and graph processing, can be applied for the micro batches. However, a drawback of the micro-batch processing methodology is a high latency for detecting anomaly conditions and change points. This is because data are accumulated in a micro batch (e.g., 1 sec length) and then data analysis is performed for the micro batch. In this paper, we propose to offload one-at-a-time methodology analysis functions on an FPGA-based 10Gbit Ethernet network interface card (FPGA NIC) in cooperation with Spark Streaming framework, in order to significantly reduce the processing latency and improve the processing throughput. We implemented word count and change-point detection applications on Spark Streaming with our FPGA NIC, where a one-at-a-time methodology analysis logic is implemented. Experiment results demonstrates that the word count throughput is improved by 22x and the change-point detection latency is reduced by 94.12% compared to the original Spark Streaming. Our approach can complement the existing micro-batch methodology data analysis framework with ultra low latency one-at-a-time methodology logic.

本文言語English
ホスト出版物のタイトルProceedings - 2016 IEEE International Conference on Big Data, Big Data 2016
編集者Ronay Ak, George Karypis, Yinglong Xia, Xiaohua Tony Hu, Philip S. Yu, James Joshi, Lyle Ungar, Ling Liu, Aki-Hiro Sato, Toyotaro Suzumura, Sudarsan Rachuri, Rama Govindaraju, Weijia Xu
出版社Institute of Electrical and Electronics Engineers Inc.
ページ2410-2415
ページ数6
ISBN(電子版)9781467390040
DOI
出版ステータスPublished - 2016
イベント4th IEEE International Conference on Big Data, Big Data 2016 - Washington, United States
継続期間: 2016 12月 52016 12月 8

出版物シリーズ

名前Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016

Other

Other4th IEEE International Conference on Big Data, Big Data 2016
国/地域United States
CityWashington
Period16/12/516/12/8

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • 情報システム
  • ハードウェアとアーキテクチャ

フィンガープリント

「An FPGA-based low-latency network processing for spark streaming」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル