PiCo: a Novel Approach to Stream Data Analytics

Contributo in Atti di convegno

Data di Pubblicazione:

2018

Abstract:

In this paper, we present a new C++ API with a fluent interface called PiCo (Pipeline Composition). PiCo's programming model aims at making easier the programming of data analytics applications while preserving or enhancing their performance. This is attained through three key design choices: 1) unifying batch and stream data access models, 2) decoupling processing from data layout, and 3) exploiting a stream-oriented, scalable, efficient C++11 runtime system. PiCo proposes a programming model based on pipelines and operators that are polymorphic with respect to data types in the sense that it is possible to re-use the same algorithms and pipelines on different data models (e.g., streams, lists, sets, etc.). Preliminary results show that PiCo can attain better performances in terms of execution times and hugely improve memory utilization when compared to Spark and Flink in both batch and stream processing.

Tipologia CRIS:

04A-Conference paper in volume

Elenco autori:

Claudia, Misale; Maurizio, Drocco; Guy, Tremblay; Marco, Aldinucci

Autori di Ateneo:

ALDINUCCI Marco

Link alla scheda completa:

https://iris.unito.it/handle/2318/1659344

Link al Full Text:

https://iris.unito.it/retrieve/handle/2318/1659344/409520/autodasp.pdf

Titolo del libro:

Proc. of Euro-Par Workshops: 1st Intl. Workshop on Autonomic Solutions for Parallel and Distributed Data Stream Processing (Auto-DaSP)

Pubblicato in:

LECTURE NOTES IN COMPUTER SCIENCE

Journal

LECTURE NOTES IN COMPUTER SCIENCE

Series