Data stream statistics over sliding windows: How to summarize 150 Million updates per second on a single node

Odysseas Papapetrou, Grigorios Chrysos, Dionysios Pnevmatikatos
2019
conference papers

Résumé

Traditional data management systems map information using centralized and static data structures. Modern applications need to process in real time datasets much larger than system memory. To achieve this, they use dynamic entities that are updated with streaming input data over a sliding window. For efficient and high performance processing, approximate sketch synopses of input streams have been proposed as effective means for the summarization of streaming data over large sliding windows with probabilistic accuracy guarantees. This work presents a system-level solution to accelerate the Exponential Count-Min (ECM) sketch algorithm on reconfigurable technology. Different reconfigurable architectures for the sketch structure that correspond to different cost and performance tradeoffs are presented. We map the proposed system-level ECM sketch architectures to a high-end modern HPC platform to achieve guaranteed and best-effort update rates up to 150 and 180 million tuples per second respectively. We compare the performance of the implemented system against the best optimized multi-thread software alternative and show that our scalable full-system accelerators outperform software solutions by 5-7.5x for Virtex6 devices and in excess of 10x for current Ultrascale devices.

Source officielle

https://infoscience.epfl.ch/entities/publication/e2a4f004-1f50-4d9e-9d9d-3720a06fec4a

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Data stream statistics over sliding windows: How to summarize 150 Million updates per second on a single node

Graph Chatbot

EdgeAI-Aware Design of In-Memory Computing Architectures

Highly Parallel RTL Simulation

Secure Interface Design Leveraging Hardware/Software Support

EdgeAI-Aware Design of In-Memory Computing Architectures

Highly Parallel RTL Simulation

Secure Interface Design Leveraging Hardware/Software Support