Yet Another Cache but for the Streaming World

Traditional cache solutions treat each entry as an immutable blob of data, which poses problems for the append-heavy ingestion workloads that are common in Pravega. Each Event appended to a Stream would either require its own cache entry or need an expensive read-modify-write operation to be included in the Cache. To enable high-performance ingestion of events, big or small, while also providing near-real-time tail reads and high-throughput historical reads, Pravega needs a specialized cache that can natively support the types of workloads that are prevalent in Streaming Storage Systems.

The Streaming Cache, introduced in Pravega with release 0.7, was designed from the ground up with streaming data in mind: it optimizes for appends and organizes the data in a layout that makes eviction and disk spilling easy.
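To make the contrast with a classic get/put cache concrete, here is a rough sketch of what an append-friendly cache contract might look like. The interface and its names are hypothetical, for illustration only, and not Pravega's actual cache API.

```java
// Hypothetical sketch of an append-friendly cache contract. Entries are
// addressable and can grow in place, so a small Event can be added to an
// existing entry without the read-modify-write cycle a blob cache would need.
public interface StreamingCacheSketch {

    /** Creates a new entry with the given contents and returns its address. */
    int insert(byte[] initialData);

    /** Appends data to an existing entry without copying its prior contents. */
    void append(int address, byte[] data);

    /** Reads the full contents of an entry. */
    byte[] get(int address);

    /** Deletes an entry so its space can be reused or spilled. */
    void delete(int address);
}
```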

Not all caches are created equal. It is essential to choose a cache that fits the requirements of the system where it will be used, and streaming solutions are no exception to that rule. In this blog post, we describe an innovative way to look at caching that works well with streaming use cases.

Segment Attributes

The ability to pipeline Events to the Segment Store is a key technique that the Pravega Client uses to achieve high throughput, even when dealing with small writes. A Writer appends an Event to its corresponding Segment as soon as the Event is received, without waiting for previous appends to be acknowledged. To guarantee ordering and exactly-once semantics, the Segment Store requires all such appends to be conditional on some known state, which is unique per Writer. This state is stored in each Segment’s Attributes and can be atomically queried and updated with every Segment operation.
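To make the mechanism concrete, here is a minimal, self-contained sketch of a conditional append gated on a per-Writer attribute. This is hypothetical illustration code, not the Segment Store's actual interface; the class and method names are invented for this example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.UUID;

// Hypothetical sketch: an append is accepted only if the per-Writer attribute
// still holds the event number the Writer expects, and the attribute is
// updated in the same atomic step as the append itself.
public class ConditionalAppendSketch {
    private final Map<UUID, Long> attributes = new HashMap<>(); // per-Writer state
    private final StringBuilder segmentData = new StringBuilder();

    public synchronized boolean append(UUID writerId, long expectedEventNumber,
                                       long newEventNumber, String event) {
        long current = attributes.getOrDefault(writerId, 0L);
        if (current != expectedEventNumber) {
            return false; // duplicate or out-of-order append: rejected
        }
        segmentData.append(event);                 // apply the append...
        attributes.put(writerId, newEventNumber);  // ...and update the state atomically
        return true;
    }
}
```

A Writer that retries event N after a connection failure is rejected if the attribute has already advanced past N, which is what prevents duplication without a round trip per Event.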

Over time, Attributes have evolved to support a variety of use cases, from keeping track of the number of Events in a Segment (enabling auto-scaling) to storing a hash table index. The introduction of Table Segments (key-value stores that contain all of Pravega’s Stream, Transaction, and Segment metadata) required the ability to seamlessly manage tens of millions of such Attributes per Segment.

This blog post explains how Segment Attributes work under the hood to provide an efficient key-value store that represents the foundation for several higher-level features. It begins with an overview of how Pravega Writers use them to prevent data duplication or loss and follows up by describing how Segment Attributes are organized as B+Trees in Tier 2 using innovative compaction techniques that reduce write amplification.

Events Big or Small – Bring Them On

Streaming applications typically need to process events as soon as they arrive. For example, the ability to react quickly to events in applications such as fraud detection and manufacturing error detection can result in massive savings. However, because storage systems have traditionally been unable to handle large numbers of small writes, producers are forced to buffer events before writing. This not only increases the latency between event generation and event processing, but also increases the chance of losing events when writers cannot store them reliably in failure scenarios. Pravega has been built from the ground up to be an ideal store for stream processing: not only does it handle frequent small writes well, it also does so in a consistent and durable way.

This blog explains how Pravega provides excellent throughput with minimal latency for all writes, big or small. It explores the append path of the Segment Store, detailing how we pipeline external requests and use the append-only Tier 1 log to achieve excellent ingestion performance without sacrificing latency.
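The effect of this pipelining is visible from the client side: in the Pravega Java client, writeEvent returns a CompletableFuture immediately, so a Writer can keep many small appends in flight and wait for acknowledgments in bulk. A minimal sketch, assuming a Pravega controller at tcp://localhost:9090 and an existing scope myScope and stream myStream (both names illustrative):

```java
import io.pravega.client.ClientConfig;
import io.pravega.client.EventStreamClientFactory;
import io.pravega.client.stream.EventStreamWriter;
import io.pravega.client.stream.EventWriterConfig;
import io.pravega.client.stream.impl.UTF8StringSerializer;

import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;

public class PipelinedWrites {
    public static void main(String[] args) {
        ClientConfig config = ClientConfig.builder()
                .controllerURI(URI.create("tcp://localhost:9090")).build();
        try (EventStreamClientFactory factory =
                     EventStreamClientFactory.withScope("myScope", config);
             EventStreamWriter<String> writer = factory.createEventWriter(
                     "myStream", new UTF8StringSerializer(),
                     EventWriterConfig.builder().build())) {
            // Issue many small appends without waiting for individual acks;
            // the client pipelines them to the Segment Store.
            List<CompletableFuture<Void>> acks = new ArrayList<>();
            for (int i = 0; i < 1000; i++) {
                acks.add(writer.writeEvent("routingKey", "event-" + i));
            }
            // Wait once for the whole batch; per routing key, order is preserved.
            CompletableFuture.allOf(acks.toArray(new CompletableFuture[0])).join();
        }
    }
}
```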

Segment Store Internals

The Pravega Segment Store Service is a subsystem that lies at the heart of the entire Pravega deployment. It is the main access point for managing Stream Segments, providing the ability to modify and read their contents. The Pravega Client communicates with the Pravega Stream Controller to identify which Segments to use for a Stream, and both the Stream Controller and the Client deal with the Segment Store Service to operate on them.

We explore the internal workings of the Segment Store, covering its components and how they interact; in future posts, we will take deeper dives into each of them, explaining how they work.

Pravega Internals

Several of the difficulties with tailing a data stream boil down to the dynamics of the source and of the stream processor. For example, if the source increases its production rate in an unplanned manner, then the ingestion system must be able to accommodate such a change. The same happens when a downstream processor experiences issues and struggles to keep up with the rate. To be able to accommodate all such variations, it is critical that a system for storing stream data, like Pravega, be sufficiently flexible.

The flexibility of Pravega comes from breaking stream data into segments: append-only sequences of bytes that are organized both sequentially and in parallel to form streams. Segments enable important features, like parallel reads and writes, auto-scaling, and transactions; they are designed to be inexpensive to create and maintain. We can create new segments for a given stream when it needs more parallelism, when it needs to scale, or when it needs to begin a transaction.
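Because segments are cheap to create, auto-scaling is exposed directly in the client API: a stream can be configured to grow or shrink its segment count based on load. A minimal sketch using the Pravega Java client, assuming a controller at tcp://localhost:9090 (scope and stream names are illustrative):

```java
import io.pravega.client.admin.StreamManager;
import io.pravega.client.stream.ScalingPolicy;
import io.pravega.client.stream.StreamConfiguration;

import java.net.URI;

public class CreateScalingStream {
    public static void main(String[] args) {
        try (StreamManager streamManager =
                     StreamManager.create(URI.create("tcp://localhost:9090"))) {
            streamManager.createScope("myScope");
            // Target roughly 100 events/sec per segment, split/merge by a
            // factor of 2, and never drop below 2 segments.
            StreamConfiguration config = StreamConfiguration.builder()
                    .scalingPolicy(ScalingPolicy.byEventRate(100, 2, 2))
                    .build();
            streamManager.createStream("myScope", "myStream", config);
        }
    }
}
```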

The control plane in Pravega is responsible for all the operations that affect the lifecycle of a stream, e.g., create, delete, and scale. The data plane stores and serves the data of segments. The full post includes a figure depicting the high-level Pravega architecture with its core components.

Streams in and out of Pravega

Introduction

Reading and writing are the most basic operations that Pravega offers. Applications ingest data by writing to one or more Pravega streams and consume it by reading from one or more streams. To implement applications correctly with Pravega, however, it is crucial that the developer be aware of some additional functionality that complements the core write and read calls. For example, writes can be transactional, and readers are organized into groups.
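As a minimal illustration of both sides, the sketch below writes one event and then reads it back through a reader group using the Pravega Java client. It assumes a controller at tcp://localhost:9090 and an existing scope myScope and stream myStream (for example, created as in the scaling sketch above); all names are illustrative.

```java
import io.pravega.client.ClientConfig;
import io.pravega.client.EventStreamClientFactory;
import io.pravega.client.admin.ReaderGroupManager;
import io.pravega.client.stream.EventRead;
import io.pravega.client.stream.EventStreamReader;
import io.pravega.client.stream.EventStreamWriter;
import io.pravega.client.stream.EventWriterConfig;
import io.pravega.client.stream.ReaderConfig;
import io.pravega.client.stream.ReaderGroupConfig;
import io.pravega.client.stream.Stream;
import io.pravega.client.stream.impl.UTF8StringSerializer;

import java.net.URI;

public class WriteThenRead {
    public static void main(String[] args) throws Exception {
        ClientConfig config = ClientConfig.builder()
                .controllerURI(URI.create("tcp://localhost:9090")).build();

        // Write one event and wait for its acknowledgment.
        try (EventStreamClientFactory factory =
                     EventStreamClientFactory.withScope("myScope", config);
             EventStreamWriter<String> writer = factory.createEventWriter(
                     "myStream", new UTF8StringSerializer(),
                     EventWriterConfig.builder().build())) {
            writer.writeEvent("routingKey", "hello, Pravega").join();
        }

        // Create a reader group over the stream; readers in the same group
        // share the stream's segments among themselves.
        try (ReaderGroupManager rgm = ReaderGroupManager.withScope("myScope", config)) {
            rgm.createReaderGroup("myGroup", ReaderGroupConfig.builder()
                    .stream(Stream.of("myScope", "myStream")).build());
        }

        // Read the event back as a member of that group.
        try (EventStreamClientFactory factory =
                     EventStreamClientFactory.withScope("myScope", config);
             EventStreamReader<String> reader = factory.createReader(
                     "reader-1", "myGroup", new UTF8StringSerializer(),
                     ReaderConfig.builder().build())) {
            EventRead<String> event = reader.readNextEvent(2000);
            // getEvent() may be null if the timeout expired without data.
            System.out.println("Read: " + event.getEvent());
        }
    }
}
```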

In this post, we cover some basic concepts and functionality that a developer must be aware of when developing an application with Pravega, focusing on reads and writes. We encourage the reader to additionally check the Pravega documentation site under the section “Developing Pravega Applications” for code snippets and more detail.

Storage Reimagined for a Streaming World

Streaming is driven by the desire to shrink to zero the time it takes to turn massive volumes of raw data into useful information and action, and it is deceptively simple: just process and act on data as it arrives, quickly, and in a continuous and infinite fashion.

For use cases from Industrial IoT to Connected Cars to Real-Time Fraud Detection and more, we’re increasingly looking to build new applications and customer experiences that react quickly to customer interests and actions, learn and adapt to changing behavior patterns, and the like. But the reality is that most of us don’t yet have the tools to do this with production-level data volumes, ingestion rates, and fault resiliency. So we do the best we can with bespoke systems, piling complexity on top of complexity.

Complexity is symptomatic of fundamental systems design mismatches: we’re using a component for something it wasn’t designed to do, and the mechanisms at our disposal won’t scale from small to large.