Yet Another Cache but for the Streaming World

By Andrei Paduroiu, posted on April 1, 2020 in Storage Streaming Storage Technology

Traditional cache solutions treat each entry as an immutable blob of data, which poses problems for the append-heavy ingestion workloads that are common in Pravega. Each Event appended to a Stream would either require its own cache entry or need an expensive read-modify-write operation to be included in the Cache. To enable high-performance ingestion of […]

Segment Attributes

By Andrei Paduroiu, posted on November 21, 2019 in Storage Streaming Storage

The ability to pipeline Events to the Segment Store is a key technique that the Pravega Client uses to achieve high throughput, even when dealing with small writes. A Writer appends an Event to its corresponding Segment as soon as it is received, without waiting for previous ones to be acknowledged. To guarantee ordering and […]

Events Big or Small – Bring Them On

By Andrei Paduroiu, posted on April 22, 2019 in Storage Streaming Storage

Streaming applications typically need to process events as soon as they arrive. For example, reacting quickly to events in applications such as fraud detection or manufacturing error detection can result in massive savings. However, because storage systems are limited in their ability to handle large numbers of small writes, producers […]

Segment Store Internals

By Andrei Paduroiu, posted on March 7, 2019 in Storage Stream Processing

The Pravega Segment Store Service is a subsystem that lies at the heart of the entire Pravega deployment. It is the main access point for managing Stream Segments, providing the ability to modify and read their contents. The Pravega Client communicates with the Pravega Stream Controller to identify which Segments need to be used (for […]