Pravega Flink connector 102

By Yumin Zhou. Posted on November 2, 2023 in Uncategorized

If you missed my previous post, Pravega Flink connector 101, I strongly recommend you take the time to read that one first. It introduced how the Flink DataStream API reads from and writes to Pravega streams, which lays the necessary foundation for the topics we’ll cover in this post. To briefly recap the last […]

By Andrei Paduroiu. Posted on April 1, 2020 in Storage Streaming Storage Technology

Traditional cache solutions treat each entry as an immutable blob of data, which poses problems for the append-heavy ingestion workloads that are common in Pravega. Each Event appended to a Stream would either require its own cache entry or need an expensive read-modify-write operation to be included in the Cache. To enable high-performance ingestion of […]

By Andrei Paduroiu. Posted on November 21, 2019 in Storage Streaming Storage

The ability to pipeline Events to the Segment Store is a key technique that the Pravega Client uses to achieve high throughput, even when dealing with small writes. A Writer appends an Event to its corresponding Segment as soon as it is received, without waiting for previous ones to be acknowledged. To guarantee ordering and […]

By Tom Kaitchuck. Posted on November 8, 2019 in News/Updates Releases Stream Processing Watermarking

Pravega Watermarking Support, by Tom Kaitchuck and Flavio Junqueira. Motivation: Stream processing broadly refers to the ability to ingest data from unbounded sources and to process such data as it is ingested. The data can be user-generated, as in social networks or other online applications, or machine-generated, as in server telemetry or sensor samples from IoT and […]

By Andrei Paduroiu. Posted on March 7, 2019 in Storage Stream Processing

The Pravega Segment Store Service is a subsystem that lies at the heart of the entire Pravega deployment. It is the main access point for managing Stream Segments, providing the ability to modify and read their contents. The Pravega Client communicates with the Pravega Stream Controller to identify which Segments need to be used (for […]

By Tom Kaitchuck. Posted on February 15, 2019 in Best Practices

Pravega allows state to be shared consistently across multiple cooperating processes distributed in a cluster using a State Synchronizer. This blog post details how to use the State Synchronizer [1] to build and maintain consistency in a distributed application. State Synchronizer: In distributed systems, state frequently needs to be shared across multiple instances […]

Pravega Flink connector 102

Yet Another Cache but for the Streaming World

Segment Attributes

Pravega Watermarking Support

Segment Store Internals

Exploring State Synchronizer