[README]:consistencyModel #346

vaibhav5140 · 2025-08-12T15:41:00Z

Description of change

[README Update]
Follow-Up PR of (#338) to include Consistency Model documentation of AAL

Does this contribution need a changelog entry?

I have updated the CHANGELOG or README if appropriate

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and I agree to the terms of the Developer Certificate of Origin (DCO).

fuatbasik

Hi @vaibhav5140, I put some comments.

fuatbasik · 2025-08-18T09:48:33Z

README.md


+## Consistency Model
+
+Analytics Accelerator Library for Amazon S3 implements a time-based consistency model using TTL (Time-To-Live) for metadata caching. This ensures bounded staleness while maintaining performance. The library caches object metadata with a configurable TTL period. When metadata TTL expires, the library performs a conditional HEAD request to S3 to verify if the object has changed. If the object is unchanged, the TTL period is reset. If the object has changed, the cache is updated with new metadata.


we can drop conditional from the sentence.

fuatbasik · 2025-08-18T09:50:40Z

README.md

+
+Within-Stream Consistency: Each stream maintains strict consistency throughout its lifetime by locking to a specific object version. If cached blocks are evicted and the object has changed in S3, subsequent reads will fail with 412 Precondition Failed errors rather than serving inconsistent data.
+
+Cross-Stream Consistency: Streams created within the same TTL window see the same object version. When an S3 object is modified, existing streams continue using their original version, while new streams will see the updated version after TTL expiration. This can result in different streams temporarily accessing different versions of the same object, but the maximum staleness is bounded by the configured TTL duration.


I think you can drop the sentence This can result ..... It might confuse to reader.

fuatbasik · 2025-08-18T09:53:41Z

README.md

+
+Cross-Stream Consistency: Streams created within the same TTL window see the same object version. When an S3 object is modified, existing streams continue using their original version, while new streams will see the updated version after TTL expiration. This can result in different streams temporarily accessing different versions of the same object, but the maximum staleness is bounded by the configured TTL duration.
+
+The metadata cache may evict entries when reaching its size limit (5000 entries by default), triggering fresh metadata fetching on the next access without affecting existing streams consistency.


We might want to add a sentence to describe the reasoning behind having a size-limit in addition. Something like, To avoid an ever-growing metdata cache, when TTL is set to a very large number, we also use a threshold to limit number of metadata entries kept in memory.. Next, you should also tell what is the eviction strategy when this threshold is reached.

fuatbasik · 2025-08-18T09:55:49Z

README.md

+* `metadatastore.capacity` - Maximum size for metadata cache entries, default is 5000
+
+TTL = 0 (Strong Consistency):
+Setting `metadata.ttl.default=0` provides the strongest consistency guarantees. Every metadata access triggers a fresh HEAD request to S3. This configuration results in higher latency due to increased S3 requests but is recommended when absolute freshness is required.


i think saying provides Strong Consistency is enough. Since Strong Consistency is a technical term, i am not sure if we can use stronger here.

fuatbasik · 2025-08-18T09:56:19Z

README.md

+TTL = 0 (Strong Consistency):
+Setting `metadata.ttl.default=0` provides the strongest consistency guarantees. Every metadata access triggers a fresh HEAD request to S3. This configuration results in higher latency due to increased S3 requests but is recommended when absolute freshness is required.
+
+High TTL Values (Relaxed Consistency):


you mean eventual consistency. We are not talking about Relaxed consistency models here.

fuatbasik · 2025-08-18T09:56:39Z

README.md

+Setting `metadata.ttl.default=0` provides the strongest consistency guarantees. Every metadata access triggers a fresh HEAD request to S3. This configuration results in higher latency due to increased S3 requests but is recommended when absolute freshness is required.
+
+High TTL Values (Relaxed Consistency):
+Setting a high TTL value (e.g., `metadata.ttl.default` for 1 hour) allows for longer caching of metadata. This configuration offers better performance due to reduced S3 requests and is suitable for static data or when staleness can be tolerated.


reduced HEAD requests?

REAdME:consistencyModel

7102377

vaibhav5140 temporarily deployed to integration-tests August 12, 2025 15:41 — with GitHub Actions Inactive

fuatbasik requested changes Aug 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[README]:consistencyModel #346

[README]:consistencyModel #346

Uh oh!

vaibhav5140 commented Aug 12, 2025

Uh oh!

fuatbasik left a comment

Uh oh!

fuatbasik Aug 18, 2025

Uh oh!

fuatbasik Aug 18, 2025

Uh oh!

fuatbasik Aug 18, 2025

Uh oh!

fuatbasik Aug 18, 2025

Uh oh!

fuatbasik Aug 18, 2025 •

edited

Loading

Uh oh!

fuatbasik Aug 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		## Consistency Model

		Analytics Accelerator Library for Amazon S3 implements a time-based consistency model using TTL (Time-To-Live) for metadata caching. This ensures bounded staleness while maintaining performance. The library caches object metadata with a configurable TTL period. When metadata TTL expires, the library performs a conditional HEAD request to S3 to verify if the object has changed. If the object is unchanged, the TTL period is reset. If the object has changed, the cache is updated with new metadata.


		Within-Stream Consistency: Each stream maintains strict consistency throughout its lifetime by locking to a specific object version. If cached blocks are evicted and the object has changed in S3, subsequent reads will fail with 412 Precondition Failed errors rather than serving inconsistent data.

		Cross-Stream Consistency: Streams created within the same TTL window see the same object version. When an S3 object is modified, existing streams continue using their original version, while new streams will see the updated version after TTL expiration. This can result in different streams temporarily accessing different versions of the same object, but the maximum staleness is bounded by the configured TTL duration.


		Cross-Stream Consistency: Streams created within the same TTL window see the same object version. When an S3 object is modified, existing streams continue using their original version, while new streams will see the updated version after TTL expiration. This can result in different streams temporarily accessing different versions of the same object, but the maximum staleness is bounded by the configured TTL duration.

		The metadata cache may evict entries when reaching its size limit (5000 entries by default), triggering fresh metadata fetching on the next access without affecting existing streams consistency.

[README]:consistencyModel #346

Are you sure you want to change the base?

[README]:consistencyModel #346

Uh oh!

Conversation

vaibhav5140 commented Aug 12, 2025

Description of change

Does this contribution need a changelog entry?

Uh oh!

fuatbasik left a comment

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fuatbasik Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fuatbasik Aug 18, 2025 •

edited

Loading