Skip to content

[content-hash 1/5] refactor: record passage_id_scheme in meta.json#330

Open
raoabinav wants to merge 1 commit into
yichuan-w:mainfrom
raoabinav:refactor/passage-id-scheme-field
Open

[content-hash 1/5] refactor: record passage_id_scheme in meta.json#330
raoabinav wants to merge 1 commit into
yichuan-w:mainfrom
raoabinav:refactor/passage-id-scheme-field

Conversation

@raoabinav
Copy link
Copy Markdown
Contributor

@raoabinav raoabinav commented May 20, 2026

Sub-PR 1 of 5 from #329.

Purely additive. Writes a new passage_id_scheme: "sequential" field into the .meta.json produced by both build_index and build_index_from_arrays. Existing index loaders ignore the field, so this changes nothing for any caller.

Also bumps meta_data["version"] from "1.0" to "1.1". No code currently reads version, so the bump is safe; it's documentation of the schema evolution for future migration logic.

Two module-level constants (PASSAGE_ID_SCHEME_SEQUENTIAL, PASSAGE_ID_SCHEME_CONTENT_HASH) document the value space. The content-hash scheme itself lands in sub-PR 2.

Content-hash passage IDs train (#329)

Sub-PR 1 of 5 from the plan in yichuan-w#329. Purely additive — no behavior change
for any caller, existing index loaders ignore the field.

Writes a new `passage_id_scheme: "sequential"` field into the .meta.json
produced by both build_index and build_index_from_arrays. Bumps version
to "1.1" for human-inspectable schema tracking (no code reads version today,
so the bump is safe).

Module-level constants PASSAGE_ID_SCHEME_SEQUENTIAL / _CONTENT_HASH document
the value space; the content-hash scheme itself ships in sub-PR 2.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant