fix(plpgsql): handle multibyte diagnostic offsets by psteinroe · Pull Request #737 · supabase-community/postgres-language-server

psteinroe · 2026-05-18T06:16:22Z

Prevent PL/pgSQL diagnostics from panicking when source text contains UTF-8 multibyte characters before a reported query span.

The diagnostic mapper now keeps internal source ranges as UTF-8 byte offsets while explicitly converting plpgsql_check's one-based character positions at the boundary. Query lookup also iterates on character boundaries instead of byte-by-byte slices, avoiding invalid string indexing inside characters like umlauts.

Regression Coverage

Added focused tests for umlauts in comments before a query and in string literals before a reported query position.

Fixes #735

fix(plpgsql): handle multibyte diagnostic offsets

62595d6

psteinroe merged commit 0fa637d into main May 18, 2026
9 checks passed

psteinroe deleted the fix/umalute branch May 18, 2026 06:56

BrewTestBot mentioned this pull request May 18, 2026

postgres-language-server 0.25.0 Homebrew/homebrew-core#283445

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(plpgsql): handle multibyte diagnostic offsets#737

fix(plpgsql): handle multibyte diagnostic offsets#737
psteinroe merged 1 commit into
mainfrom
fix/umalute

psteinroe commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

psteinroe commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant