feat: Support LangChain orchestration client streaming #711

Open
wants to merge 61 commits into main

Conversation

KavithaSiva
Contributor

@KavithaSiva KavithaSiva commented May 7, 2025

Context

Closes SAP/ai-sdk-js-backlog#259.

What this PR does and why it is needed

This PR introduces streaming support for the LangChain orchestration client.
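
For context, a minimal consumption sketch of the new streaming API, pieced together from the test snippets quoted later in this thread; the configuration shape shown here is an assumption, not the exact one used in the PR:

import type { AIMessageChunk } from '@langchain/core/messages';
import { OrchestrationClient } from '@sap-ai-sdk/langchain';

async function main(): Promise<void> {
  // Hypothetical orchestration configuration for illustration only.
  const client = new OrchestrationClient({
    llm: { model_name: 'gpt-4o' },
    templating: {
      template: [{ role: 'user', content: 'Tell me a short story.' }]
    }
  });

  // stream() yields AIMessageChunk objects that can be concatenated into one message.
  const stream = await client.stream([]);

  let finalOutput: AIMessageChunk | undefined;
  for await (const chunk of stream) {
    finalOutput = finalOutput ? finalOutput.concat(chunk) : chunk;
  }
  console.log(finalOutput?.content);
}

main().catch(console.error);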

@KavithaSiva KavithaSiva marked this pull request as ready for review May 12, 2025 07:57
@KavithaSiva
Contributor Author

KavithaSiva commented May 12, 2025

Adding more tests in a follow-up PR.

Contributor

@ZhongpinWang ZhongpinWang left a comment

Suggestions for improvement. Also, please implement the tests already in this PR to avoid follow-up changes in case they don't work as expected. 😄

Contributor

@ZhongpinWang ZhongpinWang left a comment

I see refactoring is needed for the big _streamResponseChunks function. It would make the code more readable and testable.

@ZhongpinWang
Contributor

I did another round of review. Other than some naming issues, the biggest (potential) issue is that the multi-choice logic is not implemented. This could be a problem in the future, and since it is already properly handled in the vanilla orchestration client, consider implementing it correctly in LangChain as well.

I want to hold off on implementing multi-choice support for now, as the orchestration service currently does not support multiple choices for streaming; it throws a 400. Moreover, in LangChain each streamed AIMessageChunk is supposed to hold the content of only one choice, so there would be multiple chunks for one streamed chunk of the orchestration service. From what I understood, the consumer is supposed to group chunks together based on the newTokenIndices.completion value.

I would prefer to do this in a separate BLI once the orchestration service starts supporting it, not in this PR.
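
To illustrate the grouping idea, a rough consumer-side sketch, assuming the token indices are forwarded as the second argument of the standard LangChain handleLLMNewToken callback (as the tests below check with { prompt: 0, completion: 0 }):

import type { OrchestrationClient } from '@sap-ai-sdk/langchain';

async function streamGroupedByChoice(client: OrchestrationClient): Promise<string[]> {
  // Collect streamed tokens per choice, keyed by newTokenIndices.completion.
  const tokensByChoice = new Map<number, string[]>();

  const stream = await client.stream([], {
    callbacks: [
      {
        handleLLMNewToken(token, idx) {
          // idx is { prompt, completion }; completion identifies the choice.
          const tokens = tokensByChoice.get(idx.completion) ?? [];
          tokens.push(token);
          tokensByChoice.set(idx.completion, tokens);
        }
      }
    ]
  });

  // Drain the stream so the callback fires for every chunk.
  for await (const _chunk of stream) {
    // no-op
  }

  return [...tokensByChoice.values()].map(tokens => tokens.join(''));
}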

It is normal to have multiple AIMessageChunks generated from one LLM chunk in the response. Each AIMessageChunk is supposed to contain only one choice along with its index information.

I don't have a strong opinion on whether we should already support n > 2 now since, as you say, we get a 400 from orchestration in that case. If we implement it now, the effort is small: add a forEach and stop using the default index 0 everywhere. I'll leave it to you to decide.
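
For concreteness, a rough sketch of what "adding a forEach and not using the default 0" could look like when mapping one streamed orchestration chunk to LangChain generation chunks; the StreamedChoice shape is hypothetical and not the actual SDK type:

import { AIMessageChunk } from '@langchain/core/messages';
import { ChatGenerationChunk } from '@langchain/core/outputs';

// Hypothetical shape of one streamed orchestration choice (illustration only).
interface StreamedChoice {
  index: number;
  delta: { content?: string };
}

// One orchestration chunk with n choices maps to n ChatGenerationChunks,
// each carrying its own completion index instead of the hardcoded 0.
function toGenerationChunks(choices: StreamedChoice[]): ChatGenerationChunk[] {
  return choices.map(
    choice =>
      new ChatGenerationChunk({
        text: choice.delta.content ?? '',
        message: new AIMessageChunk({ content: choice.delta.content ?? '' }),
        generationInfo: { prompt: 0, completion: choice.index }
      })
  );
}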

@KavithaSiva
Contributor Author

KavithaSiva commented May 20, 2025

I don't have a strong opinion on whether we should already support n > 2 now since, as you say, we get a 400 from orchestration in that case. If we implement it now, the effort is small: add a forEach and stop using the default index 0 everywhere. I'll leave it to you to decide.

I talked with Christoph, and he said multiple choices for orchestration streaming are not planned in the near future (Q2 or Q3).
You are right, the code change is easy, but testing it would mean adding imaginary responses for now.
I would feel more comfortable adding this once the service supports it too.

Contributor

@ZhongpinWang ZhongpinWang left a comment

Reviewed some tests. Please pay attention to code quality: if logic can be simplified into an equivalent form, always do that. The test scope should also be narrowed to what is needed; using resilience, for example, to test all of the streaming logic is unnecessary.

Comment on lines 167 to 168
let iterations = 0;
const maxIterations = 2;
Contributor

[req] Why add this complicated logic? Is there really a big performance difference in the test? We normally don't care about that anyway since it is just a test, unless it really takes many seconds more. Also, iterating through all the chunks may cover some other edge cases.

Contributor Author

The idea was to avoid comparing the entire streamed text, as it's big.
But I don't have a strong opinion, so I will switch to comparing the entire text instead.

Comment on lines 173 to 175
intermediateChunk = !intermediateChunk
? chunk
: intermediateChunk.concat(chunk);
Contributor

[req] Just concat all chunk content and compare the output, for the sake of code simplicity.

I also generally have the feeling that sometimes the code works even though the logic is not clean. Is the above code not equivalent to

intermediateChunk = intermediateChunk
        ? intermediateChunk.concat(chunk)
        : chunk;

And aren't we just checking the first chunk with content? Can't we check the content instead of using the maxIterations logic? (But still just loop over all chunks and compare to a snapshot.)
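
A possible shape for the simplified test (config comes from the test file quoted in this thread; the snapshot assertion is just one way to compare the full text):

it('streams the full response content', async () => {
  // Set up the streaming response mock here (e.g. the mockInference() helper discussed below).
  const client = new OrchestrationClient(config);
  const stream = await client.stream([]);

  // Concatenate every chunk's text content and compare the result once.
  const chunks: string[] = [];
  for await (const chunk of stream) {
    chunks.push(typeof chunk.content === 'string' ? chunk.content : '');
  }

  expect(chunks.join('')).toMatchSnapshot();
});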

@@ -129,4 +158,93 @@ describe('orchestration service client', () => {
'Request failed with status code 400'
);
}, 1000);

it('supports streaming responses', async () => {
mockStreamInferenceWithResilience();
Contributor

@ZhongpinWang ZhongpinWang May 20, 2025

[req] Why are we mocking responses with resilience? Why is resilience needed in this and the other streaming tests? Can we not just call mockInference()?

Contributor

I agree, maybe having one such test is enough (so we're aware of the behaviour).

Comment on lines 189 to 199
const client = new OrchestrationClient(config);
const controller = new AbortController();
const { signal } = controller;
const stream = await client.stream([], { signal });
const streamPromise = async () => {
for await (const _chunk of stream) {
controller.abort();
}
};

await expect(streamPromise()).rejects.toThrow();
Contributor

[pp] Also check the content of the thrown error.
[pp] Rename streamPromise; it is a function, not a promise.
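
A possible shape for the revised test incorporating both points; the expected error message is a placeholder, since it depends on what the client actually throws on abort:

it('aborts the stream when the signal fires', async () => {
  const client = new OrchestrationClient(config);
  const controller = new AbortController();
  const { signal } = controller;
  const stream = await client.stream([], { signal });

  // Renamed: this is a function that consumes the stream, not a promise.
  const consumeStream = async () => {
    for await (const _chunk of stream) {
      controller.abort();
    }
  };

  // Placeholder assertion on the error content; use the real abort message here.
  await expect(consumeStream()).rejects.toThrow(/abort/i);
});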


let finalOutput: AIMessageChunk | undefined;
for await (const chunk of stream) {
finalOutput = !finalOutput ? chunk : finalOutput.concat(chunk);
Contributor

Suggested change
finalOutput = !finalOutput ? chunk : finalOutput.concat(chunk);
finalOutput = finalOutput ? finalOutput.concat(chunk) : chunk;

Contributor

@ZhongpinWang ZhongpinWang May 20, 2025

But again, just concat the tool call JSON strings together and compare that. We iterate through all chunks anyway. It is worth the effort to get the content from each chunk and make sure it always works for all chunks.
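
For illustration, what concatenating the streamed tool call JSON could look like in the test; tool_call_chunks is the standard LangChain field on AIMessageChunk whose args arrive as string fragments, and the config name is a placeholder:

it('streams tool call arguments', async () => {
  // Set up the tool-call streaming mock here.
  const client = new OrchestrationClient(configWithTools);
  const stream = await client.stream([]);

  // Concatenate the streamed tool call argument fragments and compare once.
  let toolCallArgs = '';
  for await (const chunk of stream) {
    for (const toolCallChunk of chunk.tool_call_chunks ?? []) {
      toolCallArgs += toolCallChunk.args ?? '';
    }
  }

  expect(toolCallArgs).toMatchSnapshot();
});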

Contributor

@deekshas8 deekshas8 left a comment

I'm still reviewing, but this is how far I got today. I will take another look later.

// First chunk content is empty
expect(firstCallArgs[0]).toEqual('');
// Second argument should be the token indices
expect(firstCallArgs[1]).toEqual({ prompt: 0, completion: 0 });
Contributor

[q] I do not understand this, honestly. The mock handler function here doesn't seem to take any arguments. Where do they come from?
[q] Why check expect(chunks.length).toBeGreaterThan(0)? Won't it just be one? What is the point of checking this value even with respect to callbacks?

Contributor Author

First question: even though the callback function doesn't take arguments here, when the following snippet is executed in the _streamResponseChunks function, these arguments get passed to the callback for each chunk.

await runManager?.handleLLMNewToken(
  content,
  tokenIndices,
  undefined,
  undefined,
  undefined,
  { chunk: generationChunk }
);

Second question: You are right, we don't need to check this value here.
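
For reference, this is roughly how the test can capture those callback arguments with a jest mock handler; the surrounding config and mocks are the ones from the test file quoted above:

it('passes content and token indices to handleLLMNewToken', async () => {
  const handleLLMNewToken = jest.fn();
  const client = new OrchestrationClient(config);

  const stream = await client.stream([], {
    callbacks: [{ handleLLMNewToken }]
  });

  // Drain the stream so the callback fires for every chunk.
  for await (const _chunk of stream) {
    // no-op
  }

  // The first two arguments are the chunk content and the token indices.
  const [content, tokenIndices] = handleLLMNewToken.mock.calls[0];
  expect(content).toEqual('');
  expect(tokenIndices).toEqual({ prompt: 0, completion: 0 });
});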

generationInfo: { ...tokenIndices }
});

// Notify the run manager about the new token; some parameters are undefined, as they are implicitly read from the context.
Contributor

[req] Can we document this somewhere for the future? I wouldn't know why we set some values to undefined unless I read the LangChain code.

Contributor Author

But where would you want to document it? It's not something useful for the user; it's something we should probably remember while enabling other LangChain modules.

* @internal
*/
// eslint-disable-next-line @typescript-eslint/no-unused-vars
export function computeTokenIndices(chunk: OrchestrationStreamChunkResponse): {
Contributor

[q] I didn't get the point of this function if it isn't supported in its entirety. How is this helpful right now?

Contributor Author

@KavithaSiva KavithaSiva May 20, 2025

Token indices need to be set in each generated chunk, as they provide information about which prompt (in a multi-prompt scenario) and which choice (in a multi-choice scenario) the chunk belongs to.

Although we hardcode these values now, leaving the function as-is makes it easy to support multi-choice scenarios in the future.
I would also like to have separate util functions for calculating everything that is needed in the chunk.
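
For context, a sketch of what the hardcoded helper amounts to today, pieced together from the snippets in this thread; the import path and exact return type are assumptions:

import type { OrchestrationStreamChunkResponse } from '@sap-ai-sdk/orchestration';

/**
 * Compute the prompt and completion (choice) indices for a streamed chunk.
 * @internal
 */
// eslint-disable-next-line @typescript-eslint/no-unused-vars
export function computeTokenIndices(chunk: OrchestrationStreamChunkResponse): {
  prompt: number;
  completion: number;
} {
  // Hardcoded for now: orchestration streaming currently supports a single prompt
  // and a single choice. Once multi-choice streaming is available, the completion
  // index should be derived from the chunk's choice index instead.
  return { prompt: 0, completion: 0 };
}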

Contributor

OK, is this in a ticket somewhere? I would add a TODO in the comments, but also mention it in a ticket for the future.
