Fix tool used error message not proper escaped in MLChatAgentRunner #4410

mingshl · 2025-11-10T03:25:04Z

Description

Fixes JSON parsing errors in ML agent error handling by properly escaping special characters in exception messages using StringUtils.processTextDoc().

What is the problem?

When ML agents encounter tool execution failures with complex error messages (containing newlines, quotes, etc.), the unescaped exception text breaks JSON structure in agent responses, causing parsing failures like:

Expected BEGIN_ARRAY but was STRING at line 1 column 1 path $

How does this PR solve it?

Wraps e.getMessage() with StringUtils.processTextDoc() in MLChatAgentRunner.java line 628
Ensures exception messages are properly escaped before inclusion in JSON responses
Prevents JSON structure corruption from special characters in error messages
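
For illustration, the failure mode and the fix can be sketched with the standard library alone. This is a minimal sketch, not the code in this PR: `escapeForJsonString` is a hypothetical stand-in for `StringUtils.processTextDoc()` (whose exact output may differ), and the `substitute` helper and the template string are assumptions made up for the example rather than the real interaction template.

```java
import java.util.Map;

public class ToolErrorEscapingSketch {

    // Hypothetical stand-in for the escaping step. It is NOT the real
    // StringUtils.processTextDoc(); it only escapes the characters that
    // would otherwise terminate or corrupt a JSON string literal.
    static String escapeForJsonString(String text) {
        if (text == null) {
            return "";
        }
        StringBuilder sb = new StringBuilder(text.length() + 16);
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters become 4-digit hex escapes.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Hypothetical template substitution: replaces ${key} placeholders with
    // the raw values, without any escaping of its own.
    static String substitute(String template, Map<String, String> values) {
        String result = template;
        for (Map.Entry<String, String> entry : values.entrySet()) {
            result = result.replace("${" + entry.getKey() + "}", entry.getValue());
        }
        return result;
    }

    // Builds a tool-failure response from an assumed JSON template.
    static String buildToolResponse(String toolCallId, String action, String message, boolean escape) {
        String template = "{\"tool_call_id\":\"${tool_call_id}\",\"content\":\"${tool_response}\"}";
        String safeMessage = escape ? escapeForJsonString(message) : message;
        return substitute(
            template,
            Map.of("tool_call_id", toolCallId, "tool_response", "Tool " + action + " failed: " + safeMessage)
        );
    }

    public static void main(String[] args) {
        String message = "Expected \"BEGIN_ARRAY\"\nbut was STRING";
        // Without escaping, the raw quotes and newline end up inside the JSON string.
        System.out.println(buildToolResponse("call_1", "SearchTool", message, false));
        // With escaping, the message stays inside a single valid string literal.
        System.out.println(buildToolResponse("call_1", "SearchTool", message, true));
    }
}
```

In the unescaped case the quotes inside the exception message close the `content` string early, so a downstream JSON parser sees malformed input. The escaped variant keeps the whole message inside one string value.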

Testing

Added unit tests in MLChatAgentRunnerTest.java to verify:

Complex exception messages with quotes and newlines are properly escaped
Gson parsing error messages are handled correctly
Normal error messages continue to work as expected

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Mingshi Liu <[email protected]>

akolarkunnu · 2025-11-10T14:18:13Z

...lgorithms/src/test/java/org/opensearch/ml/engine/algorithms/agent/MLChatAgentRunnerTest.java

        Assert.assertTrue(result.containsKey(AgentUtils.TOOL_RESPONSE));
    }
+
+    @Test


These tests are better suited to the StringUtilsTest class. They only exercise StringUtils.processTextDoc() with different inputs and have nothing to do with the APIs in MLChatAgentRunner.

akolarkunnu · 2025-11-10T14:23:09Z

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/MLChatAgentRunner.java

+                                        TOOL_CALL_ID,
+                                        toolCallId,
+                                        "tool_response",
+                                        "Tool " + action + " failed: " + StringUtils.processTextDoc(e.getMessage())


The StringUtils class name is not required here. A static import of processTextDoc already exists: "import static org.opensearch.ml.common.utils.StringUtils.processTextDoc;"

ylwu-amzn · 2025-11-10T20:47:21Z

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/MLChatAgentRunner.java

                            substitute(
                                tmpParameters.get(INTERACTION_TEMPLATE_TOOL_RESPONSE),
-                                Map.of(TOOL_CALL_ID, toolCallId, "tool_response", "Tool " + action + " failed: " + e.getMessage()),
+                                Map


Do we need to fix other agent runners?

rithin-pullela-aws · 2025-11-10T23:36:26Z

Can you rebase with current main?
I believe JP's latest commit fixes this error

[Incubating] Problems report is available at: file:///__w/ml-commons/ml-commons/build/reports/problems/problems-report.html
FAILURE: Build failed with an exception.

* What went wrong:
Could not determine the dependencies of task ':opensearch-ml-plugin:dependencyLicenses'.
> Failed to query the value of task ':opensearch-ml-plugin:dependencyLicenses' property 'dependencies'.

   > Could not resolve all dependencies for configuration ':opensearch-ml-plugin:runtimeClasspath'.
      > Could not resolve software.amazon.awssdk:kms:2.32.29.
        Required by:
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > software.amazon.awssdk:bom:2.32.29
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > org.opensearch:opensearch-remote-metadata-sdk-ddb-client:3.4.0.0-SNAPSHOT:20251106.022320-9
         > Conflict found for module 'software.amazon.awssdk:kms': between versions 2.32.29 and 2.26.3
      > Could not resolve software.amazon.awssdk:dynamodb:2.32.29.
        Required by:
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > software.amazon.awssdk:bom:2.32.29
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > software.amazon.awssdk:bom:2.32.29 > software.amazon.awssdk:dynamodb-enhanced:2.32.29
Deprecated Gradle features were used in this build, making it incompatible with Gradle 9.0.

You can use '--warning-mode all' to show the individual deprecation warnings and determine if they come from your own scripts or plugins.

For more on this, please refer to https://docs.gradle.org/8.14.3/userguide/command_line_interface.html#sec:command_line_warnings in the Gradle documentation.
         > Conflict found for module 'software.amazon.awssdk:dynamodb': between versions 2.32.29 and 2.26.3
      > Could not resolve org.dafny:DafnyRuntime:4.9.0.
        Required by:
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > org.opensearch:opensearch-remote-metadata-sdk-ddb-client:3.4.0.0-SNAPSHOT:20251106.022320-9 > software.amazon.cryptography:aws-database-encryption-sdk-dynamodb:3.9.0
            project :opensearch-ml-plugin > project :opensearch-ml-algorithms > org.opensearch:opensearch-remote-metadata-sdk-ddb-client:3.4.0.0-SNAPSHOT:20251106.022320-9 > software.amazon.cryptography:aws-cryptographic-material-providers:1.11.0
         > Conflict found for module 'org.dafny:DafnyRuntime': between versions 4.9.0 and 4.1.0
> There are 5 more failures with identical causes.

fix error message not proper escape

d1404a1

Signed-off-by: Mingshi Liu <[email protected]>

mingshl requested review from HenryL27, Zhangxunmt, austintlee, b4sjoo, dhrubo-os, jngz-es, model-collapse, pyek-bot, rbhavna, sam-herman, xinyual, ylwu-amzn and zane-neo as code owners November 10, 2025 03:25

mingshl had a problem deploying to ml-commons-cicd-env November 10, 2025 03:27 — with GitHub Actions Failure

mingshl had a problem deploying to ml-commons-cicd-env November 10, 2025 03:27 — with GitHub Actions Error

mingshl had a problem deploying to ml-commons-cicd-env November 10, 2025 03:27 — with GitHub Actions Failure

pyek-bot approved these changes Nov 10, 2025

akolarkunnu reviewed Nov 10, 2025

ylwu-amzn reviewed Nov 10, 2025

5 participants
