Add LLM usage and retry tracking for indexing stage #2104

june616 · 2025-10-16T17:44:03Z

Description

Add comprehensive LLM usage and retry tracking for the indexing stage, providing complete performance observability.

Related Issues

[Feature Request]: Enhance LLM usage logging in indexing workflows #2103

Proposed Changes

Data Structure Extensions

Added to PipelineRunStats:
- total_llm_retries: Total retry attempts across all workflows
- llm_usage_by_workflow[workflow]["retries"]: Per-workflow retry count

Context Injection Mechanism

Added inject_llm_context() helper function
Centralized context injection in run_pipeline.py
Propagated through ModelManager to all LLM models

Retry Tracking

Added _record_retries() common method to Retry base class
All retry strategies (Exponential, Native, Random, Incremental) record uniformly
Used finally blocks to ensure both successful and failed retries are tracked

Enhanced Logging

Output LLM usage (including retries) after each workflow
Output total statistics after pipeline completion
Added exception logging for context injection failures

Sample output in stats.json:

{
  "total_llm_calls": 20,
  "total_prompt_tokens": 104652,
  "total_completion_tokens": 9691,
  "total_llm_retries": 8,
  "llm_usage_by_workflow": {
    "extract_graph": {
      "llm_calls": 5,
      "prompt_tokens": 66766,
      "completion_tokens": 5757,
      "retries": 6
    }
  }
}

Checklist

I've validated the functionality with end-to-end indexing runs.

I have tested these changes locally.
I have reviewed the code changes.
I have updated the documentation (if necessary).
[N/A] I have added appropriate unit tests (if applicable).

Note: Both Linux and Windows smoke tests are failing with the same root cause: "ValidationError: API Key is required for chat when using api_key authentication". My changes do not affect configuration validation or authentication logic.

Additional Notes

[Add any additional notes or context that may be helpful for the reviewer(s).]

june616 · 2025-10-16T19:01:33Z

@microsoft-github-policy-service agree company="Microsoft"

Add LLM usage and retry tracking for indexing stage

e4fe69e

june616 requested a review from a team as a code owner October 16, 2025 17:44

June Feng added 3 commits October 16, 2025 10:47

chore: Add semversioner changeset for LLM tracking feature

3021b3d

code cleanup and fixing lint errors

5f143a4

fix hasattr access with linting requirements

e720edd

june616 mentioned this pull request Oct 16, 2025

[Feature Request]: Enhance LLM usage logging in indexing workflows #2103

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add LLM usage and retry tracking for indexing stage #2104

Add LLM usage and retry tracking for indexing stage #2104

Uh oh!

june616 commented Oct 16, 2025 •

edited

Loading

Uh oh!

june616 commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add LLM usage and retry tracking for indexing stage #2104

Are you sure you want to change the base?

Add LLM usage and retry tracking for indexing stage #2104

Uh oh!

Conversation

june616 commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Proposed Changes

Checklist

Additional Notes

Uh oh!

june616 commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

june616 commented Oct 16, 2025 •

edited

Loading