@0fism 0fism commented Oct 8, 2025

  • Add QUERY_BINDING, QUERY_MODEL, QUERY_BINDING_HOST, QUERY_BINDING_API_KEY env vars
  • Create separate LLM function for query operations
  • Support all LLM providers (openai, azure_openai, ollama, lollms, aws_bedrock)
  • Enable cost optimization: powerful model for queries, economical for extraction

Description

This PR adds support for separate LLM configurations for query/retrieval operations versus entity extraction, enabling cost optimization: a powerful model handles queries while extraction stays on an economical model.
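As a rough sketch of how the new variables could interact with the existing extraction settings, the snippet below reads both configurations from the environment, falling back to the extraction values when no query-specific override is set. The `QUERY_*` names come from this PR; the `LLM_*` names and the exact fallback behavior are assumptions for illustration, not the confirmed implementation.

```python
import os


def resolve_llm_configs():
    """Resolve extraction and query LLM settings from the environment.

    Query settings fall back to the extraction settings when the
    corresponding QUERY_* variable is unset (assumed behavior).
    """
    extraction = {
        "binding": os.getenv("LLM_BINDING", "openai"),
        "model": os.getenv("LLM_MODEL", "gpt-4o-mini"),
        "host": os.getenv("LLM_BINDING_HOST", ""),
        "api_key": os.getenv("LLM_BINDING_API_KEY", ""),
    }
    query = {
        "binding": os.getenv("QUERY_BINDING", extraction["binding"]),
        "model": os.getenv("QUERY_MODEL", extraction["model"]),
        "host": os.getenv("QUERY_BINDING_HOST", extraction["host"]),
        "api_key": os.getenv("QUERY_BINDING_API_KEY", extraction["api_key"]),
    }
    return extraction, query
```

For example, setting only `QUERY_MODEL=gpt-4o` would keep the query binding, host, and key identical to the extraction configuration while swapping in the stronger model for queries.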

Related Issues

Addresses the need for cost optimization in LightRAG deployments where users want to use different LLM models for different operations (extraction vs. query).
closes #1382

Changes Made

  • Added query-specific environment variables in lightrag/api/config.py: QUERY_BINDING, QUERY_MODEL, QUERY_BINDING_HOST, QUERY_BINDING_API_KEY
  • Created create_query_llm_func() in lightrag/api/lightrag_server.py to initialize query-specific LLM with support for all providers (openai, azure_openai, ollama, lollms, aws_bedrock)
  • Enhanced condition logic to create separate query LLM when either binding OR model differs from extraction LLM
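The condition logic described above can be sketched as follows. The function names and dict shape are illustrative; the real `create_query_llm_func()` lives in `lightrag/api/lightrag_server.py`, and only the rule "create a separate query LLM when either binding or model differs" is taken from the PR itself.

```python
# Providers named in the PR description.
SUPPORTED_BINDINGS = {"openai", "azure_openai", "ollama", "lollms", "aws_bedrock"}


def needs_separate_query_llm(extraction: dict, query: dict) -> bool:
    """A separate query LLM is created when either the binding OR the
    model differs from the extraction configuration (per the PR)."""
    return (
        query["binding"] != extraction["binding"]
        or query["model"] != extraction["model"]
    )


def create_query_llm_func(query: dict):
    """Hypothetical sketch of the provider dispatch: validate the binding,
    then return a completion function bound to the query model/host/key."""
    if query["binding"] not in SUPPORTED_BINDINGS:
        raise ValueError(f"Unsupported query binding: {query['binding']}")

    def query_llm(prompt: str) -> str:
        # Placeholder: a real implementation would call the provider's
        # client with query["model"], query["host"], query["api_key"].
        return f"[{query['binding']}/{query['model']}] {prompt}"

    return query_llm
```

Note that comparing the model as well as the binding matters: two OpenAI-compatible endpoints serving different models would otherwise incorrectly share one LLM function.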

Checklist

  • Changes tested locally
  • [x] Code reviewed
  • Documentation updated (if necessary)
  • Unit tests added (if applicable)


@0fism 0fism force-pushed the feature/query-specific-llm branch from 1b01848 to 88b8ba6 Compare October 10, 2025 14:19

0fism commented Oct 14, 2025

git commit --amend --no-edit
trim trailing whitespace.................................................Passed
fix end of files.........................................................Passed
fix requirements.txt.................................(no files to check)Skipped
ruff-format..............................................................Passed
ruff.....................................................................Passed

@0fism 0fism force-pushed the feature/query-specific-llm branch from 88b8ba6 to 4c81706 Compare October 16, 2025 04:02