Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
83 changes: 0 additions & 83 deletions .claude/agents/changelog-editor.md

This file was deleted.

11 changes: 5 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,11 @@
<p align="center">
<a href="https://agenta.ai?utm_source=github&utm_medium=referral&utm_campaign=readme">
<picture >
<source width="275" media="(prefers-color-scheme: dark)" srcset="https://github.com/user-attachments/assets/97e31bfc-b1fc-4d19-b443-5aedf6029017" >
<source width="275" media="(prefers-color-scheme: light)" srcset="https://github.com/user-attachments/assets/fdc5f23f-2095-4cfc-9511-14c6851c1262" >
<source width="275" media="(prefers-color-scheme: dark)" srcset="https://github.com/Agenta-AI/agenta/assets/4510758/cdddf5ad-2352-4920-b1d9-ae7f8d9d7735" >
<source width="275" media="(prefers-color-scheme: light)" srcset="https://github.com/Agenta-AI/agenta/assets/4510758/ab75cbac-b807-496f-aab3-57463a33f726" >
<img alt="Shows the logo of agenta" src="https://github.com/Agenta-AI/agenta/assets/4510758/68e055d4-d7b8-4943-992f-761558c64253" >
</picture>
</a>

<div align="center">
<strong> <h1> The Open-source LLMOps Platform </h1></strong>
Build reliable LLM applications faster with integrated prompt management, evaluation, and observability.
Expand Down Expand Up @@ -84,15 +83,15 @@ Agenta is a platform for building production-grade LLM applications. It helps **
Collaborate with Subject Matter Experts (SMEs) on prompt engineering and make sure nothing breaks in production.

- **Interactive Playground**: Compare prompts side by side against your test cases
- **Multi-Model Support**: Experiment with 50+ LLM models or [bring-your-own models](https://docs.agenta.ai/prompt-engineering/playground/custom-providers?utm_source=github&utm_medium=referral&utm_campaign=readme)
- **Multi-Model Support**: Experiment with 50+ LLM models or [bring-your-own models](https://docs.agenta.ai/prompt-engineering/playground/adding-custom-providers?utm_source=github&utm_medium=referral&utm_campaign=readme)
- **Version Control**: Version prompts and configurations with branching and environments
- **Complex Configurations**: Enable SMEs to collaborate on [complex configuration schemas](https://docs.agenta.ai/custom-workflows/overview?utm_source=github&utm_medium=referral&utm_campaign=readme) beyond simple prompts

[Explore prompt management →](https://docs.agenta.ai/prompt-engineering/concepts?utm_source=github&utm_medium=referral&utm_campaign=readme)
[Explore prompt management →](https://docs.agenta.ai/prompt-engineering/overview?utm_source=github&utm_medium=referral&utm_campaign=readme)

### 📊 Evaluation & Testing
Evaluate your LLM applications systematically with both human and automated feedback.
- **Flexible Testsets**: Create testcases from production data, playground experiments, or upload CSVs
- **Flexible Test Sets**: Create test cases from production data, playground experiments, or upload CSVs
- **Pre-built and Custom Evaluators**: Use LLM-as-judge, one of our 20+ pre-built evaluators, or you custom evaluators
- **UI and API Access**: Run evaluations via UI (for SMEs) or programmatically (for engineers)
- **Human Feedback Integration**: Collect and incorporate expert annotations
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@
EvaluatorConfigDB,
HumanEvaluationDB,
HumanEvaluationScenarioDB,
TestsetDB,
TestSetDB,
)


Expand All @@ -38,7 +38,7 @@
EvaluatorConfigDB, # have workspace_id
HumanEvaluationDB, # have workspace_id
HumanEvaluationScenarioDB, # have workspace_id
TestsetDB, # have workspace_id
TestSetDB, # have workspace_id
]


Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@
)
from oss.src.dbs.postgres.git.dao import GitDAO
from oss.src.core.testcases.service import TestcasesService
from oss.src.models.deprecated_models import DeprecatedTestsetDB
from oss.src.models.deprecated_models import DeprecatedTestSetDB
from oss.src.core.testsets.service import TestsetsService, SimpleTestsetsService


Expand Down Expand Up @@ -81,8 +81,8 @@ async def migration_old_testsets_to_new_testsets(
# Count total rows with a non-null project_id
total_query = (
select(func.count())
.select_from(DeprecatedTestsetDB)
.filter(DeprecatedTestsetDB.project_id.isnot(None))
.select_from(DeprecatedTestSetDB)
.filter(DeprecatedTestSetDB.project_id.isnot(None))
)
result = await connection.execute(total_query)
total_rows = result.scalar()
Expand All @@ -98,8 +98,8 @@ async def migration_old_testsets_to_new_testsets(
while offset < total_testsets:
# STEP 1: Fetch evaluator configurations with non-null project_id
result = await connection.execute(
select(DeprecatedTestsetDB)
.filter(DeprecatedTestsetDB.project_id.isnot(None))
select(DeprecatedTestSetDB)
.filter(DeprecatedTestSetDB.project_id.isnot(None))
.offset(offset)
.limit(DEFAULT_BATCH_SIZE)
)
Expand Down

This file was deleted.

This file was deleted.

9 changes: 4 additions & 5 deletions api/ee/docker/Dockerfile.dev
Original file line number Diff line number Diff line change
Expand Up @@ -18,13 +18,12 @@ RUN pip install --upgrade pip \
COPY ./ee /app/ee/
COPY ./oss /app/oss/
COPY ./entrypoint.py ./pyproject.toml /app/
#

RUN poetry config virtualenvs.create false \
&& poetry install --no-interaction --no-ansi
#
# && pip install -e /sdk/

ENV PYTHONPATH=/sdk:$PYTHONPATH
# ENV PYTHONPATH=/sdk:$PYTHONPATH

COPY ./ee/src/crons/meters.sh /meters.sh
COPY ./ee/src/crons/meters.txt /etc/cron.d/meters-cron
Expand All @@ -34,8 +33,8 @@ RUN cat -A /etc/cron.d/meters-cron
RUN chmod +x /meters.sh \
&& chmod 0644 /etc/cron.d/meters-cron

COPY ./oss/src/crons/queries.sh /queries.sh
COPY ./oss/src/crons/queries.txt /etc/cron.d/queries-cron
COPY ./ee/src/crons/queries.sh /queries.sh
COPY ./ee/src/crons/queries.txt /etc/cron.d/queries-cron
RUN sed -i -e '$a\' /etc/cron.d/queries-cron
RUN cat -A /etc/cron.d/queries-cron

Expand Down
9 changes: 4 additions & 5 deletions api/ee/docker/Dockerfile.gh
Original file line number Diff line number Diff line change
Expand Up @@ -18,11 +18,10 @@ RUN pip install --upgrade pip \
COPY ./ee /app/ee/
COPY ./oss /app/oss/
COPY ./entrypoint.py ./pyproject.toml /app/
COPY ./sdk /sdk/

RUN poetry config virtualenvs.create false \
&& poetry install --no-interaction --no-ansi \
&& pip install --force-reinstall --upgrade /sdk/
&& poetry install --no-interaction --no-ansi
#

#

Expand All @@ -34,8 +33,8 @@ RUN cat -A /etc/cron.d/meters-cron
RUN chmod +x /meters.sh \
&& chmod 0644 /etc/cron.d/meters-cron

COPY ./oss/src/crons/queries.sh /queries.sh
COPY ./oss/src/crons/queries.txt /etc/cron.d/queries-cron
COPY ./ee/src/crons/queries.sh /queries.sh
COPY ./ee/src/crons/queries.txt /etc/cron.d/queries-cron
RUN sed -i -e '$a\' /etc/cron.d/queries-cron
RUN cat -A /etc/cron.d/queries-cron

Expand Down
Loading
Loading