@Agenta-AI

agenta

The open-source LLMOps platform

Agenta is an open-source LLMOps platform that helps developers and product teams build reliable LLM applications.

Agenta covers the entire LLM development lifecycle: prompt management, evaluation, and observability.

Features

Prompt Engineering and Management

Teams often struggle to collaborate on prompts. They keep prompts in code, where subject matter experts cannot edit them, or they track them in spreadsheets, which is an unreliable process.

Agenta organizes prompts for your team. Subject matter experts can collaborate with developers without touching the codebase. Developers can version prompts and deploy them to production.

The playground lets teams experiment with prompts. You can load traces and test sets. You can test prompts side by side.

Evaluation

Most teams lack a systematic evaluation process. They change prompts ad hoc, based on vibes. Because LLMs are stochastic, a change that improves quality on some cases can break others.

Agenta provides one place to evaluate systematically. Teams can run three types of evaluation:

  • Automatic evaluation with LLMs at scale before production
  • Human annotation where subject matter experts review results and provide feedback to AI engineers
  • Online evaluation for applications already in production

Both subject matter experts and engineers can run evaluations from the UI.
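
To make the idea of systematic evaluation concrete, here is a minimal sketch in plain Python. It does not use the Agenta SDK. It scores two prompt variants against the same test set and lists the cases that regress, which is the failure mode described above. The `EvalCase` type, the `exact_match` evaluator, and the stand-in functions `variant_a` and `variant_b` are all hypothetical; real variants would call an LLM, and the evaluator could be an LLM judge or a human annotation.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalCase:
    """One row of a test set (hypothetical shape)."""
    input: str
    expected: str


def exact_match(output: str, expected: str) -> float:
    """Hypothetical evaluator: 1.0 on a case-insensitive exact match, else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def evaluate(app: Callable[[str], str], test_set: List[EvalCase]) -> Dict[str, float]:
    """Run the app on every test case and return one score per input."""
    return {case.input: exact_match(app(case.input), case.expected) for case in test_set}


def regressions(baseline: Dict[str, float], candidate: Dict[str, float]) -> List[str]:
    """Inputs where the candidate scores lower than the baseline."""
    return [key for key in baseline if candidate[key] < baseline[key]]


# Stand-ins for two prompt variants; real ones would call an LLM.
def variant_a(text: str) -> str:
    return {"capital of France?": "Paris", "2 + 2?": "four"}.get(text, "")


def variant_b(text: str) -> str:
    return {"capital of France?": "paris, France", "2 + 2?": "4"}.get(text, "")


test_set = [EvalCase("capital of France?", "Paris"), EvalCase("2 + 2?", "4")]
scores_a = evaluate(variant_a, test_set)
scores_b = evaluate(variant_b, test_set)
mean_a = sum(scores_a.values()) / len(scores_a)
mean_b = sum(scores_b.values()) / len(scores_b)
regressed = regressions(scores_a, scores_b)
```

In this toy run both variants have the same average score, yet the second one breaks a case the first one handled. Per-case comparison on a fixed test set surfaces that; an aggregate number alone would not.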

Observability

Agenta helps you understand what happens in production. You can capture user feedback through an API (thumbs up or implicit signals). You can debug agents and applications with tracing to see what happens inside them.

Track costs over time. Find edge cases where things fail. Add those cases to your test sets. Have subject matter experts annotate the results.
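
The workflow above can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not Agenta's API: the `Trace` record, its fields, and the feedback encoding (1 for thumbs up, 0 for thumbs down) are hypothetical. It totals cost per day and turns traces with negative feedback into candidate test cases for annotation.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Trace:
    """Hypothetical production trace record."""
    day: str                        # ISO date, e.g. "2025-01-01"
    input: str
    output: str
    cost_usd: float
    feedback: Optional[int] = None  # 1 = thumbs up, 0 = thumbs down, None = no feedback


def cost_per_day(traces: List[Trace]) -> Dict[str, float]:
    """Sum the cost of all traces for each day."""
    totals: Dict[str, float] = defaultdict(float)
    for trace in traces:
        totals[trace.day] += trace.cost_usd
    return {day: round(total, 6) for day, total in totals.items()}


def failing_cases(traces: List[Trace]) -> List[Dict[str, str]]:
    """Collect traces with negative feedback as candidate test cases."""
    return [
        {"input": trace.input, "output": trace.output}
        for trace in traces
        if trace.feedback == 0
    ]


traces = [
    Trace("2025-01-01", "Summarize the contract", "Short summary", 0.002, feedback=1),
    Trace("2025-01-01", "Extract the salary", "Not found", 0.003, feedback=0),
    Trace("2025-01-02", "Translate to French", "Bonjour", 0.004),
]
daily_cost = cost_per_day(traces)
new_test_cases = failing_cases(traces)
```

The candidate cases carry the observed output but no expected answer; that is the part a subject matter expert would fill in during annotation.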

Popular repositories

  1. agenta Public

    The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

    Python · 3.4k stars · 396 forks

  2. job_extractor_template Public

    Template for an AI application that extracts job information from a job description using OpenAI functions and LangChain

    Python · 10 stars · 2 forks

  3. .github Public

    Agenta.ai public readme

    5 stars

  4. qa_llama_index_playground Public

    A question answering application built using LlamaIndex with a playground and evaluation in Agenta

    Python · 3 stars · 1 fork

  5. simple_prompt Public

    First example for an LLM application for deployment in the Agenta LLMOps Platform

    Python

  6. agenta-blog Public

    Blog posts from Agenta

Repositories

  • agenta Public

    Python · 3,393 stars · 396 forks · 50 issues (5 need help) · 22 pull requests · Updated Nov 20, 2025
  • .github Public

    5 stars · MIT license · 0 forks · 0 issues · 0 pull requests · Updated Nov 10, 2025
  • agenta-blog Public

    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Nov 2, 2025
  • simple_prompt Public

    Python · 0 stars · MIT license · 0 forks · 0 issues · 0 pull requests · Updated Mar 2, 2025
  • qa_llama_index_playground Public

    Python · 3 stars · 1 fork · 0 issues · 0 pull requests · Updated Apr 28, 2024
  • job_extractor_template Public

    Python · 10 stars · 2 forks · 0 issues · 0 pull requests · Updated Dec 21, 2023
