Agents component: Dashboard playground UI #78

Draft: wants to merge 1 commit into base: main

@ghandic commented Apr 29, 2025

Following on from this discussion by @ianmacartney, I'm looking to add dashboard functionality that gives more traceability into agent performance, similar to LangSmith and comparable tools.

I couldn't find any precedent for whole-feature drops in this repo, so please let me know how you'd like me to approach this to make code review easiest. Dropping a whole feature at once isn't ideal, but a half-baked one isn't great either.

Current plan for first PR:

  • Show a setup screen for users who aren't using the component yet, linking to the docs on how to get set up.
  • Show a paginated table of threads; clicking a row opens a sheet with that thread's steps, metadata, and messages.
    • Show the input context.
    • Show items in order top to bottom, with step order left to right.
    • Show a graph based on parent ID (maybe only if non-linear?).
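
To make the "only if non-linear?" question concrete, here is a rough sketch of how steps might be grouped by parent ID and checked for branching. The `Step` shape, its field names, and the helper names are all hypothetical and are not taken from the component's schema.

```typescript
// Hypothetical step shape; these field names are assumptions for illustration.
type Step = {
  id: string;
  parentId: string | null; // null for the first step in a thread
  order: number; // position used to sort sibling steps
};

const ROOT = "__root__"; // bucket key for steps that have no parent

// Group steps by parent ID so each parent's children can be drawn as branches.
function groupByParent(steps: Step[]): Record<string, Step[]> {
  const groups: Record<string, Step[]> = {};
  for (const step of steps) {
    const key = step.parentId ?? ROOT;
    const bucket = groups[key] || [];
    bucket.push(step);
    groups[key] = bucket;
  }
  for (const key of Object.keys(groups)) {
    const bucket = groups[key];
    if (bucket) bucket.sort((a, b) => a.order - b.order);
  }
  return groups;
}

// A thread is linear when no parent (including the root slot) has more than one child.
function isLinear(steps: Step[]): boolean {
  const groups = groupByParent(steps);
  return Object.keys(groups).every((key) => (groups[key] || []).length <= 1);
}

// Sample data: one straight chain and one thread where step "a" has two children.
const linearThread: Step[] = [
  { id: "a", parentId: null, order: 0 },
  { id: "b", parentId: "a", order: 1 },
  { id: "c", parentId: "b", order: 2 },
];
const branchingThread: Step[] = [
  { id: "a", parentId: null, order: 0 },
  { id: "c", parentId: "a", order: 2 },
  { id: "b", parentId: "a", order: 1 },
];

// The sheet could render a plain list when linear and a graph otherwise.
const showGraphForLinear = !isLinear(linearThread);
const showGraphForBranching = !isLinear(branchingThread);
```

With something like this, the sheet could fall back to the simple top-to-bottom list for linear threads and only render the graph when a parent has multiple children.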

Future work (credits: @ianmacartney)

  • Visualize tool calling graphs.
  • Inspect and replay past conversations while tuning prompts.
  • Interactively debug failed generations.
  • Search messages while tuning search parameters for your use case.
  • Replay failed steps and export evals to prevent regressions once you get it working.
  • Allow re-running with different input context
  • Allow re-running sub-steps? How would we find the tool to call?
  • Interactive playground
  • Log search history results

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
