Future AGI - open-source LLM evals & observability
End-to-end platform for evaluating, observing, and improving LLM and agent apps. Tracing, evals, simulations, datasets, and prompt management in one project.
Tag
1 entry tagged with #llm-eval.
End-to-end platform for evaluating, observing, and improving LLM and agent apps. Tracing, evals, simulations, datasets, and prompt management in one project.