contents
The Evidently API is evolving — and it’s getting better! Be among the first to explore the new features and improvements.
We are updating the Evidently API to make it simpler, more flexible, and easier to use. As we focus more on LLM-related evaluations, we’re also working to futureproof the library for these workflows.
These changes affect both the open-source library and the way how you programmatically interact with Evidently Cloud. We invite everyone to try out the new API and share feedback to help us refine the experience!
Starting today, you can access the updated API by importing it from evidently.future
.
Explore the updated documentation and examples to dive in.
New to Evidently? Welcome! Start with the new Quickstart Guides tailored for LLM or ML use cases.
Already a user? Be sure to read the “What Changed” section below to understand what’s changing and ensure a smooth transition.
These changes are paving the way for the Evidently 1.0 release, where the new API will become the default. The current API will remain available for a while, giving you time to adapt.
Here’s the breakdown of what’s new and improved.
The Metric object in Evidently used to be pretty complex. As the library evolved, Metrics grew organically and acted like widgets, bundling multiple calculations and visualizations together. Parsing Metric results, even for our own dashboards, was tricky because each one had its own unique JSON structure.
Now, the Metrics are much simpler. Each one has a fixed structure, outputs a single computation result, and lets you specify the visualization type directly as a parameter.
No need to worry about Presets: they are here to stay. You can still use pre-built templates that combine various Metrics for common evaluation scenarios. For instance, Data Drift Preset is still available under the same name.
What’s better now?
Other quality-of-life improvements are in the works, such as an option to rename “reference” and “current” datasets.
Reports and Test Suites are now merged into a single object.
If you’ve ever wondered whether to compute a Report or a Test Suite — wonder no more. You can now handle both at once. Each still has its distinct role:
What used to be a Test Suite is now an optional extension of Reports. If you choose it, both types of outputs will appear in different tabs of the same HTML file. This way, you don’t have to save two different files and switch between them.
You can still use the auto-generated Test conditions or define them yourself. It’s also now possible to set up relative conditions — like checking if values fall within +/-10% (or 20, or 30…) from the reference.
If you’ve been using the Evidently Dashboard to organize Reports and track evaluation results over time, the latest updates improve this experience.
Simpler API for Dashboard Panel setup. The dashboards-as-code feature used to be a bit tricky when it came to configuring monitoring Panels. Since now each Metric’s output is standardized, you no longer have to deal with complex paths to point to specific results for plotting.
Custom Metrics support. The Evidently UI fully supports custom Metrics. Even if you implemented a custom render for your Metric using Plotly, you’ll now be able to see the stored Report when you save it to your workspace — whether self-hosted or cloud. That’s a big deal!
Text descriptors are row-level text evaluations — anything from basic checks like text length to LLM-driven evals, such as determining whether an output contradicts its source.
With the increasing focus on LLM-related metrics, we’ve updated the text descriptors API to make it more logical and easier to use.
Descriptor computation is now split into two steps:
This change is especially useful when you want to perform multiple Tests or aggregations (no need to recompute the results!) or simply get the dataset with scores without the Report.
We know these updates introduce some breaking changes — we don’t make them lightly! A lot of this comes directly from the feedback we’ve gotten from the users.
We implement these updates to make the library more maintainable and pave the way for new features. Our goal is less confusion, less repetition, and more functionality — all in one place.
Check out the documentation preview and examples today, and let us know what you think! If you get stuck or have questions, drop by our Discord — we’re here to help! We’d love to hear your feedback! 🚀