Product Update - Online Evaluation

Jul 17, 2024

This month we have three exciting updates to share with you!

The online evaluation feature has been revamped:

The flow to create an online evaluation rule has been reworked.
You can now run online evaluation on runs (agents, chains, workflows), in addition to generations.
Online evaluation automation rules now support tag creation in addition to score creation.
Rule parameters are now editable.

There are now three roles: Admin, AI Engineer and Domain Expert. We plan to add more based on your feedback.

For user feedback or annotations, we now track the user who created the score.
The dashboard has been improved overall, with an additional plot for tracking usage per agent or chain type.
There is now a default project when onboarding to Literal AI.
Improved SDK: New version of the TypeScript SDK.
New integration with Mistral AI.
Bulk actions to create datasets from generations.

We changed the data model and migrated how steps are stored. This unlocks simpler step management and a better experience for both users and developers.
Better UX for multi-modal inputs and outputs.

Try it here!

Ship AI with confidence

Create an account instantly to get started or contact us to self host Literal AI for your business.
