Skip to content

[Evals] Support full datasets in the Dev UI #2297

Open
@ssbushi

Description

@ssbushi

Overview

Support CRUD operations and management of full datasets (input, output, context, etc.) in the Dev UI. This will add 1st class support for all evaluation use cases (eg: prod traces) in the Dev UI.

This is a blocker for agent evals and supporting interrupts in evals.

User goal(s)

Create and manage full datasets in the Dev UI

Requirements

Acceptance Criteria

  • 1 Create full datasets
  • 2 Edit, Update, Delete examples, delete dataset
  • 3 Run evaluation (without inference) from the dataset

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions