
iitslamaa
Contributor

Goal

Add a baseline diff comparison to the existing eval viewer so users can compare two eval runs side by side without leaving the page.

What works in this draft

  • UI control to pick a baseline run in the viewer
  • Side-by-side panel scaffolding/wiring
  • Basic state routing for {runId, baselineId}
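
For reviewers, the {runId, baselineId} routing could look roughly like the sketch below. This is not the code in this PR: it assumes the two ids live in the URL query string under the hypothetical parameter names `runId` and `baselineId`, and the `ViewerState`, `parseViewerState`, and `serializeViewerState` names are made up for illustration. It also assumes that a baseline equal to the current run should be treated as "no baseline".

```typescript
// Hypothetical state shape; the PR only names the two fields.
interface ViewerState {
  runId: string | null;
  baselineId: string | null;
}

// Read the viewer state from a query string such as "?runId=a&baselineId=b".
function parseViewerState(search: string): ViewerState {
  const params = new URLSearchParams(search);
  const runId = params.get("runId") || null;
  const rawBaseline = params.get("baselineId") || null;
  // Assumption: a baseline needs a current run and must differ from it.
  const baselineId =
    runId !== null && rawBaseline !== runId ? rawBaseline : null;
  return { runId, baselineId };
}

// Write the viewer state back to a query string, omitting empty fields.
function serializeViewerState(state: ViewerState): string {
  const params = new URLSearchParams();
  if (state.runId) {
    params.set("runId", state.runId);
    if (state.baselineId && state.baselineId !== state.runId) {
      params.set("baselineId", state.baselineId);
    }
  }
  const query = params.toString();
  return query ? `?${query}` : "";
}
```

Keeping both ids in the URL would make a comparison shareable as a link, at the cost of having to validate them on every load.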

Still in progress

  • Data plumbing for diff model
  • Edge cases (missing rows/provider mismatches)
  • Unit tests + light e2e
  • Types/comments/cleanup
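
One possible shape for the diff model and its edge cases is sketched below. Everything here is an assumption made for discussion, not the design in this PR: the `EvalRow` fields, the `DiffStatus` values, and the `diffRuns` function are hypothetical, and rows are matched on provider plus test id so that a missing row or a provider present in only one run shows up as `added` or `removed` rather than being compared against the wrong row.

```typescript
// Hypothetical row shape; the real eval result type is not shown in this PR.
interface EvalRow {
  testId: string;
  provider: string;
  score: number;
}

type DiffStatus = "unchanged" | "improved" | "regressed" | "added" | "removed";

interface DiffRow {
  key: string;
  status: DiffStatus;
  current?: EvalRow;
  baseline?: EvalRow;
}

// Assumption: a row is identified by its provider and test id together.
const rowKey = (row: EvalRow): string => `${row.provider}::${row.testId}`;

function diffRuns(current: EvalRow[], baseline: EvalRow[]): DiffRow[] {
  const unmatched = new Map<string, EvalRow>();
  baseline.forEach((row) => unmatched.set(rowKey(row), row));

  const result: DiffRow[] = [];
  for (const row of current) {
    const key = rowKey(row);
    const base = unmatched.get(key);
    if (base === undefined) {
      // Present only in the current run (new test or new provider).
      result.push({ key, status: "added", current: row });
      continue;
    }
    unmatched.delete(key);
    const status: DiffStatus =
      row.score > base.score ? "improved"
      : row.score < base.score ? "regressed"
      : "unchanged";
    result.push({ key, status, current: row, baseline: base });
  }

  // Whatever is left exists only in the baseline run.
  unmatched.forEach((base, key) => {
    result.push({ key, status: "removed", baseline: base });
  });
  return result;
}
```

With this shape the side-by-side panels can render one `DiffRow` per line and leave a side blank when `current` or `baseline` is absent.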

Notes

Draft PR; feedback welcome.

@iitslamaa
Contributor Author

@mldangelo, sharing this early as a draft.
The core viewer wiring is in; I'm still finishing debugging and testing.
I'd love feedback on selector placement or component boundaries when you have a moment. Thanks!
