Add JSONL dataset format support by christso · Pull Request #131 · EntityProcess/agentv

christso · Jan 8, 2026

Summary

Add support for JSONL (JSON Lines) format as an alternative to YAML for evaluation datasets.

Why

Enables large-scale evaluation workflows following industry standards (DeepEval, LangWatch, Hugging Face, OpenAI):

Memory efficiency: Line-by-line processing for datasets with thousands of cases
Git-friendly diffs: Clear line-based changes vs nested YAML
Programmatic generation: Easy append operations
Tool compatibility: Works with standard JSONL tools (jq, grep)
Industry alignment: Follows established ML/AI framework patterns

Design

Pure JSONL: One eval case per line (no embedded metadata)
Sidecar YAML: Optional companion file for metadata and defaults
Override precedence: Per-line fields override sidecar defaults
Backward compatible: Existing YAML files unchanged

Add support for JSONL (JSON Lines) format as an alternative to YAML for evaluation datasets, following industry standards from DeepEval, LangWatch, Hugging Face, and OpenAI. Key features: - Pure JSONL files (one eval case per line) - Optional sidecar YAML for metadata and defaults - Per-case overrides for execution and evaluators - Same file reference resolution as YAML - Fully backward compatible with existing YAML files Benefits: - Memory efficient for large datasets - Git-friendly line-based diffs - Easy programmatic generation and appending - Compatible with standard JSONL tools Includes complete proposal, design doc, implementation tasks, and spec with 8 requirements and 27 scenarios.

Add JSONL (JSON Lines) format as an alternative to YAML for evaluation datasets, following industry standards from DeepEval, LangWatch, and Hugging Face. Key features: - Pure JSONL data format (one eval case per line) - Optional sidecar YAML metadata file for dataset defaults - Per-case overrides for execution, evaluators, and rubrics - Line-by-line parsing with clear error messages - Same validation and file reference resolution as YAML - Full backward compatibility with existing YAML files Benefits: - Streaming-friendly for large datasets - Git-friendly line-based diffs - Easy programmatic generation - Standard tool compatibility (jq, grep, etc.)

- Replace jsonl-format example with basic-jsonl (mirrors basic example) - Add file reference examples in JSONL format - Update eval-builder skill with JSONL format documentation

The eval command's path resolver was only accepting .yaml files, preventing users from running JSONL datasets directly. Updated regex patterns to accept both .yaml/.yml and .jsonl file extensions, and improved error message to mention JSONL support.

christso marked this pull request as draft January 8, 2026 23:59

christso marked this pull request as ready for review January 9, 2026 00:56

christso added the enhancement New feature or request label Jan 13, 2026

christso force-pushed the feat/add-jsonl-dataset-format branch from c677ea3 to 504a7dc Compare January 20, 2026 00:06

christso added 8 commits January 20, 2026 16:10

docs: add JSONL examples and update eval-builder skill

25b50d6

- Replace jsonl-format example with basic-jsonl (mirrors basic example) - Add file reference examples in JSONL format - Update eval-builder skill with JSONL format documentation

chore: update tasks.md with accurate Phase 5 items

b979d41

changeset

c6510ce

version 2.2.0

91e9e6c

archive speces

8c9e50d

christso force-pushed the feat/add-jsonl-dataset-format branch from e2b774e to 8c9e50d Compare January 20, 2026 05:12

fix version

5767cf9

christso merged commit 13bbf9d into main Jan 20, 2026

christso deleted the feat/add-jsonl-dataset-format branch January 20, 2026 05:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add JSONL dataset format support#131

Add JSONL dataset format support#131
christso merged 9 commits intomainEntityProcess/agentv:mainfrom
feat/add-jsonl-dataset-formatEntityProcess/agentv:feat/add-jsonl-dataset-formatCopy head branch name to clipboard

christso commented Jan 8, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Search code, repositories, users, issues, pull requests...

Comments

Conversation

christso commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Design

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

christso commented Jan 8, 2026 •

edited

Loading