improvement: implement TOON format for AI prompts to reduce token consumption by breaking-brake · Pull Request #297 · breaking-brake/cc-wf-studio

breaking-brake · Dec 20, 2025

Problem

AI prompt construction for workflow operations currently uses freetext format with markdown sections, which consumes unnecessary tokens due to:

Current Behavior

Markdown formatting symbols (**, -, numbered lists) add overhead
Redundant section headers and separators
JSON data embedded as formatted strings
❌ Higher token consumption (e.g., Refinement: 6,252 tokens)

Expected Behavior

Structured format (TOON) for prompt construction
Reduced redundancy through array compression and hierarchical indentation
Direct object encoding without quotes and brackets
✅ Lower token consumption (e.g., Refinement: 5,793 tokens)

Solution

Implemented TOON (Token-Oriented Object Notation) format for all AI prompts using the official @toon-format/toon library:

Key Changes

Refinement Prompts: 7.3% token reduction (459 tokens saved per request)
Workflow Name Generation: Structured prompt with TOON encoding
Slack Description Generation: Structured prompt with TOON encoding
Locale Support: Added responseLocale field for multilingual responses
Simplified Implementation: Removed unnecessary abstractions and configurations

TOON Format Benefits

Array compression: nodes[3]: id,type,name (CSV-style)
No quotes needed for strings
Indentation-based hierarchy (YAML-like)
Direct library usage (no custom implementation)

Changes

New Files

File: src/extension/services/refinement-prompt-builder.ts (118 lines)

Builds TOON-formatted prompts for workflow refinement
Encodes all structured data (currentWorkflow, conversationHistory, guidelines)
Uses @toon-format/toon official library

File: src/extension/services/workflow-name-prompt-builder.ts (41 lines)

TOON prompt builder for workflow name generation
Simple structure with requirements array

File: src/extension/services/slack-description-prompt-builder.ts (40 lines)

TOON prompt builder for Slack description generation
Optimized for brief descriptions (100-150 chars)

Modified Files

File: src/extension/services/refinement-service.ts

Replaced 130+ lines of inline prompt construction with RefinementPromptBuilder
Added promptFormat: 'toon' to all logs and metrics
Simplified prompt construction logic

File: src/extension/commands/workflow-name-generation.ts

Integrated WorkflowNamePromptBuilder
Removed inline prompt construction (20 lines reduced)
Added promptFormat logging

File: src/extension/commands/slack-description-generation.ts

Integrated SlackDescriptionPromptBuilder
Removed inline prompt construction (20 lines reduced)
Added promptFormat logging

File: src/extension/services/ai-metrics-service.ts

Removed getConfiguredPromptFormat() (always use TOON)
Removed unused PromptFormat import

File: src/shared/types/ai-metrics.ts

Added PromptFormat type: 'freetext' | 'json' | 'toon'
Added promptFormat field to AIGenerationMetrics interface

File: package.json

Added dependency: @toon-format/toon: ^1.4.0
Removed cc-wf-studio.ai.promptFormat configuration (TOON is now always used)
Updated cc-wf-studio.ai.collectMetrics description

Impact

Token Reduction

Refinement Prompt: 7.3% reduction (6,252 → 5,793 tokens, 459 tokens saved)
Expected for Name/Description: Similar 7-10% reduction (not yet measured)

Cost Savings (Estimated)

Individual user (50 refinements/month): ~$0.07/month, ~$0.84/year
Team usage (500 refinements/month): ~$0.70/month, ~$8.40/year

User Experience

✅ No breaking changes to user-facing features
✅ Multilingual support maintained via responseLocale
✅ All existing workflows continue to work
✅ No configuration required (TOON always enabled)

Code Quality

✅ Removed ~228 lines of unnecessary code
✅ Eliminated custom TOON implementation
✅ Removed unused abstractions (PromptBuilderBase, toon-encoder.ts)
✅ Simplified language instruction logic

Testing

Manual E2E testing completed (Refinement prompt verified with 7.3% reduction)
Code quality checks passed (npm run format, npm run lint, npm run check)
Build successful (npm run build)
Locale support verified (responseLocale field added)

Notes

Why 7.3% instead of expected 28-35%?

The original plan estimated 28-35% reduction for fixed text portions only. In practice:

Fixed text (role, task, guidelines): ~20-30% reduction ✅
Variable data (currentWorkflow, conversationHistory): ~56% of total prompt
TOON encoding applies to entire structured object, but variable content limits compression

The 7.3% overall reduction is realistic given the large proportion of variable JSON data.

Design Simplifications Made

Removed LANGUAGE_INSTRUCTIONS: Using responseLocale only
Removed configuration options: TOON is always enabled (simpler UX)
Direct library usage: No custom wrapper (toon-encoder.ts)
No abstract base class: Each builder is self-contained

Future Optimization Opportunities

Dynamic schema filtering (only send relevant node types)
Workflow data compression for large workflows
Conversation history summarization

- Install @toon-format/toon official library for TOON encoding - Add RefinementPromptBuilder with TOON format support - Add locale support (responseLocale) to prompts for multilingual responses - Simplify prompt construction by fixing format to TOON (7.3% token reduction) - Add promptFormat to metrics collection for monitoring - Remove promptFormat configuration (always use TOON) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

- Add WorkflowNamePromptBuilder with TOON format support - Add SlackDescriptionPromptBuilder with TOON format support - Simplify by using responseLocale only (remove LANGUAGE_INSTRUCTIONS) - Add promptFormat to logs for monitoring

## [3.8.4](v3.8.3...v3.8.4) (2025-12-20) ### Improvements * add tool execution loading animation in AI chat ([#298](#298)) ([5615efe](5615efe)) * AI editing chat enhancements with allowed tools UI and strict tool restrictions ([#299](#299)) ([d72f241](d72f241)) * implement TOON format for AI prompts to reduce token consumption ([#297](#297)) ([64fe228](64fe228))

github-actions · Dec 20, 2025

🎉 This PR is included in version 3.8.4 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

breaking-brake and others added 2 commits December 20, 2025 12:43

breaking-brake merged commit 64fe228 into main Dec 20, 2025
3 checks passed

breaking-brake deleted the improvement/prompt-structure-optimization branch December 20, 2025 04:03

breaking-brake mentioned this pull request Dec 20, 2025

Release: v3.8.4 #301

Merged

github-actions Bot added the released label Dec 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improvement: implement TOON format for AI prompts to reduce token consumption#297

breaking-brake commented Dec 20, 2025

Uh oh!

Uh oh!

github-actions Bot commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Search code, repositories, users, issues, pull requests...

Conversation

breaking-brake commented Dec 20, 2025

Problem

Current Behavior

Expected Behavior

Solution

Key Changes

TOON Format Benefits

Changes

New Files

Modified Files

Impact

Token Reduction

Cost Savings (Estimated)

User Experience

Code Quality

Testing

Notes

Why 7.3% instead of expected 28-35%?

Design Simplifications Made

Future Optimization Opportunities

Uh oh!

Uh oh!

github-actions Bot commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant