-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Comparing changes
Open a pull request
base repository: The-Pocket/PocketFlow-Tutorial-Codebase-Knowledge
base: main
head repository: gunslingerOP/PocketFlow-Tutorial-Codebase-Knowledge
compare: main
- 11 commits
- 14 files changed
- 2 contributors
Commits on Dec 6, 2025
-
add knowledge extraction mode and optimize Gemini 2.5 Pro performance
- Add bottom-up knowledge extraction pipeline (--mode knowledge) - Configure thinking_budget=1024 for Gemini 2.5 models (4 min vs 24-80 min) - Add JSON fallback parsing for LLM responses - Add checkpointing and batch utilities for large repos - Consolidate documentation into ARCHITECTURE.md, DEVELOPMENT.md, KNOWLEDGE_EXTRACTION.md 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for 6ccc91e - Browse repository at this point
Copy the full SHA 6ccc91eView commit details -
Configuration menu - View commit details
-
Copy full SHA for d69b672 - Browse repository at this point
Copy the full SHA d69b672View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2056671 - Browse repository at this point
Copy the full SHA 2056671View commit details
Commits on Dec 9, 2025
-
Refactor AggregateDataStructures with 3-pass repo-agnostic entity dis…
…covery Major improvements to entity aggregation in knowledge extraction mode: - Pass 1: Consolidate entities from component entities arrays (dedupe, merge fields, classify as core/supporting/integration/internal) - Pass 2: Extract infrastructure from entity storage fields + external_dependencies (finds actual table/bucket names) - Pass 3: Map service-to-service communication patterns Key changes: - Removed hardcoded patterns - now fully repo-agnostic - Entity classification by LLM (core domain vs external integrations) - Infrastructure extracted from entity storage fields containing actual resource names - Updated WriteMarkdownDocs to render domain entities with tier badges and separate integration models section - Updated documentation in KNOWLEDGE_EXTRACTION.md 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for d084a3f - Browse repository at this point
Copy the full SHA d084a3fView commit details -
Add LLM cleanup pass for agnostic table extraction
- Add _cleanup_extracted_tables() function for intelligent table filtering - Add _parse_llm_json_array() helper for parsing LLM array responses - Integrate LLM cleanup as Step 1.5 in AggregateEntities pipeline - Update KNOWLEDGE_EXTRACTION.md with comprehensive documentation The LLM cleanup pass is fully agnostic - no vendor names hardcoded: - Filters out queues, buckets, loggers, search indexes - Strips environment variable prefixes (process.env.xxx -> xxx) - Deduplicates table variants - Preserves parameterized names (${stage}-items) Example: 62 raw tables -> 40 clean database tables 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>Configuration menu - View commit details
-
Copy full SHA for d2c2dd3 - Browse repository at this point
Copy the full SHA d2c2dd3View commit details -
Make documentation generator codebase-agnostic
- Replace hardcoded "Lambda Functions" header with dynamic function_type - Replace "AWS Resources" with dynamic infrastructure_type - Replace "IAM Permissions" with generic "Permissions & Access Control" - Update infrastructure extraction prompt to support multi-cloud (Lambda, Cloud Functions, Azure Functions) - Make entity/table extraction examples generic (not DynamoDB-specific) - Update infrastructure categories to use dynamic naming based on detected DB type - Replace vendor-specific integration examples with generic categories The generator now works equally well on any codebase type (backend, frontend, CLI, library) without assuming specific cloud providers or vendors. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for 6f54408 - Browse repository at this point
Copy the full SHA 6f54408View commit details -
Make frontend extraction technology-agnostic
Remove hardcoded Redux/Zustand/Context mentions from frontend extraction prompts. The LLM now discovers whatever state management pattern the codebase actually uses instead of assuming specific libraries. Changes: - repo_instructions: Generic "any state management pattern" instead of library-specific examples - example_output: Generic "AppStore" instead of "Redux Store" 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for 7341b4d - Browse repository at this point
Copy the full SHA 7341b4dView commit details -
Document technology-agnostic design principle
- Add "Design Principle: Technology Agnostic" section to KNOWLEDGE_EXTRACTION.md - Document frontend/CLI/library extraction categories - Update README.md knowledge mode description to be generic - Clarify that infrastructure is auto-detected, not vendor-specific 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for 4de1889 - Browse repository at this point
Copy the full SHA 4de1889View commit details
Commits on Dec 15, 2025
-
fix: escape HTML tags in markdown generation for Docsify compatibility
Added escape_html_for_markdown() helper function that wraps HTML tags in backticks to prevent Docsify from interpreting them as actual HTML. Applied to all text content in _generate_component_chapter(): - Component descriptions and responsibilities - File summaries - Class/method/function purposes, params, returns, and logic - Workflow details and data flow formats Fixes rendering issues where tags like <style>, <defs>, <svg> would break Docsify's markdown parser. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for 25882bd - Browse repository at this point
Copy the full SHA 25882bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1aae709 - Browse repository at this point
Copy the full SHA 1aae709View commit details -
Configuration menu - View commit details
-
Copy full SHA for e7a0ea9 - Browse repository at this point
Copy the full SHA e7a0ea9View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...main