Home

GEX LLM Patterns

PhD Research: Validating LLM Understanding of Market Microstructure Through Obfuscation Testing

Overview

This repository contains a novel obfuscation testing methodology that validates whether Large Language Models truly understand financial market constraints or simply memorize patterns from training data.

Core Innovation: Strip all temporal context (dates, tickers, events) and force LLMs to reason purely from market structure.

Test Domain: Options market dealer constraints (gamma exposure hedging)

Key Finding: LLMs detect structural dealer constraints with 71.5% detection rate and 91.2% predictive accuracy without any temporal context.

Quick Navigation

📚 Core Concepts

Methodology - Obfuscation testing framework explained
Pattern Taxonomy - Three validated dealer constraint patterns
Key Results - Paper #1 findings (full 2024 validation)

🚀 Getting Started

Getting Started - Installation and quick start guide
API Reference - Key components and usage

🗺️ Research

RoadMap - Multi-paper research trajectory (Papers #1-4+)
Publications - Papers and presentations

Key Results (Paper #1)

Status: Submitted to IEEE LLM-Finance 2025 Workshop (Oct 26, 2025)

Metric	Result
Detection Rate	71.5% average across 3 patterns
Predictive Accuracy	91.2% (predictions materialize)
Sample Size	726 tests (242 trading days × 3 patterns)
Validation Period	Full 2024 (all quarters)

Critical Finding: Detection-Profitability Divergence

LLM detection remains stable (84-100%) even as profitability declines to zero (Q1→Q4 2024)
Proves LLM detects market structure, not profits
Validates methodology rejects temporal context leakage

What Makes This Research Novel?

1. Obfuscation Testing Framework

Problem: How do we know LLMs understand vs memorize?
Solution: Strip all temporal context, force reasoning from structure alone
Validation: Compare obfuscated vs non-obfuscated detection rates

2. WHO → WHOM → WHAT Framework

Explicit causal identification required
Not just "pattern exists" but "dealers are forced by regulation to hedge negative gamma"
Mechanistic understanding, not statistical anomalies

3. Multi-Pattern Generalization

Tested 3 different narrative framings of same underlying constraint
LLM correctly identifies identical mechanism across framings
Proves detection is structural, not pattern-matching specific keywords

Repository Structure

gex-llm-patterns/
├── src/                    # Core system components
│   ├── agents/            # MarketMechanicsAgent (LLM orchestration)
│   ├── gex/               # GEXCalculator (gamma exposure metrics)
│   ├── validation/        # OutcomeCalculator, PatternTaxonomy
│   └── data_sources/      # Historical data fetching
├── scripts/               # Validation and experiment scripts
│   ├── validation/        # Pattern taxonomy validation
│   └── orchestrate_experiment.py  # Main entry point
├── docs/                  # Comprehensive documentation
│   ├── papers/           # Paper #1 content, research roadmap
│   ├── guides/           # Conceptual guides and tutorials
│   └── presentations/    # Symposium and conference materials
└── reports/              # Validation results (YAML)
    └── validation/
        └── pattern_taxonomy/  # Full 2024 results

Quick Start

# Clone repository
git clone https://github.com/iAmGiG/gex-llm-patterns.git
cd gex-llm-patterns

# Install dependencies
pip install -r requirements.txt

# Set up environment
export PYTHONPATH=$(pwd):$PYTHONPATH
export OPENAI_API_KEY="your-key-here"

# Run validation on single pattern
python scripts/validation/validate_pattern_taxonomy.py \
  --pattern gamma_positioning \
  --symbol SPY \
  --start-date 2024-01-02 \
  --end-date 2024-03-29

See Getting Started for detailed setup instructions.

Publications

Paper #1 (Submitted Oct 2025):

"Validating Large Language Model Understanding of Market Microstructure Through Obfuscation Testing"
IEEE LLM-Finance 2025 Workshop @ IEEE BigData 2025
Full paper content

Presentations:

PhD Symposium 2025 (October 2025)
Research presentation at academic institution

Contributing

This is an academic research project. For questions or collaboration inquiries:

Open an issue
Review active research directions

License

AGPL-3.0 - See LICENSE

Last Updated: October 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Home

GEX LLM Patterns

Overview

Quick Navigation

📚 Core Concepts

🚀 Getting Started

🗺️ Research

Key Results (Paper #1)

What Makes This Research Novel?

1. Obfuscation Testing Framework

2. WHO → WHOM → WHAT Framework

3. Multi-Pattern Generalization

Repository Structure

Quick Start

Publications

Contributing

License

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally

Search code, repositories, users, issues, pull requests...

Home

GEX LLM Patterns

Overview

Quick Navigation

📚 Core Concepts

🚀 Getting Started

🗺️ Research

Key Results (Paper #1)

What Makes This Research Novel?

1. Obfuscation Testing Framework

2. WHO → WHOM → WHAT Framework

3. Multi-Pattern Generalization

Repository Structure

Quick Start

Publications

Contributing

License

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally