Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

iAmGiG/gex-llm-patterns

Open more actions menu

Repository files navigation

GEX-LLM Pattern Analysis: LLM Structural Reasoning in Financial Markets

License: AGPL v3 Python 3.10+ Paper 1 Research

PostgreSQL Alpha Vantage


Overview

PhD research investigating whether Large Language Models can detect structural constraints in financial markets through genuine reasoning rather than training data memorization.

Core Innovation: Temporal obfuscation testing — stripping all dates, ticker symbols, and contextual markers from financial data, forcing LLMs to reason from numerical structure alone.

Application Domain: Options dealer gamma exposure (GEX) — the aggregate hedging constraints that market makers face due to their options inventory positions.


Key Results

Single-Day Detection (Paper 1)

Metric Result
Detection Rate 71.5% (obfuscated, unbiased prompts)
Predictive Accuracy 90.9% (forward returns)
Raw Chain Superiority 92.3% vs 61.5% GEX-assisted (+30.8pp)
Test Coverage 242 trading days (SPY, 2024)

Finding: LLMs reconstruct dealer positioning from raw strike-level data, outperforming pre-calculated metrics — empirical evidence that scalar GEX aggregation discards structural signal.

Multi-Day Regime Detection (Paper 2)

Metric Result
2024 Detection 81.2% (persistent regimes)
2020 Detection 12.1% (pre-0DTE baseline)
Discrimination 69.1pp separation (φ = 0.672, p < 0.0001)
False Positives 0% on transitional/low-magnitude controls
Coverage 1,412 windows + 809 controls (2020–2025)

Finding: Detection tracks 0DTE options adoption — 3.7% (2021) → 100% (2024) — with GEX magnitude growing 360%, revealing a structural market reorganization.

Detection ≠ Profitability

Stable detection (68–74% quarterly) persists while economic profitability collapses (Sharpe 1.8 → 0.1), confirming detected patterns are structural mechanics, not exploitable anomalies.


Research Papers

Paper 1: Single-Day Obfuscation Testing ✅ Published

Title: Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing

  • arXiv: 2512.17923
  • Venue: IEEE LLM-Finance 2025 (IEEE BigData, Macau)
  • LaTeX: docs/papers/paper1/
  • Status: Published, journal revision with 5 MC defenses complete

Paper 2: 30-Day Regime Detection

Title: Validating LLM Structural Reasoning: Detecting Persistent Market Regimes Through Temporal Obfuscation

IEEE Access: Combined Paper 1+2

Title: Validating LLM Structural Reasoning in Options Markets: From Single-Day Detection to Persistent Regime Identification

  • Format: IEEE Access journal (single-column)
  • LaTeX: docs/papers/ieee_access/
  • Status: 18 pages, sensitivity analysis complete

Future Directions

Research extensions not pursued within this repository (cross-asset generalization, intraday/per-strike analysis, GNN-based cross-asset hedging networks): docs/papers/extensions/.


Methodology

Obfuscation Testing

Raw:        SPY, 2024-03-15: Net GEX: -$32.9B, Flip: $485.00
Obfuscated: Day T+0, INDEX_1: Net GEX: -$32.9B, Flip: $485.00

Remove dates, tickers, events → preserve only quantitative structure → force structural reasoning.

WHO → WHOM → WHAT Causal Framework

Every detection must specify:

  • WHO: The constrained actor (e.g., dealers with negative gamma)
  • WHOM: The affected parties (e.g., directional traders)
  • WHAT: The forced mechanism (e.g., pro-cyclical hedging amplifying volatility)

Regime Classification (30-Day Windows)

Criterion Threshold Purpose
Persistence ≥ 70% days same sign Exceeds random binomial (~2.2σ)
Magnitude ≥ $5B average GEX
Stability ≤ 5 sign flips Sustained directional bias

Infrastructure

Database

  • PostgreSQL 18.1: 81.8M contracts, 50 symbols, 2020–2025 (20.58 GB)
  • Intraday snapshots: Yearly-partitioned table, 21 snapshots/day

Data Sources

  • Alpha Vantage Premium: Historical options chains (1000 calls/min)
  • Polygon.io: Stock price data (free tier)

LLM

  • OpenAI o4-mini: Reasoning model via Batch API
  • Cost: $11.07 for all 2,221 evaluations

Project Structure

gex-llm-patterns/
├── src/
│   ├── agents/              # LLM market mechanics agent
│   ├── analysis/            # Pattern library (15 patterns)
│   ├── gex/                 # GEX calculator (Black-Scholes)
│   ├── llm/                 # LLM integration
│   ├── validation/          # Obfuscation & regime classification
│   └── data_sources/        # Alpha Vantage, Polygon clients
├── scripts/
│   ├── validation/          # Paper 1 & 2 validation pipelines
│   ├── analysis/            # Sensitivity analysis, figures
│   └── data_collection/     # Intraday OI monitor
├── docs/
│   ├── papers/
│   │   ├── paper1/          # Single-day obfuscation (IEEE BigData 2025, published)
│   │   ├── paper2/          # Regime detection (multi-venue — AIAI accepted, JRFM under review)
│   │   ├── ieee_access/     # Combined journal paper (IEEE Access)
│   │   └── extensions/      # Forward-looking research directions (snapshot)
│   └── presentations/       # PhD symposium, fundamentals
├── reports/                  # Validation results (YAML)
└── config_defaults/          # Configuration templates

Getting Started

git clone https://github.com/iAmGiG/gex-llm-patterns.git
cd gex-llm-patterns
pip install -r requirements.txt

# Configure API keys
export OPENAI_API_KEY="your_key"
export POLYGON_API_KEY="your_key"

# Verify
python -c "from src.analysis.pattern_library import PatternLibrary; print('OK')"

Research Ethics

  • Academic research only — not trading advice
  • Public data only — Alpha Vantage options chains
  • Open source — methodology and code fully available
  • Not financial advice — detection rates do not imply profitable strategies

Citation

@inproceedings{regan2025obfuscation,
  author = {Regan, Christopher and Xie, Ying},
  title = {Inferring Latent Market Forces: Evaluating {LLM} Detection of
           Gamma Exposure Patterns via Obfuscation Testing},
  booktitle = {2nd IEEE International Workshop on Large Language Models
               for Finance, IEEE International Conference on Big Data},
  year = {2025},
  publisher = {IEEE},
  url = {https://arxiv.org/abs/2512.17923}
}

License

GNU Affero General Public License v3.0 — see LICENSE.


Last Updated: March 2026 Contact: Christopher Regan (cregan1@kennesaw.edu) · Ying Xie (yxie2@kennesaw.edu) Institution: Kennesaw State University, College of Computing and Software Engineering

Contributors

Languages

Morty Proxy This is a proxified and sanitized view of the page, visit original site.