VortexSplit

Auto-segment PreFLMR's query() into profiled, exportable components and run retrieval through the split model.

Requirements

uv for dependency management
NVIDIA GPU + CUDA 11.8
Graphviz (dot on your PATH)

Install

uv sync
uv run python main.py --help

Data

Retrieval needs the EVQA (M2KR) text, passages, and query images. Fetch them with:

uv run python fetch_datasets.py

A prebuilt ColBERT index is expected under /data/EVQA/index (see the paths in main.py: INDEX_ROOT, EXPERIMENT, INDEX_NAME).

Workflow

HF_HUB_OFFLINE=1 uv run python main.py generate --batch 16 --out /dev/shm/flmr_split.tspart --coarse
HF_HUB_OFFLINE=1 uv run python main.py demo --artifact /dev/shm/flmr_split.tspart --batch 16
HF_HUB_OFFLINE=1 uv run python main.py draw --artifact /dev/shm/flmr_split.tspart --out flow.svg

Tests

uv run pytest
uv run pytest -m slow

Example

Identical results between monolith and partitioned

baseline

partitioned

Name	Name	Last commit message	Last commit date
Latest commit History 1 Commit 1 Commit
assets	assets
extern/FLMR	extern/FLMR
tests	tests
vortexsplit	vortexsplit
.gitignore	.gitignore
.python-version	.python-version
README.md	README.md
fetch_datasets.py	fetch_datasets.py
main.py	main.py
pyproject.toml	pyproject.toml
uv.lock	uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VortexSplit

Requirements

Install

Data

Workflow

Tests

Example

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Search code, repositories, users, issues, pull requests...

Folders and files

Latest commit

History

Repository files navigation

VortexSplit

Requirements

Install

Data

Workflow

Tests

Example

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages