Auto-segment PreFLMR's query() into profiled, exportable components and run retrieval through the split model.
- uv for dependency management
- NVIDIA GPU + CUDA 11.8
- Graphviz (
doton yourPATH)
uv sync
uv run python main.py --helpRetrieval needs the EVQA (M2KR) text, passages, and query images. Fetch them with:
uv run python fetch_datasets.pyA prebuilt ColBERT index is expected under /data/EVQA/index (see the paths in
main.py: INDEX_ROOT, EXPERIMENT, INDEX_NAME).
HF_HUB_OFFLINE=1 uv run python main.py generate --batch 16 --out /dev/shm/flmr_split.tspart --coarse
HF_HUB_OFFLINE=1 uv run python main.py demo --artifact /dev/shm/flmr_split.tspart --batch 16
HF_HUB_OFFLINE=1 uv run python main.py draw --artifact /dev/shm/flmr_split.tspart --out flow.svguv run pytest
uv run pytest -m slowIdentical results between monolith and partitioned

