Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

RuipingL/Situat3DChange

Open more actions menu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Situat3DChange

Situat3DChange is a 3D visual-language benchmark designed to assess multimodal large language models (MLLMs) on real-world change understanding tasks, including change description, rearrangement planning, and question answering, all with situation awareness.

  • 📂 Dataset on Hugging Face: lrp123/Situat3DChange
  • 🤖 Baseline model: SCReasoner
  • 📊 Evaluation tools: for both traditional NLP metrics and GPT-based evaluation

📦 Installation

We recommend setting up the environment by following the steps in embodied-generalist, as SCReasoner builds on similar infrastructure.

Clone the repo:

git clone https://github.com/RuipingL/Situat3DChange.git
cd Situat3DChange

🚀 SCReasoner Setup & Training

  1. Download Checkpoints

Download checkpoints.zip from the Hugging Face dataset page, and extract it into:

Situat3DChange/SCReasoner/
  1. Launch Training

Use the following command to train SCReasoner with SLURM and Submitit:

python launch.py \
  --mode submitit \
  --config configs/default.yaml \
  --name default \
  --time 48 \
  --num_nodes 1 \
  --partition accelerated \
  --gpu_per_node 4 \
  --mem_per_gpu 100 \
  --port 2050

🧪 Evaluation

1. QA Task

Run:

python eval_qa/eval.py

2. Longform Tasks

For traditional metrics (BLEU-4, ROUGE, CIDEr, METEOR, BERTScore):

python eval_longform/eval.py

For GPT-based evaluation:

python eval_longform/eval_gpt.py

📁 Results

Results for SCReasoner including GPT scores are stored in:

results/SCReasoner/

📫 Citation

If you use this project or dataset, please cite us (citation coming soon).


🙏 Acknowledgment

We thank the LEO project, upon which our project is based.

About

NeurIPS 2025 D&B Track

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
Morty Proxy This is a proxified and sanitized view of the page, visit original site.