🔍 Generalized Induction-Head Model (GIM) 🔍

Interpretable Next-token Prediction via the Generalized Induction Head (Kim*, Mantena* et al. 2024).

While large transformer models excel in predictive performance, their lack of interpretability restricts their usefulness in high-stakes domains. To remedy this, we propose the Generalized Induction-Head Model (GIM), an interpretable model for next-token prediction inspired by the observation of “induction heads” in LLMs. GIM is a retrieval-based module that identifies similar sequences in the input context by combining exact n-gram matching and fuzzy matching based on a neural similarity metric. We evaluate GIM in two settings: language modeling and fMRI response prediction. In language modeling, GIM improves next-token prediction by up to 25%p over interpretable baselines, significantly narrowing the gap with black-box LLMs. In an fMRI setting, GIM improves neural response prediction by 20% and offers insights into the language selectivity of the brain. GIM represents a significant step toward uniting interpretability and performance across domains. The code is available at https://github.com/ejkim47/generalized-induction-head.

Setup

Clone the repo and run pip install -e . to install the alm package locally
Set paths/env variables in alm/config.py and deep_fmri/encoding/config.py to point to the correct directories to store data
Experiments
- For language modeling experiments, please refer to experiments_lang/readme.md.
- For fMRI experiments, please refer to experiments_fmri/readme.md.

Organization

alm is the main package
- alm/config.py is the configuration file that points to where things should be stored
- alm/data contains data utilities
- alm/models contains source for models
  - alm/models/build_infinigram_py is heavily borrowed from Infini-gram implementation.
data contains code for preprocessing data
deep_fmri is the package for fmri experiments
- heavily borrowed from deep-fMRI-dataset
experiments_lang contains code for language modeling experiments
experiments_fmri contains code for fmri experiments
- heavily borrowed from deep-fMRI-dataset

Citation

If you find this work useful, please cite:

@misc{kim2024inductiongram,
      title={Interpretable Next-token Prediction via the Generalized Induction Head}, 
      author={Eunji Kim and Sriya Mantena and Weiwei Yang and Chandan Singh and Sungroh Yoon and Jianfeng Gao},
      year={2024},
      eprint={2411.00066},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2411.00066}, 
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔍 Generalized Induction-Head Model (GIM) 🔍

Setup

Organization

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name	Name	Last commit message	Last commit date
Latest commit History 5 Commits 5 Commits
alm	alm
data	data
deep_fmri	deep_fmri
experiments	experiments
experiments_fmri	experiments_fmri
experiments_lang	experiments_lang
LICENSE	LICENSE
readme.md	readme.md
setup.py	setup.py

Search code, repositories, users, issues, pull requests...

Folders and files

Latest commit

History

Repository files navigation

🔍 Generalized Induction-Head Model (GIM) 🔍

Setup

Organization

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages