Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Releases: TransformerLensOrg/TransformerLens

v3.2.1

09 May 08:36
5f7b02e

Choose a tag to compare

What's Changed

Hot fix for issues with Gemma3 multimodal interp.

Full Changelog: v3.2.0...v3.2.1

v3.2.0

08 May 23:23
31d4f6a

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.1.0...v3.2.0

v3.1.0

30 Apr 01:52
6f56518

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.0.0...v3.1.0

TransformerLens 3.0

17 Apr 19:31
30baa16

Choose a tag to compare

What's Changed

Migrating to a new way to implement models via the TransformerBridge system. Increased model support from ~200 models to ~9,000 models

Read more

v2.18.0

24 Mar 17:03
589acd4

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v2.17.0...v2.18.0

v3.0.0b3

13 Mar 16:36
04ccabf

Choose a tag to compare

What's Changed

  • Support callable filters in TransformerBridge.add_hook() by @jlarson4 in #1186
  • Update Patching Hook to avoid causing conflicts by @jlarson4 in #1187
  • Prevent Stale Joint QKV values from being incorporated into weight folding after Layer Norm application by @jlarson4 in #1188
  • Updated to remove hardcoded .cpu() processing by @jlarson4 in #1189
  • Return true initial batch size information by @jlarson4 in #1190
  • hook_result & Hook Aliases issues by @jlarson4 in #1191
  • updated loading in exploratory analysis demo to use transformer bridge by @degenfabian in #1014
  • updated loading in patchscopes generation demo to use transformer bridge by @degenfabian in #1021
  • Additional Exploratory analysis Demo fixes by @jlarson4 in #1192
  • update loading in bert demo to use transformer bridge by @degenfabian in #1015
  • updating loading in qwen demo to use transformer bridge by @degenfabian in #1025
  • updated loading in activation patching demo to use transformer bridge by @degenfabian in #1011
  • updating loading in t5 demo to use transformer bridge by @degenfabian in #1022
  • updated loading in attribution patching demo to use transformer bridge by @degenfabian in #1013
  • v3.0.0b3 – Notebook Demo Update & Bug Fixes by @jlarson4 in #1196
  • Verifying Additional Models by @jlarson4 in #1199
  • Feature/multimodal architecture adapters by @jlarson4 in #1200
  • Fix boolean 4D attention-mask handling in joint-QKV bridge attention reconstruction by @speediedan in #1198
  • Feature/llava next and onevision variants by @jlarson4 in #1202

Full Changelog: v3.0.0b2...v3.0.0b3

v3.0.0b2

26 Feb 19:25
d5561da

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.0.0b1...v3.0.0b2

v2.17.0

21 Jan 23:32
7df72ff

Choose a tag to compare

We've got an exciting new release that includes several new models! Gemma 3, MedGemma, and Qwen3-0.6B-Base are now included in options for models. In addition to these new models, a handful of bugs and other small non-breaking changes were made.

What's Changed

New Contributors

Full Changelog: v2.16.1...v2.17.0

v3.0.0b1

07 Dec 16:37
b4e24d3

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.0.0a8...v3.0.0b1

v3.0.0a8

07 Sep 22:51
ee9b44b

Choose a tag to compare

v3.0.0a8 Pre-release
Pre-release

Another update that rounds out the API for our new module

What's Changed

Full Changelog: v3.0.0a7...v3.0.0a8

Morty Proxy This is a proxified and sanitized view of the page, visit original site.