Test llama architecture adapter by indrayani21 · Pull Request #1311 · TransformerLensOrg/TransformerLens

indrayani21 · 2026-05-18T12:20:26Z

Description

Adds focused unit tests for the Llama architecture adapter in tests/unit/model_bridge/supported_architectures/test_llama_adapter.py.

This PR adds validation for:

Config attribute initialization
Component mapping structure
GQA (n_key_value_heads) support
Weight conversion registration
Rotary embedding setup logic
Factory registration

The tests were modeled after the existing CodeGen adapter test suite and adapted for Llama-specific behaviors such as:

RMSNorm
RoPE (rotary embeddings)
gated MLPs
grouped-query attention

Related to #1302

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist:

I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have not rewritten tests relating to key interfaces which would affect backward compatibility

-->

* Fix type of HookedTransformerConfig.device This is typed as `Optional[str]` but sometimes returns `torch.device`. Updated the code to just return the `str` instead of wrapping with a device. I'm not confident that every function which takes a device will always be passed a string, so I didn't change functions like warn_if_mps. Found while working on TransformerLensOrg#1219 * more cleanup * 3.0 CI Bugs (TransformerLensOrg#1261) * Fixing `utils` imports * skip gated notebooks on PR from forks * Updating notebooks * Ensure LLaMA only runs when HF_TOKEN is available --------- Co-authored-by: jlarson4 <jonahalarson@comcast.net>

TransformerLens 3.1.0

Release v3.2.0

Release v3.2.1

jlarson4

This PR is solid as well! Just one note on the rotary_emb assertion, and you'll need to clean up any formatting errors. Let me know if you have any questions @indrayani21

jlarson4 · 2026-05-18T15:19:26Z

+        )
+
+        # Verify method exists and adapter remains usable
+        assert hasattr(attn_bridge, "set_rotary_emb")


This assertion cannot fail due to the way the attention bridge constructs LLaMA's rotary_emb. It would be more valuable to assert assert attn_bridge._rotary_emb is rotary_emb

brendanlong and others added 7 commits April 20, 2026 14:50

Merge pull request TransformerLensOrg#1277 from TransformerLensOrg/dev

6f56518

TransformerLens 3.1.0

Merge pull request TransformerLensOrg#1294 from TransformerLensOrg/dev

31d4f6a

Release v3.2.0

Merge pull request TransformerLensOrg#1295 from TransformerLensOrg/dev

5f7b02e

Release v3.2.1

Add config tests for Bloom architecture adapter

d9753e5

Add focused unit tests for Llama architecture adapter

89f2223

Fix import sorting

dcc1b51

jlarson4 reviewed May 18, 2026

View reviewed changes

jlarson4 mentioned this pull request May 18, 2026

Adding adapter tests for Qwen2 #1309

Open

jlarson4 changed the base branch from main to dev May 18, 2026 15:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test llama architecture adapter#1311

Test llama architecture adapter#1311
indrayani21 wants to merge 7 commits into
TransformerLensOrg:devfrom
indrayani21:test-llama-architecture-adapter

indrayani21 commented May 18, 2026

Uh oh!

jlarson4 left a comment

Uh oh!

jlarson4 May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

indrayani21 commented May 18, 2026

Description

Type of change

Checklist:

Uh oh!

jlarson4 left a comment

Choose a reason for hiding this comment

Uh oh!

jlarson4 May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants