Partiful logo

Performing Per-Neuron Analysis on an LLM

Hosted by
•‿•
Sign in to see location
Ever wonder how transformers actually do some of their computation? Wanted to know more about what this “mechanistic interpretability” thing is and how it relates to AI safety? This is the workshop for you! We’ll be analyzing how transformers perform induction (in particular in the vein of mechanistic interpretability as first kicked off by Anthropic’s 2021 transformer circuits paper). This workshop is meant for people who have previously built and trained their own transformers previously. We will assume that people are already very familiar with a standard decoder-only GPT-2-style transformer. This workshop will consist of an introduction talk into mechanistic interpretability of LLMs, some guiding principles, and then a hands-on exercise where we actually do some interpretability exercises. You will need to bring a computer to participate!
4 on the list
CD
CS
SB
BM
RB
JR
SC
BK
BK
WP
ES
JL
ER
PC
ST
HL
CJ
JC
JE
DL
JD
LL
ᵔ▿ᵔ
⚆◟⚆
+7

Restricted Access

Verify your phone number to view event details and activity
Sign in

Photo Album

Activity

XX
Xxxx Xxxx (+1) rsvped Going 👍
3 months ago
XX
Xxxxx Xxxxxxxxx rsvped Going 👍
3 months ago
XX
Xxxxxxx Xxxxxxx rsvped Going 👍
3 months ago
XX
Xxxxxx Xxxxxxx rsvped Going 👍
3 months ago
XX
Xxxx Xxxxxxxx (+1) rsvped Going 👍
3 months ago
XX
Xxxxxxx Xxxxxx (+1) rsvped Going 👍
3 months ago
[Message hidden] - Sign in to view
XX
Xxxx Xxxxxx (+1) rsvped Going 👍
3 months ago
Dwight Office Tv GIF by The Office
XX
Xxxxxxxx Xxxxxx rsvped Going 👍
3 months ago
XX
Xxxx Xxxxxxx (+1) rsvped Going 👍
3 months ago
XX
Xxxxxxx Xxxx (+1) rsvped Going 👍
3 months ago
[Message hidden] - Sign in to view
Artificial Intelligence Design GIF by xponentialdesign