[Proposal] Add MLP transcoders #182

@dtch1997

Proposal

Support training, loading, and inference of MLP transcoders.

Motivation

Jacob Dunefsky and Philippe Chlenski have trained MLP transcoders and shown them to be useful for interpretability work.

Pitch

  • Implement a HookedTranscoder class, analogous to HookedSAE, that provides similar functionality.
  • Implement a transcoder training runner.
  • Support loading pre-trained transcoder checkpoints.
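
To make the pitch concrete, here is a rough sketch of what a transcoder computes, assuming the usual formulation: a sparse encoder/decoder pair that reads the MLP input and is trained to predict the MLP output, whereas an SAE reconstructs its own input. The names `TranscoderSketch` and `transcoder_loss`, the ReLU activation, and the L1 penalty are illustrative assumptions, not the proposed HookedTranscoder API or the exact setup of the existing checkpoints.

```python
import torch
import torch.nn as nn


class TranscoderSketch(nn.Module):
    """Hypothetical minimal MLP transcoder (illustration only)."""

    def __init__(self, d_in: int, d_hidden: int, d_out: int):
        super().__init__()
        # Encoder maps MLP inputs to a wide feature layer meant to be sparse.
        self.W_enc = nn.Parameter(torch.randn(d_in, d_hidden) * 0.01)
        self.b_enc = nn.Parameter(torch.zeros(d_hidden))
        # Decoder maps features to a prediction of the MLP output.
        self.W_dec = nn.Parameter(torch.randn(d_hidden, d_out) * 0.01)
        self.b_dec = nn.Parameter(torch.zeros(d_out))

    def encode(self, mlp_in: torch.Tensor) -> torch.Tensor:
        # ReLU keeps feature activations non-negative.
        return torch.relu(mlp_in @ self.W_enc + self.b_enc)

    def decode(self, feats: torch.Tensor) -> torch.Tensor:
        return feats @ self.W_dec + self.b_dec

    def forward(self, mlp_in: torch.Tensor):
        feats = self.encode(mlp_in)
        return self.decode(feats), feats


def transcoder_loss(model, mlp_in, mlp_out, l1_coeff: float = 1e-3):
    """MSE against the true MLP output plus an L1 sparsity penalty (assumed)."""
    pred, feats = model(mlp_in)
    mse = (pred - mlp_out).pow(2).mean()
    l1 = feats.abs().sum(dim=-1).mean()
    return mse + l1_coeff * l1


# Usage with random stand-ins for cached MLP input/output activations.
torch.manual_seed(0)
model = TranscoderSketch(d_in=16, d_hidden=64, d_out=16)
mlp_in = torch.randn(8, 16)
mlp_out = torch.randn(8, 16)
loss = transcoder_loss(model, mlp_in, mlp_out)
loss.backward()
```

The only structural difference from an SAE in this sketch is the training target: the loss compares the decoder output with `mlp_out` rather than with `mlp_in`, which is why a class analogous to HookedSAE seems like a natural fit.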

Checklist

  • I have checked that there is no similar issue in the repo (required)
