Michael Min Wah Leung
Notes on post-training, sequence modelling, and the occasional brain. Code-first, results-honest, written by an ML engineer with a research background in neuroscience.
Writing
Does a single direction mediate refusal? A small reproduction
interpretability
safety
steering
transformers
Patient-specific filters as biomarkers
neuroscience
signal-processing
interpretability
No matching items