Gated Linear Units and Variants

  • Experiment that uses labml.configs
  • Simpler version from scratch