AM-64M

A small from-scratch language model (RoPE, QK-Norm, RMSNorm, SwiGLU) trained on TinyTextbooks. Enter a prompt and generate a continuation.

8 512
0.1 2
0 100
0 1
Examples