Toy Transformer
Step
Task: sorting, even-odd sum, 2-sum, slapjack
Learning Rate = 0.0005
Embedding Dimension = 48