Discussion about this post

User's avatar
Daksh's avatar

I appreciate how you helped realize the importance the positional encodings and self attention!! Still a bit doubtful what the input and output would be between each layers in multiple stacked decoders.

Expand full comment
Chris Ried's avatar

Great overview!

Expand full comment

No posts