
deep learning
Attention Mechanisms, Part 1: From Seq2Seq to Learned Alignment
Most of the interesting things we ask neural networks to do look the same from a distance: take a sequence in, generate a sequence out. Translate an …

Most of the interesting things we ask neural networks to do look the same from a distance: take a sequence in, generate a sequence out. Translate an …

I wanted a way to share a curated weekly AI newsletter with colleagues. The idea: throughout the week, I come across interesting AI news (model …

Explore distributed training techniques through this interactive presentation. Navigate through the slides using arrow keys or the navigation …

Modern deep learning models have grown exponentially in size and complexity. GPT-4 has over a trillion parameters, and even “smaller” …














