Prompt: teach me how fourier transforms work and help me understand their hidden patterns
Prompt: explain how positional encodings work in transformers and why they matter
Prompt: teach me gradient descent and how it helps optimize neural networks
Prompt: explain what AdamW is and why it's better than regular Adam optimizer