Tagged: machine-learning
4 articles
CNN Regression on Rendered Meshes: What Improved It Getting a feature detector from terrible to useful: data leakage, camera projection, horizontal mirroring, memory management, and a second-layer MLP for 3D. Read article The Dot Product Is All You Need Every neural network, every embedding, every attention head reduces to the same operation: multiply and sum. How the dot product encodes meaning in high-dimensional space. Read article Attention Is All You Need: Building the Original Transformer that Started the LLM Revolution Attention Is All You Need replaced RNNs with self-attention and changed everything. I built the original encoder-decoder transformer from scratch and trained it to translate English to French. Read article Building a Tiny LLM From Scratch, Trained on Poe A decoder-only transformer, two tokenizers, and 1.9 million characters of Edgar Allan Poe. What building an LLM from zero teaches you. Read article