From Scratch Implementation of ResShift Paper for Image Super-Resolution
A deep dive into implementing the ResShift paper from scratch for efficient diffusion-based image super-resolution. Learn about U-Net architecture with Swin Transformer blocks, residual shifting mechanisms, and building state-of-the-art image enhancement models.
From Scratch Implementation of RNN, LSTM and BiLSTM: What I Learned
Exploring the inner workings of Recurrent Neural Networks by implementing RNN, LSTM, and Bidirectional LSTM from scratch. This post covers forward propagation, backpropagation through time (BPTT), and the key insights gained from building these architectures from the ground up.
ML Training Optimization: FLOPs, Profiling, and Learning Strategies
A comprehensive guide to optimizing machine learning training, covering computational constraints, performance profiling, and learning strategies that can save significant costs and time.
From Quantization to Inference: Beginners Guide for Practical Finetuning
A beginner-friendly guide that bridges the gap between quantization and inference, providing practical insights into fine-tuning techniques.
Building GPT from First Principles: Code and Intuition
An intuitive and code-driven exploration of building GPT models from scratch, unraveling the principles behind their architecture.
Understanding Quantization in Deep Learning
A comprehensive guide to memory optimization in deep learning, focusing on quantization techniques and their practical implementation in modern neural networks.
A Guide to Fine-tuning Methods in LLMs (Part 1)
A deep dive into modern fine-tuning techniques for Large Language Models, exploring methods like LoRA, QLoRA, and their practical implementations.