Posts

All the articles I've posted.

Fused Softmax::P1::Naive & Triton Implementation

2 Nov, 2025

Implementing Softmax on Torch and Triton version
Vector Addition::P4::Optimizing

26 Oct, 2025

Optimizing Cuda vector addition kernels to match Triton & Torch
Vector Addition::P3::Benchmarking

12 Oct, 2025

Benchmarking vector addition kernels in Cuda, Triton & Torch
Vector Addition::P2::Cuda Kernel

5 Oct, 2025

Investing vector addition kernel in Cuda

Fused Softmax::P1::Naive & Triton Implementation