Musings of Murali

Bridging the Three Gulfs of Agentic Development (and how they shape evals)

3 minute read

A practical framework for spotting and fixing evaluation blind spots in agentic LLM pipelines, based on Shankar et al.’s Three Gulfs model.

Let Agents do the talking: A Scalable Way to Evaluate Multi-Turn Chatbots

5 minute read

Interactive evaluations: lightweight, automated tests that use agents to measure multi-turn chatbot quality at scale.

CUDA Study Log 4: Optimizing Constrained Decoding with Triton Kernel

8 minute read

Update traditional CUDA matrix multiplication kernel for constrained decoding

CUDA Studylog 3 - Tiling and Shared Memory for Matrix Multiplication Optimization

8 minute read

Optimizing CUDA matrix multiplication using tiling and shared memory, with detailed explanations of memory access patterns and performance improvements

CUDA Studylog 2 - Matrix Multiplication and 2D Grid Organization

6 minute read

Deep dive into implementing efficient matrix multiplication using CUDA, with a focus on memory optimization techniques

Murali Manohar

Recent posts

Bridging the Three Gulfs of Agentic Development (and how they shape evals)

Let Agents do the talking: A Scalable Way to Evaluate Multi-Turn Chatbots

CUDA Study Log 4: Optimizing Constrained Decoding with Triton Kernel

CUDA Studylog 3 - Tiling and Shared Memory for Matrix Multiplication Optimization

CUDA Studylog 2 - Matrix Multiplication and 2D Grid Organization