Projects

Things I've built and worked on.

2 of 5 projects

Mini SGLang

A minimal implementation of SGLang for understanding LLM inference optimization techniques including continuous batching and KV cache management.

llm-inferencepythonresearch
PythonPyTorchCUDA

Attention Visualizer

Interactive visualization tool for exploring attention patterns in transformer models.

visualizationtypescriptresearch
TypeScriptReactD3.js