Vllm on hexfusion - Sam Batschelet

Vllm on hexfusion - Sam Batschelethttps://hexfusion.io/tags/vllm/Recent content in Vllm on hexfusion - Sam BatscheletHugoen-usSat, 14 Mar 2026 00:00:00 +0000Disaggregated Prefill/Decode on Consumer GPUshttps://hexfusion.io/posts/disaggregated-pd-consumer-gpus/Sat, 14 Mar 2026 00:00:00 +0000https://hexfusion.io/posts/disaggregated-pd-consumer-gpus/Running llm-d’s disaggregated prefill/decode architecture across an RTX 3060 and a Tesla T4 connected by 25GbE RDMA. What worked, what broke, and what I learned about KV cache transfer at the edge of what consumer hardware can do.