Writing

Blog

Notes from building software and running AI locally — first-principles explanations, no hype.

Hands On: Install Lemonade, Pull a Model, and Verify Your GPU Path

Parts 1 and 2 explained the stack. A model does math, the compute layer translates that math, and the hardware path decides whether your GPU or CPU d…

AI, Lemonade, Local AI Stack
Lemonade Server: One Local API for Text, Images, Speech, and Tools

Running one local model is useful. Running a local AI server is a bigger idea. A server gives every app on your machine one stable place to send AI w…

AI, Lemonade, Local AI Stack
ROCm Compatibility Is the Contribution

The useful contribution to Lemonade is not just "make local AI run." It is more specific than that: make the right AMD hardware take the right accele…

AI, Lemonade, Local AI Stack, Open Source
Your GPU Is a Math Machine. So Is AI. Here's What Connects Them.

A modern graphics card spends most of its life doing one thing: multiplying enormous tables of numbers together, thousands of operations per second, …

AI, Lemonade, Local AI Stack
Every Chip Has a Secret Name: gfx Codes, ROCm, and the Fallback Ladder

Imagine installing a local AI tool on your gaming PC. You have a Radeon RX 7900 XT in there, a card with 20 GB of memory built for exactly the kind o…

AI, Lemonade, Local AI Stack
Token-to-Utility Density: Why AI-Assisted Development Demands a New Stack

The Context Window Crisis: We're coding in a new era with an old mindset. Modern development practices, built for human developers with infinite patie…

AI Development, Software Architecture
Episodic AI Agents: A Reconciliation-Based Approach to Autonomous Software Development

Abstract: Traditional "genius agent" architectures maintain persistent context across long-running software development tasks, leading to context drif…

AI, Programming

Browse by topic

AI (6), AI Development (1), Lemonade (5), Local AI Stack (5), Open Source (1), Programming (1), Software Architecture (1)