The stack worth knowing — open-source, free, and changing what one person can build.

Llama

Meta's family, the one that started the open-weights wave. Sizes range from small enough to run on a phone to large enough to need a cluster. The safe default for general work.

Open weights llama.com

DeepSeek · reasoning

MIT github.com/deepseek-ai

DeepSeek (V3 · R1)

The one that shocked everyone: frontier-class reasoning at a fraction of the training cost, with weights released under the MIT license. Proof that open models can catch up fast.

Alibaba · multilingual

Qwen

Alibaba's family — strong at code, maths and non-English. Most sizes are Apache-2.0, so you can ship commercially with no asterisks.

Apache-2.0 qwen.ai

Mistral · efficient

Mistral & Mixtral

French lab, famous for small models that punch far above their weight. Mixtral pioneered cheap “mixture-of-experts” for the open world.

Apache-2.0 mistral.ai
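
If "mixture-of-experts" sounds abstract, here is a rough sketch of the routing idea in plain Python. It assumes the published Mixtral setup of eight experts with the top two chosen per token; the toy experts and router scores are made up for illustration, and a real model does this with tensors inside every layer.

```python
import math

def top_k_route(router_logits, k=2):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax over only the selected experts so their weights sum to 1.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def moe_forward(x, experts, router_logits, k=2):
    """Mix the outputs of the chosen experts; the other experts never run."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))

# Eight toy "experts" that just scale their input (hypothetical stand-ins
# for the feed-forward blocks a real model would use).
experts = [lambda x, s=s: x * s for s in range(1, 9)]
router_logits = [0.1, 2.0, 0.3, 2.0, -1.0, 0.0, 0.5, 0.2]  # made-up scores for one token

routes = top_k_route(router_logits)
output = moe_forward(10.0, experts, router_logits)
print(routes, output)
```

The saving comes from the last function: only two of the eight experts do any work for this token, so the model carries many parameters but pays for a fraction of them per step.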

Google · on-device

Open weights ai.google.dev/gemma

Gemma

Google's open siblings to Gemini. Small, fast, genuinely good — built to run on your own hardware, including phones and single GPUs.

Microsoft · tiny

MIT huggingface.co/microsoft

Phi

Microsoft's “small but mighty” line — trained on textbook-quality data so a tiny model reasons like a much bigger one. MIT-licensed.

The library · everything

Hugging Face

The GitHub of AI models — a million-plus models, datasets and demos in one place. Where every model above actually lives. Start here.

Hub huggingface.co

OpenAI · speech

MIT github.com/openai/whisper

Whisper

Speech-to-text that just works, in ~100 languages, fully open. It's the engine inside my own murmur app — your voice never leaves the machine.

// new ones land monthly

02 / Run it yourself

Get a model running on your own machine.

A model is just a file. These tools turn that file into something you can chat with — no cloud, no API bill, no data leaving your laptop.

Easiest start

Ollama

One command — ollama run llama3 — and you have a local model. We use it for first-pass work before spending a single paid token.

MIT ollama.com
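
Once a model is running, Ollama also serves a local HTTP API you can script against. A minimal sketch using only the standard library, assuming the default address (localhost, port 11434) and that llama3 has already been pulled; the helper names are mine, not Ollama's.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local address

def build_request(prompt, model="llama3"):
    """Build (but do not send) a non-streaming generate request."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def ask(prompt, model="llama3"):
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]

# Needs a running Ollama server, so it is left commented out here:
# print(ask("Explain what a local model is in one sentence."))
```

Nothing in this leaves the machine: the request goes to localhost and the answer comes back from the model file on disk.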

No terminal needed

LM Studio

A friendly desktop app to download and chat with open models — point, click, run. The gentlest on-ramp if the command line scares you.

Free app lmstudio.ai

Runs on anything

MIT github.com/ggml-org/llama.cpp

llama.cpp

The engine under most local tools. Squeezes big models onto plain laptops — no fancy GPU required. The reason any of this works on cheap hardware.
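
Much of the squeeze comes from quantization: storing each weight in fewer bits. A back-of-envelope sketch of the arithmetic, assuming a 7-billion-parameter model and counting the weights only; real quantized files use slightly more than 4 bits per weight on average, and running the model needs extra memory on top.

```python
def weight_size_gb(n_params, bits_per_weight):
    """Approximate size of the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9

seven_b = 7_000_000_000  # an illustrative 7-billion-parameter model

fp16 = weight_size_gb(seven_b, 16)  # 16-bit weights
q4 = weight_size_gb(seven_b, 4)     # roughly 4-bit quantized weights

print(f"16-bit: {fp16} GB, 4-bit: {q4} GB")
```

That works out to 14 GB versus 3.5 GB, which is the difference between needing a workstation GPU and fitting in an ordinary laptop's RAM.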

When you scale up

Apache-2.0 github.com/vllm-project

vLLM

The serious server engine: what you reach for when you need to serve a model to a whole product, fast. Widely used in production deployments.

Your own ChatGPT

MIT github.com/open-webui

Open WebUI

A polished ChatGPT-style interface that sits on top of Ollama. Self-host it and your whole team has a private chat — your data, your server.

LumiChats (offline)

Private AI on a Windows PC — no internet, no GPU, no cloud. Nine fine-tuned models, built on GPT4All. A fork I keep for fully air-gapped work.

03 / AI coding tools

The tools that change what one person can ship.

This is the part that's genuinely new. A single person with these can now build what used to take a team. Most are free or have a real free tier.

Anthropic · what I use

Claude Code

An agent that lives in your terminal, reads your whole codebase, and does the work — not autocomplete, a teammate. This entire site was built with it.

Subscription claude.com

Open · terminal

Aider

The open-source pair-programmer in your terminal. Bring your own model (even a local one), and it edits your repo and commits to git for you.

Apache-2.0 aider.chat

Open · in VS Code

Apache-2.0 github.com/cline

Cline

An autonomous coding agent right inside VS Code — it plans, edits files, runs commands and asks before anything risky. Fully open, model-agnostic.

Open · autocomplete+

Continue

Open-source Copilot you fully control — pick any model, point it at local ones, build custom assistants. The configurable, no-lock-in option.

Apache-2.0 continue.dev

The popular editor

Cursor

A whole code editor rebuilt around AI — chat with your codebase, multi-file edits, agent mode. Not open, but the free tier is where many people start.

Free tier cursor.com

Open Browser Use

An open alternative for letting an agent drive a real browser: click, type, fill forms, complete tasks on the live web. A fork I keep for web automation.

04 / Agents & the glue

Frameworks that make models do things, not just talk.

A chatbot answers. An agent acts — calls tools, browses, runs code, remembers. These are the building blocks, plus the standard that connects them all.
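
Stripped of any framework, the difference between answering and acting is a loop. A minimal sketch, with a scripted function standing in for the model; the action format and every name here are hypothetical, chosen only to show the shape.

```python
def run_agent(model, tools, task, max_steps=5):
    """Ask the model what to do, run the tool it names, feed the result back."""
    history = [("task", task)]
    for _ in range(max_steps):
        action = model(history)
        if action["type"] == "final":
            return action["answer"]
        result = tools[action["tool"]](**action["args"])
        history.append((action["tool"], result))
    return None  # gave up after max_steps

def fake_model(history):
    """Scripted stand-in for a real model: call the adder once, then answer."""
    if len(history) == 1:
        return {"type": "tool", "tool": "add", "args": {"a": 2, "b": 3}}
    return {"type": "final", "answer": f"The sum is {history[-1][1]}"}

tools = {"add": lambda a, b: a + b}
answer = run_agent(fake_model, tools, "What is 2 + 3?")
print(answer)
```

Swap the scripted function for a real model call and the dictionary for real tools, and this is roughly the skeleton the frameworks below dress up with memory, retries and guardrails.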

The standard · MCP

Open standard modelcontextprotocol.io

Model Context Protocol

The “USB-C for AI” — one open standard so any model can plug into any tool, database or app. The most important plumbing in the field right now.
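
On the wire, MCP is JSON-RPC 2.0. A small sketch of what asking a server to run a tool looks like; the tool name and its arguments are hypothetical, since a real server advertises its own tools through a "tools/list" request.

```python
import json

def tool_call_request(request_id, tool_name, arguments):
    """Build a JSON-RPC 2.0 message asking an MCP server to run one tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }

# "search_issues" and its arguments are made up for illustration.
message = tool_call_request(1, "search_issues", {"query": "login bug"})
wire = json.dumps(message)
print(wire)
```

Because every server speaks this same envelope, a client that can send one tool call can talk to any of them.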

MCP Servers

The reference collection of ready-made MCP connectors — GitHub, Slack, databases, the web. Fork it, run one, and your agent gains a new sense.

The big toolbox

LangChain / LangGraph

The most-used framework for chaining models, tools and memory into real apps. LangGraph adds proper, stateful agent workflows on top.

MIT langchain.com

Teams of agents

CrewAI

Build a “crew” of role-playing agents that collaborate on a task — researcher, writer, reviewer. The simplest way to grasp multi-agent work.

MIT crewai.com

Google · ADK

Apache-2.0 google.github.io/adk-docs

Agent Development Kit

Google's open framework for building and shipping production agents — the same one their own teams use. Clean, opinionated, well-documented.

Browser Harness

A self-healing layer that lets a model finish any task in a real browser, recovering when a page changes. One of my bets on web-acting agents.

05 / Token & memory tools

Make your AI cheaper, faster, and able to remember.

The unglamorous tools that decide whether AI work is affordable. Cut wasted tokens, give an agent a real memory, point it only at what matters. These are the ones I lean on hardest.

Context · pick

Code Review Graph

Builds a persistent map of a codebase so the AI reads only what matters — up to 49× fewer tokens on daily tasks. Massive cost saver on big repos.
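
The general idea can be sketched in a few lines: treat the codebase as a graph of which file imports which, then hand the model only the files near the one being changed. This is an illustration of the principle under my own assumptions, not how Code Review Graph is implemented; the file names are invented.

```python
from collections import deque

def relevant_files(graph, start, max_hops=1):
    """Breadth-first walk over an import graph; return files within max_hops of start."""
    seen = {start: 0}  # file -> distance from start
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if seen[current] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbour in graph.get(current, []):
            if neighbour not in seen:
                seen[neighbour] = seen[current] + 1
                queue.append(neighbour)
    return sorted(seen)

# Hypothetical project: each file maps to the files it imports.
graph = {
    "api.py": ["auth.py", "db.py"],
    "auth.py": ["crypto.py"],
    "db.py": [],
    "crypto.py": [],
    "billing.py": ["db.py"],
}

context = relevant_files(graph, "api.py")
print(context)
```

Here a change to api.py pulls in three of five files; on a repo with thousands of files, the same cut is where the token savings come from.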

Tokens · pick

caveman

“Why use many token when few token do trick” — a Claude Code skill that cuts ~65% of tokens by talking terse. Funny name, real savings.
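
To see what a cut like that is worth, here is the arithmetic as a quick sketch. The 65% figure is the one quoted above; the monthly volume and the price of $3 per million tokens are made-up numbers, so plug in your own.

```python
def tokens_after_cut(tokens, percent_cut):
    """Tokens remaining after trimming percent_cut percent (integer maths, rounds down)."""
    return tokens * (100 - percent_cut) // 100

def cost_usd(tokens, usd_per_million):
    """Cost of a token count at a given price per million tokens."""
    return tokens * usd_per_million / 1_000_000

baseline = 2_000_000                      # hypothetical monthly tokens
trimmed = tokens_after_cut(baseline, 65)  # after a ~65% cut
price = 3.0                               # hypothetical USD per million tokens

saved = cost_usd(baseline, price) - cost_usd(trimmed, price)
print(trimmed, round(saved, 2))
```

With these numbers, two million tokens shrink to 700,000 and the bill drops from $6.00 to $2.10.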

Proxy · pick

rtk1

A tiny CLI proxy that reduces LLM token use 60–90% on common dev commands. One Rust binary, zero dependencies. Sits quietly and saves money.

Search · pick

claude-context

Code-search over MCP — makes an entire codebase searchable context for any agent, so it finds the right file instead of reading everything.
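
A toy version of "find the right file instead of reading everything", using plain word overlap. This is a deliberately simple stand-in I wrote for illustration: it is not claude-context's actual method, and the file contents are invented.

```python
import math
import re
from collections import Counter

def word_counts(text):
    """Lower-case word counts for a piece of text."""
    return Counter(re.findall(r"[a-z]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def search(query, files, top_n=1):
    """Rank files by similarity to the query and return the best top_n names."""
    q = word_counts(query)
    ranked = sorted(files, key=lambda name: cosine(q, word_counts(files[name])),
                    reverse=True)
    return ranked[:top_n]

# Hypothetical mini-codebase: file name -> contents.
files = {
    "auth.py": "def login(user, password): check password hash and create session",
    "billing.py": "def charge(card, amount): create invoice and charge card",
    "email.py": "def send(to, subject, body): send email message",
}

hits = search("where is the password checked at login", files)
print(hits)
```

The agent then opens only the top hit rather than all three files, which is the whole point on a large repo.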

Harness · pick

everything-claude-code

A whole performance system for coding agents — skills, instincts, memory, security, research-first defaults. The playbook I mine for ideas.

06 / The frontier

Where it's going next — from robots to physical AI.

The same wave that hit software is reaching the physical world. Some of this is already open and hackable; some is just worth watching. Both matter if you want to see around the corner.

Robots · open

Apache-2.0 github.com/huggingface/lerobot

LeRobot

Hugging Face's open robotics stack — models, datasets and tutorials to teach a real robot arm new skills. The cheapest door into physical AI.

Simulation · open

Apache-2.0 genesis-embodied-ai.github.io

Genesis

A blazing-fast open physics engine for training robots in simulation before they ever touch the real world. Generates training worlds from a prompt.

Humanoids · affordable

Hardware + SDK unitree.com

Unitree

The company making humanoid and dog robots people can actually afford — and they open-source SDKs and training code. Where hobbyist robotics is heading.

NVIDIA · robot brains

Platform developer.nvidia.com/isaac

NVIDIA Isaac & GR00T

NVIDIA's platform for robot foundation models — train, simulate, deploy. GR00T is their open base model for humanoids. The picks-and-shovels play.

Humanoids · watch

Figure

One of the leaders racing to put general-purpose humanoid robots into real factories and homes. Closed, but the bellwether for how fast this is moving.

Worth watching figure.ai

Images & video · open

ComfyUI

A visual, node-based studio for open image and video models — drag boxes, wire them up, generate. The open creative counterweight to closed tools.

GPL-3.0 comfy.org

07 / Picks

The bets I keep coming back to.

When I find something brilliant I keep it on the shelf — so I can read the code and learn from it. Every link goes to the original repo. A few more beyond the ones above.

Method · pick

superpowers

An agentic skills framework and development method that actually works in practice — a structured way to give agents reusable abilities.

Personal AI · pick

openhuman

A private, simple, powerful personal super-intelligence — the “your own assistant, fully yours” idea I keep circling back to.

E-sign · pick

DocuSeal

Open-source DocuSign alternative. How we get HIPAA agreements signed on our own server, no per-seat bill. Already running in production for us.

Codebase IQ · pick

repowise

Codebase intelligence for AI teams — auto-docs, git analytics, dead-code detection and architecture notes, all over MCP. Keeps big repos legible.

Learn · pick

Transformer Explainer

An interactive, visual way to see how an LLM actually works inside. If you only open one link to learn the fundamentals, make it this one.