Personal projects
Research Blog
nanochat-mlx
Train your own ChatGPT on Apple Silicon. MLX port of nanochat.
SunShift
Adjust Night Shift intensity from the menu bar.
activation-probes-claim-correctness
Claim-level correctness probes on Llama hidden activations.
multi-model
Panel-of-experts scaffold vs native thinking on Qwen3-30B-A3B.
society-of-thought-bench
Multi-persona debate benchmark with traces, evidence, and a live demo.
hypothesis-forge
RL environment for novel, evidence-grounded, falsifiable ideas.
adaptive-rag-rlm
Verifiers RLM environment for adaptive recursive search over long corpora.
TextDrop
Turn pasted text into files with one click.
ttt-discover-autoresearch-mlx
Local-first MLX version of TTT-Discover AutoResearch.
TabPilot
Safari tab command center powered by Codex App Server.
SafariMarkdown
Convert any Safari tab to clean Markdown in one click.
Proofgrade
Reproducible LLM proof grading benchmark and API for math.
GhostLabel
Ghostty tab renamer powered by Codex App Server.
PasteForge
Clipboard text transformer with case, encode, format, hash, and stats tools.
ClipDrop
Read clipboard text, edit it, and save to any file format.
DiskPulse
Per-volume disk space monitoring with color-coded usage bars.
PortSentry
See all listening TCP ports with one-click kill.
ProcessBeacon
Monitor long-running processes and get notified on completion.
BrewPilot
Manage Homebrew services. Start, stop, and restart with one click.
gemma4-m4-pro
Gemma 4 on a 24GB MacBook: measured recipes, runtimes, and fallback paths.
train-gemma4-sudoku-on-your-macbook
One-notebook Gemma 4 RL on Apple Silicon.
autoresearch-evo
Novelty-search-inspired autonomous search with run memory and agentic review.