Real hardware · Real tasks · Honest verdicts
Your data doesn't
leave this machine.
Weekly experiments on Apple Silicon. Each one targets a specific task
for a specific regulated industry and returns a verdict you can actually act on:
viable, partial, or not yet, rather than "promising" or "has potential."
FAQ
About the experiments
- What does Ground Floor test each week?
- Each Ground Floor experiment runs one real task for a regulated practice on a local, open-weight model: a SOAP note, a contract clause, a set of meeting minutes. It runs on Apple Silicon with the air gap intact, so no data leaves the machine. I publish what it did, the model and hardware used, and a plain verdict of viable, partial, or not yet.
- Are the failures published too?
- Yes. An experiment gets a "not yet" verdict when the local model could not do the task well enough to trust, and that verdict stays up. The point is an honest record of what local AI can and cannot do for regulated work right now, not a highlight reel.
- What hardware and models do the experiments use?
- The experiments run open-weight models on Apple Silicon Macs, from an M4 mini up to the two-M5-Max lab. Each write-up lists the exact model, the Mac, and the task, so you can judge whether it maps to your own setup.