The M5 Pro just dropped, so here's a real AI workload instead of another Geekbench score. We run Qwen3.5 as the brain of a fully local home security system and benchmarked it against OpenAI cloud models on a custom 96-test suite. The Qwen3.5-9B scores 93.8% — within 4 points of GPT-5.4 — while running entirely on the M5 Pro at 25 tok/s, 765ms TTFT, using only 13.8 GB of unified memory. The 35B MoE variant hits 42 tok/s with a 435ms TTFT — faster first-token than any OpenAI cloud endpoint we tested. Zero API costs, full data privacy, all local. Full results: https://www.sharpai.org/benchmark/

▲

Aurornis 2 hours ago | parent | next [-]

Thanks for sharing the results, but it's getting hard to cut through all of the AI generated hype on the page and in your comments to understand what's being testing.

Between the all the em-dashes, this:

> Zero API costs, full data privacy, all local.

and the way your comments have completely different voices it's pretty clear that you're letting AI write some of your HN comments, too.

Is there some place we can quickly go see what's actually being tested? The landing page has non-clickable entries for the categories

	▲	aegis_camera 2 hours ago \| parent [-]
		The comments are actually done by me... The benchmark suit is here: https://github.com/SharpAI/DeepCamera/tree/master/skills/ana...

▲

algo_trader 2 hours ago | parent | prev [-]

> fully local home security system

R u running the GPU at full throttle 24x7? Have you encounters silicon failures over time?

	▲	aegis_camera an hour ago \| parent [-]
		[dead]