You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Our Near tier (2.4B/s) beats an RTX 3060 by 5–12×.** Our Foveal tier (611M/s) matches RTX 3060 IVF-PQ — but with 0.4% error vs. PQ's 5–10%, and at $0 hardware cost. Only an H100 ($30K, 700W) comes close — and it still needs PCIe transfer + kernel launch overhead that we don't have.
21
+
A $35 Raspberry Pi 4 at 5 watts matches or beats a $350 RTX 3060 at 170 watts. A Sapphire Rapids server outperforms an H100 at half the power. A $15 Pi Zero 2W at 2 watts still beats FAISS CPU Flat by 60%.
20
22
21
23
The trick: GPU must FP32-multiply, FP32-divide, and transfer over PCIe. We read one u8 from a 64KB table that lives in L1 cache. No transfer, no kernel launch, no floating point.
0 commit comments