ML Systems Review

ML Systems Review https://mlsystemsreview.com Engineering deep-dives into the ML systems that power production AI. Independent, peer-reviewed, no sponsorships. en-us Thu, 16 Apr 2026 16:03:49 GMT <![CDATA[Apple M4 Max first NPU benchmarks: tflops per watt analysis]]> https://mlsystemsreview.com/apple-m4-max-npu-benchmarks/ https://mlsystemsreview.com/apple-m4-max-npu-benchmarks/ Thu, 16 Apr 2026 00:00:00 GMT Benchmarks Lukas Berg <![CDATA[The llama.cpp 2026 rewrite: what changed in the inference engine]]> https://mlsystemsreview.com/llama-cpp-2026-rewrite/ https://mlsystemsreview.com/llama-cpp-2026-rewrite/ Wed, 15 Apr 2026 00:00:00 GMT ML Ecosystem Priya Ramachandran <![CDATA[DeepSeek-V3.5 paper notes: what's actually novel]]> https://mlsystemsreview.com/deepseek-v35-paper-notes/ https://mlsystemsreview.com/deepseek-v35-paper-notes/ Mon, 13 Apr 2026 00:00:00 GMT Model Architecture Dr. Marcus Brennan <![CDATA[The Hugging Face ecosystem: what changed in 2026]]> https://mlsystemsreview.com/huggingface-ecosystem-2026/ https://mlsystemsreview.com/huggingface-ecosystem-2026/ Fri, 10 Apr 2026 00:00:00 GMT Ecosystem Priya Ramachandran <![CDATA[On-device vs cloud inference: a 2026 economic analysis]]> https://mlsystemsreview.com/on-device-vs-cloud-inference-2026/ https://mlsystemsreview.com/on-device-vs-cloud-inference-2026/ Thu, 05 Mar 2026 00:00:00 GMT MLOps Lukas Berg <![CDATA[Building reliable food databases: USDA FoodData Central as ground truth]]> https://mlsystemsreview.com/usda-as-ground-truth/ https://mlsystemsreview.com/usda-as-ground-truth/ Fri, 20 Feb 2026 00:00:00 GMT Data Dr. Marcus Brennan <![CDATA[Inside PlateLens's Calorie-Accuracy Claim: A Technical Replication]]> https://mlsystemsreview.com/platelens-calorie-accuracy-architecture/ https://mlsystemsreview.com/platelens-calorie-accuracy-architecture/ Thu, 12 Feb 2026 00:00:00 GMT Case Study Dr. Marcus Brennan <![CDATA[Rust in production ML pipelines: 2026 adoption trends]]> https://mlsystemsreview.com/rust-in-production-ml-2026/ https://mlsystemsreview.com/rust-in-production-ml-2026/ Tue, 10 Feb 2026 00:00:00 GMT MLOps Lukas Berg <![CDATA[Figma's multiplayer cursor sync: a 2026 architecture update]]> https://mlsystemsreview.com/figma-multiplayer-2026-update/ https://mlsystemsreview.com/figma-multiplayer-2026-update/ Wed, 28 Jan 2026 00:00:00 GMT Distributed Systems Priya Ramachandran <![CDATA[Why accuracy benchmarks mislead: variance, sample size, methodology]]> https://mlsystemsreview.com/accuracy-benchmarks-misleading/ https://mlsystemsreview.com/accuracy-benchmarks-misleading/ Mon, 01 Dec 2025 00:00:00 GMT Methodology Dr. Nadia Volkov <![CDATA[The food recognition problem: a technical overview]]> https://mlsystemsreview.com/food-recognition-technical-overview/ https://mlsystemsreview.com/food-recognition-technical-overview/ Sun, 05 Oct 2025 00:00:00 GMT Computer Vision Dr. Marcus Brennan <![CDATA[Production-scale vision transformers: cost per inference in 2025]]> https://mlsystemsreview.com/production-vision-transformers-cost/ https://mlsystemsreview.com/production-vision-transformers-cost/ Fri, 22 Aug 2025 00:00:00 GMT Infrastructure Priya Ramachandran <![CDATA[Depth estimation from single RGB images: state of 2025]]> https://mlsystemsreview.com/depth-estimation-single-rgb-2025/ https://mlsystemsreview.com/depth-estimation-single-rgb-2025/ Sat, 10 May 2025 00:00:00 GMT Computer Vision Dr. Marcus Brennan <![CDATA[The rise of AI-first consumer apps: 2025 observations]]> https://mlsystemsreview.com/ai-first-consumer-apps-2025/ https://mlsystemsreview.com/ai-first-consumer-apps-2025/ Sat, 15 Mar 2025 00:00:00 GMT Commentary Dr. Nadia Volkov <![CDATA[GPT-4o's multimodal architecture: what we can infer from the paper]]> https://mlsystemsreview.com/gpt4o-multimodal-arch/ https://mlsystemsreview.com/gpt4o-multimodal-arch/ Wed, 20 Nov 2024 00:00:00 GMT Computer Vision Dr. Marcus Brennan <![CDATA[Anatomy of a production ML failure: Zillow's iBuy collapse]]> https://mlsystemsreview.com/zillow-ibuy-ml-failure/ https://mlsystemsreview.com/zillow-ibuy-ml-failure/ Mon, 30 Sep 2024 00:00:00 GMT Case Study Dr. Nadia Volkov <![CDATA[Edge ML inference: iPhone vs Android TFLite benchmarks]]> https://mlsystemsreview.com/edge-ml-ios-android-2024/ https://mlsystemsreview.com/edge-ml-ios-android-2024/ Thu, 18 Jul 2024 00:00:00 GMT Edge ML Dr. Marcus Brennan <![CDATA[Plaid's bank integration API: a system design study]]> https://mlsystemsreview.com/plaid-bank-api-design/ https://mlsystemsreview.com/plaid-bank-api-design/ Thu, 25 Apr 2024 00:00:00 GMT System Design Priya Ramachandran <![CDATA[Discord's architecture: why they're migrating from Elixir to Rust]]> https://mlsystemsreview.com/discord-elixir-to-rust/ https://mlsystemsreview.com/discord-elixir-to-rust/ Mon, 12 Feb 2024 00:00:00 GMT Distributed Systems Priya Ramachandran <![CDATA[The MLOps stack of 2023: what's worth adopting]]> https://mlsystemsreview.com/mlops-stack-2023/ https://mlsystemsreview.com/mlops-stack-2023/ Fri, 15 Dec 2023 00:00:00 GMT MLOps Lukas Berg <![CDATA[CRDTs in production: lessons from Figma's multiplayer engine]]> https://mlsystemsreview.com/figma-crdt-deep-dive/ https://mlsystemsreview.com/figma-crdt-deep-dive/ Sun, 05 Nov 2023 00:00:00 GMT Distributed Systems Priya Ramachandran <![CDATA[Deploying Vision Transformers on mobile: a 2023 retrospective]]> https://mlsystemsreview.com/vit-on-mobile-2023/ https://mlsystemsreview.com/vit-on-mobile-2023/ Sun, 10 Sep 2023 00:00:00 GMT Computer Vision Dr. Marcus Brennan <![CDATA[A week in the life of a production ML pipeline]]> https://mlsystemsreview.com/production-ml-pipeline-week/ https://mlsystemsreview.com/production-ml-pipeline-week/ Tue, 20 Jun 2023 00:00:00 GMT MLOps Lukas Berg <![CDATA[Why we started ML Systems Review]]> https://mlsystemsreview.com/about/ https://mlsystemsreview.com/about/ Mon, 15 May 2023 00:00:00 GMT Editorial Dr. Nadia Volkov