The Most Expensive Word in AI Agents Is “Parallel”
AI agents love searching everything in parallel. Your token budget probably doesn’t. A practical look at where the cost actually comes from in multi-agent systems and how to reduce it.
A “one-click” ML observability feature looked like an easy win until we modeled the cost at production scale. What seemed like basic logging turned into a high-volume data pipeline with non-trivial latency and a surprising price curve.
We enabled tracing on a production ML system expecting a small latency overhead. Instead, inference latency increased by ~700 ms, far beyond the documented range, revealing how benchmarking assumptions can break under real workloads.
An end-to-end implementation of a UCITS‑compliant quarterly momentum rotation strategy, complete with Bayesian parameter tuning, walk‑forward validation and practical insights from real‑world retail constraints.
Analysis of Hierarchical Risk Parity (HRP) for crypto portfolios, comparing HRP variants against simply holding Bitcoin over 3 years, with surprising results for portfolio optimisation strategies.