We Enabled Tracing on Our ML Serving Platform. Documentation Said 1–150ms. We Measured 700ms.
We run machine learning models in production, serving real-time traffic. When our managed serving platform introduced built-in tracing, we were keen to turn it on. Tracing promised visibility into inference behaviour — input shapes, intermediate steps, output payloads. The documentation stated an expected latency overhead of 1–150ms. We enabled it on a