inferops
Sebastian Eumann

MLOps is the single most underinvested discipline in enterprise AI. Every company has a model strategy. Almost none have a production strategy. And the gap between a working prototype and a system that runs reliably, scales under load, passes compliance, and survives its first real incident is not a gap you close with better prompts or bigger GPUs. It's an engineering problem. Infrastructure, operations, architecture, and the decision-making rigor to get them right. That is what I work on. That is what this entire practice is about.

What I've built

Over the past several years as a consultant and solution architect, I've helped enterprises across manufacturing, energy, and the public sector build AI platforms that went to production and stayed there.

Among these is a platform serving over 10,000 users on fully sovereign, on-premise infrastructure with dedicated GPU nodes, container-native model serving with automated retraining pipelines, and the full observability and rollback discipline that regulated environments demand. Not a demo. A production system that teams across the organization depend on daily.

In other projects I built and shipped IoT platforms across 200+ manufacturing sites in industrial IT/OT environments. I designed MLOps toolchains for industrial AI at scale, connected IT and OT in settings where cloud was never an option, and learned what it means to deliver systems under constraints that don't bend for your architecture preferences.

Every one of those projects ran into the same pattern. The modeling problems were solvable. The infrastructure decisions, the architecture tradeoffs, and the operational discipline were what determined whether the project shipped or stalled.

Why MLOps is foundational

Most enterprises are past experimentation. They have working models, agentic workflows, promising use cases, and real pressure to deliver. What they don't have is the operational backbone to move from proof of concept to production safely and at scale.

MLOps is that backbone. Model versioning, pipeline orchestration, serving infrastructure, monitoring, governance, reproducibility, and the engineering practices that make AI systems trustworthy enough to run a business on. This is not a nice-to-have maturity stage. It is the difference between AI that creates value and AI that creates technical debt.
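To make the versioning and rollback discipline above concrete, here is a minimal sketch of what a model registry has to guarantee: every version is tracked, promotion to production archives its predecessor, and rollback is a first-class operation rather than a midnight improvisation. All names here (ModelRegistry, promote, rollback) are illustrative, not a real library.

```python
# Minimal model registry sketch: versioning, promotion, rollback.
# Illustrative only -- real platforms use tools like MLflow or a model
# registry built into their serving layer; the invariants are the same.
from dataclasses import dataclass, field

@dataclass
class ModelVersion:
    version: int
    metrics: dict          # evaluation metrics recorded at registration time
    stage: str = "staging" # "staging" | "production" | "archived"

@dataclass
class ModelRegistry:
    """Tracks every registered version so production can always roll back."""
    versions: list = field(default_factory=list)
    history: list = field(default_factory=list)  # production versions, in order

    def register(self, metrics: dict) -> ModelVersion:
        mv = ModelVersion(version=len(self.versions) + 1, metrics=metrics)
        self.versions.append(mv)
        return mv

    def promote(self, version: int) -> None:
        """Move a version to production, archiving the previous one."""
        if self.history:
            self.versions[self.history[-1] - 1].stage = "archived"
        self.versions[version - 1].stage = "production"
        self.history.append(version)

    def rollback(self) -> int:
        """Revert production to the previously promoted version."""
        if len(self.history) < 2:
            raise RuntimeError("no earlier production version to roll back to")
        bad = self.history.pop()
        self.versions[bad - 1].stage = "archived"
        good = self.history[-1]
        self.versions[good - 1].stage = "production"
        return good
```

The point is not the twenty lines of Python; it is that rollback only works if registration, promotion, and history are enforced by the platform, not by convention.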

The companies that build this foundation now will set the pace for the next five years. The ones that keep treating infrastructure as an afterthought will keep producing impressive demos that never quite make it to production.

Where I come from

I started in aerospace engineering, working on simulation and large-scale computing. From there I moved into program and product management in German industry and automotive, leading cross-functional teams that shipped real systems under real constraints. That path eventually pulled me deep into cloud engineering, Kubernetes, GPU infrastructure, and the full MLOps and platform engineering stack.

I mention this because it shapes how I work. I've been in the room where infrastructure budgets get approved and platform strategies get challenged. I've also been the one designing the cluster architecture and troubleshooting the serving pipeline at midnight. I've built my practice on working across the lines and disciplines that effective MLOps and platform engineering require.

I am happy to connect and learn about your challenges: GitHub X Email