Platform engineer and researcher at the University of Bologna specialising in MLOps, AIOps, and Kubernetes-native infrastructure. I architect production-grade LLM agent systems, anomaly detection pipelines, and hybrid HPC/cloud orchestration frameworks for datacenter-scale environments.
I spent seven years in hands-on systems and network engineering in mission-critical industrial environments (power plants, enterprise OT networks) before pursuing a PhD in High-Performance Computing. That background gives me a lens most ML researchers lack: I care about uptime, observability, and correctness in production, not just benchmark performance.
My current focus is autonomous infrastructure control: systems that detect failures, reason about root causes, and propose or execute remediation with human approval gates.