Eugene Brodsky

platform engineering / cloud infrastructure / toronto

I architect and build enterprise-grade cloud infrastructure that scales — reliable, secure platforms that let teams ship faster while focusing on the product.

Twenty-plus years across platform architecture, infrastructure-as-code, MLOps, and compliance work — spanning design agencies, live streaming, VFX rendering farms, retail, biotech drug discovery, endpoint security, AI/ML infrastructure, and agentic systems. Multi-cloud by default, with the operational discipline that keeps products running while they grow.

Operator profile // id 0x01
Locale
Toronto, ON
Tenure
20+ yrs · platform & infra
Stack
Kubernetes · Terraform · Python
Domains
VFX · Bio · IoT · SaaS · AI
Status
◼ Building
0x0a/focus

What I Bring

expertise/core competencies
0x01 · feature

Platform Architecture

Designing and implementing scalable, enterprise-ready platforms from the ground up. Multi-cloud architectures that hold up under audit and scale — with the single-tenant and compliance patterns that make enterprise contracts possible.

AWS Azure GCP Kubernetes Docker
0x02

AI/ML Infrastructure

Building platforms that support AI and ML workloads in production — from model training pipelines and GPU cluster management to serving infrastructure for generative AI applications.

MLOps Ray Argo GPU Clusters Inference Serving
0x03

Infrastructure as Code

Treating infrastructure like software: versioned, reviewed, tested, and reproducible. Consistency across environments without drift, without snowflakes, without surprises.

Terraform Pulumi GitOps ArgoCD Ansible
0x04

Compliance & Security

SOC2 Type I & II, data segregation, single- and multi-tenant enterprise architectures. Compliance is shifted left and built in — not bolted on before an audit.

SOC2 Data Segregation IAM Vault
0x05

DevOps & CI/CD

CI/CD pipelines, deployment automation, and operational practices that enable rapid, reliable software delivery without trading away system stability.

Python TypeScript GitHub Actions CI/CD
0x06

Observability & SRE

Full-stack observability and reliability engineering — dashboards that answer questions, alerts that matter, and systems that explain themselves when things go wrong.

Prometheus Grafana Loki OpenTelemetry
0x07 · current focus

AgentOps & Applied Agentics

Operationalizing AI agents with safety and control built in — guardrails, behavioral evals, sandboxed execution, and isolation patterns that keep autonomous systems bounded. Sovereign inference for organizations with strict data residency requirements. Eval harnesses built on DeepEval and custom frameworks, wired into CI so agent regressions surface before they reach production.

Guardrails Evals Sandboxing Sovereign Inference MCP CrewAI OpenClaw
0x0b/reach

Let's talk

channels open/timezone EST

I work with organizations to build platforms that hold up — whether that's scaling existing infrastructure, migrating to the cloud, implementing compliance, or standing up something entirely new.

Currently open to advisory conversations with early-stage startups. If something here resonated, reach out.