Eugene Brodsky

platform engineering / cloud infrastructure / toronto

I architect and build enterprise-grade cloud infrastructure that scales — reliable, secure platforms that let teams ship faster while focusing on the product.

Twenty-plus years across platform architecture, infrastructure-as-code, MLOps, and compliance work — spanning design agencies, live streaming, VFX rendering farms, retail, biotech drug discovery, endpoint security, AI/ML infrastructure, and agentic systems. Multi-cloud by default, with the operational discipline that keeps products running while they grow.

Dive Deeper Get in touch

Locale: Toronto, ON
Tenure: 20+ yrs · platform & infra
Stack: Kubernetes · Terraform · Python
Domains: VFX · Bio · IoT · SaaS · AI
Status: ◼ Building

0x0a/focus

What I Bring

expertise/core competencies

0x01 · feature

Platform Architecture

Designing and implementing scalable, enterprise-ready platforms from the ground up. Multi-cloud architectures that hold up under audit and scale — with the single-tenant and compliance patterns that make enterprise contracts possible.

AWS Azure GCP Kubernetes Docker

0x02

AI/ML Infrastructure

Building platforms that support AI and ML workloads in production — from model training pipelines and GPU cluster management to serving infrastructure for generative AI applications.

MLOps Ray Argo GPU Clusters Inference Serving

0x03

Infrastructure as Code

Treating infrastructure like software: versioned, reviewed, tested, and reproducible. Consistency across environments without drift, without snowflakes, without surprises.

Terraform Pulumi GitOps ArgoCD Ansible

0x04

Compliance & Security

SOC2 Type I & II, data segregation, single- and multi-tenant enterprise architectures. Compliance is shifted left and built in — not bolted on before an audit.

SOC2 Data Segregation IAM Vault

0x05

DevOps & CI/CD

CI/CD pipelines, deployment automation, and operational practices that enable rapid, reliable software delivery without trading away system stability.

Python TypeScript GitHub Actions CI/CD

0x06

Observability & SRE

Full-stack observability and reliability engineering — dashboards that answer questions, alerts that matter, and systems that explain themselves when things go wrong.

Prometheus Grafana Loki OpenTelemetry

0x07 · current focus

AgentOps & Applied Agentics

Operationalizing AI agents with safety and control built in — guardrails, behavioral evals, sandboxed execution, and isolation patterns that keep autonomous systems bounded. Sovereign inference for organizations with strict data residency requirements. Eval harnesses built on DeepEval and custom frameworks, wired into CI so agent regressions surface before they reach production.

Guardrails Evals Sandboxing Sovereign Inference MCP CrewAI OpenClaw

0x0b/reach

Let's talk

channels open/timezone EST

I work with organizations to build platforms that hold up — whether that's scaling existing infrastructure, migrating to the cloud, implementing compliance, or standing up something entirely new.

Currently open to advisory conversations with early-stage startups. If something here resonated, reach out.

// channels ● open

Email hello@brodsky.dev LinkedIn in/brodsky GitHub github.com/ebr