Andrey Gubarev

Director of Infrastructure

Infrastructure leader who builds and runs the AWS/Kubernetes media platform behind GBNews, RawStory, TheBlaze, Penske, Axios, United Airlines, and many others, with current focus on agent-native infrastructure operations, next-generation Kubernetes platforms, unified observability, and FinOps.

01 Professional Experience

Jan 2024 — Present New York, US · Remote

Director of Infrastructure

RebelMouse

  • Pioneered agent-native infrastructure operations: standardized repository instructions, curated knowledge bases, and guarded execution conventions that let AI coding agents (Claude Code, Codex) plan and apply infrastructure changes safely under human review.
  • Directed the design and rollout of a next-generation Kubernetes (EKS) platform for high-traffic media workloads, combining Karpenter autoscaling across spot and on-demand capacity, Istio service mesh, and GitOps delivery via ArgoCD to enable zero-downtime blue-green cluster migrations and cost-aware scaling under viral traffic.
  • Led SOC 2 Type II compliance from scoping to certification, and architected automated Disaster Recovery environments for critical services using Terraform, Terragrunt, and Kubernetes.
  • Ran a company-wide AWS cost-optimization program that cut spend 12% in six months: statistical analysis of access logs to tier legacy media into S3 Glacier, PostgreSQL storage optimization, Kubernetes workload rightsizing, and eBPF-guided cross-AZ traffic reduction.
  • Established a Linear-first, async-by-default operating model for the DevOps team, with structured intake, explicit priorities, and written decision records, improving cross-timezone collaboration and delivery predictability.
skills
Strategic Planning, Team Management, Budget & Vendor Management, Async Operations
stack
AWS, EKS, Karpenter, Istio, ArgoCD, OpenTofu, Terragrunt, VictoriaMetrics, OpenTelemetry, ClickHouse, Grafana, SOPS, Claude Code, Codex
Feb 2021 — Dec 2023 New York, US · Remote

Head of DevOps

RebelMouse

  • Led the migration of core applications (Python/Django, Elixir) to Kubernetes (EKS) using Helm and ArgoCD, establishing GitOps workflows and significantly improving deployment velocity and reliability.
  • Architected and implemented advanced ingress and proxy layers using Nginx, HAProxy, and Istio, featuring sophisticated traffic routing (canary, mirroring, topology-aware).
skills
Project Management, Team Management
stack
AWS, Kubernetes, Istio, CI/CD, Infrastructure as Code (IaC), Python, OpenTelemetry, SOPS
Jan 2017 — Jan 2021 New York, US · Remote

Lead DevOps Engineer

RebelMouse

  • Led the adoption and implementation of Infrastructure as Code (IaC) using Terraform and Ansible, automating provisioning and configuration of AWS resources (EC2, ASG, VPC, ALB, RDS, S3, Route53).
  • Pioneered the company's migration to containerization and orchestration with Docker and early Kubernetes (EKS) adoption, establishing GitOps practices with ArgoCD and Kustomize.
skills
Project Management
stack
AWS (EC2, EKS, S3, RDS, VPC, WAF), Terraform, Terragrunt, Ansible, Packer, Docker, Kubernetes, ArgoCD, Istio, Fastly, Cloudflare, Prometheus, Grafana, ELK Stack, Python, Bash, TeamCity
Oct 2015 — Dec 2016 New York, US · Remote

Senior Software Engineer / DevOps Engineer

RebelMouse

Transitioned to a DevOps role, leading the design and automation of the company's cloud infrastructure on AWS.

  • Designed and implemented AWS cloud infrastructure as part of the DevOps transformation, building a serving and caching stack that enhanced platform reliability and security.
stack
Python, Django, Ansible, Packer, Docker, AWS (EC2, S3, ELB), Nginx, OpenResty, Varnish, HAProxy, uWSGI, Sentry, MongoDB, Redis, REST APIs
Nov 2012 — Sep 2015 New York, US · Remote

Software Engineer

RebelMouse

Worked as part of the team developing and scaling the core RebelMouse platform, contributing to backend systems, APIs, and performance optimizations.

  • Developed backend and frontend features (activity streams, APIs, settings engine, CTA, social integrations, content distribution features) for a high-traffic social content platform using Python/Django and JavaScript (Backbone.js, jQuery, RequireJS).
stack
Python, Django, Celery, eventlet, Redis, Memcached, MongoDB, MySQL, Backbone.js, REST APIs, S3
Dec 2011 — Oct 2012 New York, US · Remote

Software Engineer

CasaHop

Contributed to the backend and frontend development of a social travel platform, focusing on building performant backend features.

  • Developed features including Facebook authentication, photo uploads (S3, async processing), real-time notifications (WebSockets).
  • Implemented Redis-based autocomplete, activity feeds, and advanced caching mechanisms.
  • Automated development environment setup (Fabric, Makefile, Vagrant), and deployment to AWS/dotCloud.

02 Education

2006 — 2011 Kursk, Russia

Master's Degree (Specialist) in Information Security

South-West State University (formerly Kursk State Technical University)