Nelson Jaime

DevOps · Site Reliability Engineer

Professional Summary

Highly motivated SRE and DevOps professional with 4+ years of experience designing, automating, and managing large-scale cloud infrastructure. Currently managing 300+ Kubernetes clusters across AWS, GCP, Azure, and Linode at Hydrolix. Specialized in GitOps, Infrastructure as Code, and observability. Passionate about platform reliability, automation, and reducing operational toil.

Experience

Site Reliability Engineer

2025 – Present
  • Managing, upgrading, and maintaining 300+ Kubernetes clusters across AWS (EKS), GCP (GKE), Azure (AKS), and Linode (LKE) using k9s
  • Designed and continuously improved automated deployment pipelines for customer-dedicated stacks using Pulumi (IaC) and Argo Workflows as the orchestration layer
  • Built reusable WorkflowTemplate definitions for customer deployments (e.g., trafficpeak deployment workflows)
  • Managed production deployments using ArgoCD following GitOps principles
  • Delivered customer deployments across global regions: North America, Europe, and Asia
  • Built CI/CD automations using Argo Workflows and GitHub Actions

DevOps Engineer

2024 – 2025
  • Upgraded Kubernetes platform across 8 clusters with their respective nodes on AWS
  • Implemented full monitoring stack from scratch: Prometheus, AlertManager, Grafana, and Loki for microservices log management
  • Deployed and configured exporters to collect metrics from various sources and services
  • Implemented Ansible playbooks for on-premise servers: ELK (2-node) and Kubernetes cluster (3-node via kubespray)

DevOps Engineer

2021 – 2024
  • Designed and implemented DevOps methodologies and infrastructure on AWS using Kubernetes, Terraform, Jenkins, Vault, Nexus, and GitLab
  • Designed and built CI/CD pipelines with Jenkins for multi-team and multi-project applications; integrated with GitLab and SonarQube
  • Implemented monitoring and alerting with Prometheus, AlertManager, and Grafana; configured ELK Stack for log management
  • Installed and maintained HashiCorp Vault for secure secrets and credentials management across production and development environments
  • Managed GitLab servers for source code and CI/CD pipeline management including branching, merging, and conflict resolution

Education

Master in Java Development

2021

Technical Skills

Orchestration

Kubernetes, Helm, Docker, ArgoCD, Argo Workflows

Cloud

AWS (EKS), GCP (GKE), Azure (AKS), Linode (LKE)

Infrastructure as Code

Terraform, Pulumi (Python), Ansible

CI/CD

Jenkins, GitLab CI, Azure DevOps, GitHub Actions

Monitoring

Prometheus, AlertManager, Grafana, Loki, ELK Stack

Scripting & Dev

Bash, Python, Groovy, .NET, Java (SpringBoot)

Security & Tooling

HashiCorp Vault, Nexus, SonarQube, Linux, k9s

Certifications

AWS Cloud Practitioner

2023

Amazon Web Services

AZ-900 Microsoft Azure Fundamentals

2023

Microsoft

DevOps Foundations

2023

Linux Foundation

Projects

Homelab — K3s Cluster

Self-hosted 3-node K3s cluster managed with ArgoCD GitOps. Runs Sealed Secrets, Cloudflare Tunnel for secure external access, and a local AI stack (Ollama + Open WebUI) with LLM-driven SRE automation workflows.

Core Metrics

300+

Kubernetes Clusters

4

Cloud Providers

4+

Years of Experience

3

Certifications

Languages

Spanish

Native

English

Professional