Indianapolis, Indiana

Saroj Priyadarshi

Site Reliability Engineer

13+ years engineering cloud infrastructure at scale. Bridging deep SRE expertise with emerging AI capabilities to build systems that are fast, reliable, and intelligent.

Saroj Priyadarshi - Senior Site Reliability Engineer
AWS
Kubernetes
Terraform
Prometheus
Scroll

Who I Am

About Me

Senior SRE with 13 years of hands-on experience, now building at the intersection of infrastructure and AI.

0+
Years Experience
0+
Apps Managed
0%
MTTR Reduction
0%
Infrastructure Cost Savings
Get In Touch

SRE Engineer with 13+ years of experience leading capacity planning, autoscaling, performance optimization, and cloud cost management for large-scale, globally distributed SaaS platforms. At OPENLANE, I architect and operate cloud infrastructure across AWS (ECS, EKS, EC2, RDS, S3, CloudFront, CloudWatch, KMS) and Azure — where uptime isn't a metric, it's a promise.

My work sits at the intersection of reliability and intelligence. With deep expertise in Terraform, Python, Go, AppDynamics, OpenTelemetry, and Splunk, I build systems that self-heal, scale gracefully, and surface insights before problems become incidents — delivering a 40% MTTR reduction and 25% infrastructure cost savings across 100+ production applications.

Today, I'm channeling that operational depth into the AI space. Armed with my AWS AI Practitioner certification and a Master's in IT Management from Indiana University's Kelley School of Business, I'm applying AI tooling to SRE workflows to build infrastructure that doesn't just respond to failures — it predicts and prevents them.

AWS Certified AI Practitioner
Certified Kubernetes Administrator
Gremlin Chaos Engineering
MS — Indiana University

Technical Expertise

Skills & Stack

A decade-plus of hands-on experience across the full infrastructure lifecycle — from cloud provisioning to production observability.

Cloud Platforms

AWS
Microsoft Azure
GCP
Cloudflare

Container Orchestration

Kubernetes (EKS/AKS)
Docker
Docker Swarm
OpenShift
Helm
Kustomize
Linkerd

Infrastructure as Code

Terraform
Pulumi
Ansible
Puppet
Chef
CloudFormation
CDK

CI/CD & GitOps

GitHub Actions
Azure DevOps
Jenkins
ArgoCD
FluxCD

Observability

Splunk
AppDynamics
Prometheus
Grafana
Datadog
OpenTelemetry

Scripting & Automation

Python
Go
Bash
Ruby
Linux
PowerShell

Featured Work

Key Projects

Real-world infrastructure projects delivering measurable reliability, efficiency, and cost improvements at scale.

Enterprise Observability Platform
observability

Enterprise Observability Platform

Unified observability stack for 100+ microservices with real-time alerting, distributed tracing, and AI-assisted anomaly detection.

Reduced MTTR by 40%, enabling proactive incident response
PrometheusGrafanaOpenTelemetryAWSKubernetes
Infrastructure as Code Automation
automation

Infrastructure as Code Automation

End-to-end IaC framework managing 500+ cloud resources with automated drift detection, policy-as-code enforcement, and self-service provisioning.

Reduced provisioning time by 85%, eliminated configuration drift
TerraformAnsibleGitOpsOPAAzure DevOps
Enterprise Cloud Migration
migration

Enterprise Cloud Migration

Led the migration of 100+ legacy applications to AWS and Azure, achieving zero-downtime cutover with automated rollback capabilities.

Zero-downtime migration of 100+ apps, 65% reliability improvement
AWSAzureDockerKubernetesTerraform
SLO Implementation Framework
observability

SLO Implementation Framework

Service Level Objective framework with automated error budget tracking, burn rate alerting, and stakeholder dashboards for 50+ services.

Enabled data-driven reliability decisions for 50+ services
PrometheusGrafanaPythonPagerDutyKubernetes
Chaos Engineering Platform
reliability

Chaos Engineering Platform

Automated chaos engineering program using Gremlin and custom tooling to proactively identify and remediate system weaknesses.

Prevented 23 critical incidents through proactive failure testing
GremlinPythonKubernetesAWSPagerDuty
Cloud Cost Optimization Engine
automation

Cloud Cost Optimization Engine

Data-driven cost optimization platform that identified and automated elimination of cloud waste, achieving 35% spend reduction.

Delivered 35% cloud cost reduction, saving $2M+ annually
AWSAzurePythonTerraformGrafana

Career Journey

Experience

13 years of progressive growth from systems engineering to senior SRE leadership across finance, consulting, and technology.

OPENLANE

Indianapolis, Indiana

Senior Site Reliability Engineer

May 2020Present
  • Architected and operated large-scale cloud infrastructure across AWS (ECS, EKS, EC2, RDS, S3, CloudFront, CloudWatch, KMS) and Azure — leading capacity planning, autoscaling strategies, and performance optimization for a globally distributed SaaS platform
  • Led zero-downtime migration of 100+ on-premises applications to AWS EC2, ECS, and Elastic Load Balancers — reducing infrastructure costs by 25% through right-sizing, reserved instances, and autoscaling optimization
  • Designed and maintained enterprise CI/CD pipelines using Azure DevOps, GitHub Actions, and Jenkins with blue-green and canary release strategies, enabling zero-downtime production deployments
  • Spearheaded observability platform using AppDynamics, Splunk, Prometheus, Grafana, Datadog, CloudWatch, Azure Monitor, and OpenTelemetry — reducing service downtime by 20% and ensuring full telemetry coverage across Data Plane and Control Plane components
  • Defined and tracked SLIs, SLOs, and error budgets across 100+ services; owned incident management and ran blameless postmortems, reducing MTTR by 40% and using error budgets to balance feature velocity with reliability investments
  • Planned and led chaos engineering and disaster recovery initiatives — simulating failure scenarios to validate fault tolerance and build organizational resilience
  • Automated Kubernetes cluster lifecycle management across EKS and AKS — deploying and operating clusters with Helm, Kustomize, ArgoCD, FluxCD, and custom Operators/CRDs; implemented GitOps workflows for declarative, auditable deployments and designed golden-path application abstractions that empowered development teams to manage their own infrastructure
  • Developed internal SRE tooling and automation using Python, Go, Bash, and PowerShell — eliminating manual toil and increasing engineering velocity across distributed systems
  • Managed on-call processes and escalation best practices across a globally distributed team, minimizing production downtime and improving response coordination
  • Ensured compliance with SOC2 and ISO 27001 standards using IAM, Key Vault, and Secrets Manager; contributed to enterprise messaging architecture using Kafka and Azure Event Hub
  • Led a team of 6 engineers, improving overall system reliability by 65% through technical mentorship, architectural decision-making, and cross-functional collaboration

Site Reliability Engineer

May 2017Apr 2020
  • Modernized hybrid cloud infrastructure and built CI/CD pipelines that improved release velocity and deployment reliability
  • Drove adoption of Kubernetes and OpenShift for scalable, resilient microservices architecture — migrating legacy services and improving deployment agility
  • Enhanced platform observability using Splunk, AppDynamics, and SolarWinds, improving system uptime and incident response times
  • Automated server patching and provisioning, significantly reducing routine manual workload and human error
  • Implemented IAM governance, secret rotation policies, and vulnerability management in collaboration with security teams
  • Built infrastructure blueprints for onboarding new applications with built-in observability, alerting, and compliance standards
  • Supported SOC2 readiness by enforcing infrastructure security compliance and maintaining comprehensive documentation
  • Conducted regular disaster recovery drills across multiple business units, ensuring minimal service disruption during outages
AWSAzureKubernetesTerraformArgoCDFluxCDAppDynamicsSplunkPrometheusGrafanaDatadogOpenTelemetryPythonGoHelmKustomize

Wells Fargo

Remote / Charlotte, NC

Senior Analyst — IT Infrastructure

Oct 2015Apr 2017
  • Managed mission-critical banking infrastructure under Community Banking Services, ensuring high availability and 24x7 resilience
  • Led large-scale IT infrastructure deployments, improving system performance by 20% and reliability by 26%
  • Collaborated with cross-functional teams to drive technical decision-making and operational efficiency across distributed environments
  • Received "Achieving Excellence" award for outstanding contributions to infrastructure modernization initiatives
LinuxPythonBashJenkins

Infosys

Bangalore, India

Senior Systems Engineer

Aug 2013Sep 2015
  • Led infrastructure transformation projects for BMW (Germany) — focusing on factory IT automation, logistics systems, and resilience engineering, improving system resilience by 30%
  • Transitioned legacy infrastructure to scalable platforms, improving uptime by 20% while maintaining SLA compliance
  • Provided global infrastructure support, troubleshooting distributed systems across multiple client environments
  • Recognized with "Most Valuable Player" and "Bravo Award" for exceptional performance and client satisfaction

Systems Engineer

Jul 2012Aug 2013
  • Modernized application hosting infrastructure for Baker Hughes (USA), including performance tuning and migration support
  • Developed SOPs for Windows and Linux environments supporting global operations
  • Delivered automation scripts for administrative tasks and implemented monitoring dashboards to improve operational visibility
LinuxShell ScriptingITILWindows Server

Education

20212023

Master of Science (MS)

Information Technology Management

Indiana University, Kelley School of Business

20082012

Bachelor of Engineering (BE)

Computer Science & Engineering

Vinayaka Mission's Research Foundation University

Professional Credentials

Certifications

Industry-recognized certifications validating expertise across cloud platforms, orchestration, and infrastructure automation.

Gremlin Enterprise Chaos Engineering

Gremlin

2025

AWS Certified Cloud Practitioner

Amazon Web Services

2025

AWS Certified AI Practitioner

Amazon Web Services

2025

Certified Kubernetes Administrator (CKA)

The Linux Foundation

2023

HashiCorp Certified: Terraform Associate

HashiCorp

2022

PagerDuty Foundational Practitioner & Certified Incident Responder

PagerDuty

2021

ITIL 4 Foundation

AXELOS Global Best Practice

2021

Azure Infrastructure Solutions (Exam 533)

Microsoft

2018

Recognition

Awards & Honors

Recognition from global industry bodies and organizations for technical excellence and professional achievement.

2025

Member of Jury

Business Intelligence Group

Selected as a subject matter expert to evaluate nominations for the BIG Innovation Awards, recognizing outstanding business and technology innovations.

2023

Indian Achiever Award

Indian Achievers Forum

Recognized for outstanding professional achievement and contribution to the technology industry by the Indian Achievers Forum.

2023

Member of Jury

Globee Awards

Appointed as a jury member for the Globee Business and Technology Excellence Awards, evaluating global innovation and leadership.

2016

Achieving Excellence Award

Wells Fargo

Awarded for exceptional contributions to infrastructure modernization initiatives and outstanding performance within the technology organization.

2015

Most Valuable Player Award

Infosys

Recognized as the Most Valuable Player for leading a high-performing operations team and delivering exceptional client satisfaction outcomes.

2014

Bravo Award

Infosys

Received the Bravo Award for demonstrating exceptional teamwork, technical expertise, and commitment to excellence in client delivery.

Let's Connect

Get In Touch

Open to senior SRE roles, AI infrastructure opportunities, and technical leadership positions. Let's talk.

Direct Contact

I'm actively exploring opportunities at the intersection of platform engineering and AI. Whether it's a full-time role, consulting engagement, or just a conversation — reach out.

Indianapolis, Indiana

Send a Message