Indianapolis, Indiana
Saroj Priyadarshi
13+ years engineering cloud infrastructure at scale. Bridging deep SRE expertise with emerging AI capabilities to build systems that are fast, reliable, and intelligent.

Who I Am
About Me
Senior SRE with 13+ years of hands-on experience, now building at the intersection of infrastructure and AI.
Site Reliability Engineer with 13+ years of experience leading capacity planning, autoscaling, performance optimization, and cloud cost management for large-scale, globally distributed SaaS platforms. At OPENLANE, I architect and operate cloud infrastructure across AWS (ECS, EKS, EC2, RDS, S3, CloudFront, CloudWatch, KMS) and Azure, where uptime isn't a metric; it's a promise.
My work sits at the intersection of reliability and intelligence. With deep expertise in Terraform, Python, Go, AppDynamics, OpenTelemetry, and Splunk, I build systems that self-heal, scale gracefully, and surface insights before problems become incidents — delivering a 40% MTTR reduction and 25% infrastructure cost savings across 100+ production applications.
Today, I'm channeling that operational depth into the AI space. Armed with my AWS AI Practitioner certification and a Master's in IT Management from Indiana University's Kelley School of Business, I'm applying AI tooling to SRE workflows to build infrastructure that doesn't just respond to failures — it predicts and prevents them.
Technical Expertise
Skills & Stack
A decade-plus of hands-on experience across the full infrastructure lifecycle — from cloud provisioning to production observability.
Cloud Platforms
Container Orchestration
Infrastructure as Code
CI/CD & GitOps
Observability
Scripting & Automation
Featured Work
Key Projects
Real-world infrastructure projects delivering measurable reliability, efficiency, and cost improvements at scale.

Enterprise Observability Platform
Unified observability stack for 100+ microservices with real-time alerting, distributed tracing, and AI-assisted anomaly detection.

Infrastructure as Code Automation
End-to-end IaC framework managing 500+ cloud resources with automated drift detection, policy-as-code enforcement, and self-service provisioning.

Enterprise Cloud Migration
Led the migration of 100+ legacy applications to AWS and Azure, achieving zero-downtime cutover with automated rollback capabilities.

SLO Implementation Framework
Service Level Objective framework with automated error budget tracking, burn rate alerting, and stakeholder dashboards for 50+ services.

Chaos Engineering Platform
Automated chaos engineering program using Gremlin and custom tooling to proactively identify and remediate system weaknesses.

Cloud Cost Optimization Engine
Data-driven cost optimization platform that identified cloud waste and automated its elimination, achieving a 35% spend reduction.
Career Journey
Experience
13 years of progressive growth from systems engineering to senior SRE leadership across finance, consulting, and technology.
OPENLANE
Indianapolis, Indiana
Senior Site Reliability Engineer
May 2020 – Present
- Architected and operated large-scale cloud infrastructure across AWS (ECS, EKS, EC2, RDS, S3, CloudFront, CloudWatch, KMS) and Azure, leading capacity planning, autoscaling strategies, and performance optimization for a globally distributed SaaS platform
- Led zero-downtime migration of 100+ on-premises applications to AWS EC2, ECS, and Elastic Load Balancers — reducing infrastructure costs by 25% through right-sizing, reserved instances, and autoscaling optimization
- Designed and maintained enterprise CI/CD pipelines using Azure DevOps, GitHub Actions, and Jenkins with blue-green and canary release strategies, enabling zero-downtime production deployments
- Spearheaded observability platform using AppDynamics, Splunk, Prometheus, Grafana, Datadog, CloudWatch, Azure Monitor, and OpenTelemetry — reducing service downtime by 20% and ensuring full telemetry coverage across Data Plane and Control Plane components
- Defined and tracked SLIs, SLOs, and error budgets across 100+ services; owned incident management and ran blameless postmortems, reducing MTTR by 40% and using error budgets to balance feature velocity with reliability investments
- Planned and led chaos engineering and disaster recovery initiatives — simulating failure scenarios to validate fault tolerance and build organizational resilience
- Automated Kubernetes cluster lifecycle management across EKS and AKS — deploying and operating clusters with Helm, Kustomize, ArgoCD, FluxCD, and custom Operators/CRDs; implemented GitOps workflows for declarative, auditable deployments and designed golden-path application abstractions that empowered development teams to manage their own infrastructure
- Developed internal SRE tooling and automation using Python, Go, Bash, and PowerShell — eliminating manual toil and increasing engineering velocity across distributed systems
- Managed on-call processes and escalation best practices across a globally distributed team, minimizing production downtime and improving response coordination
- Ensured compliance with SOC2 and ISO 27001 standards using IAM, Key Vault, and Secrets Manager; contributed to enterprise messaging architecture using Kafka and Azure Event Hub
- Led a team of 6 engineers, improving overall system reliability by 65% through technical mentorship, architectural decision-making, and cross-functional collaboration
Site Reliability Engineer
May 2017 – Apr 2020
- Modernized hybrid cloud infrastructure and built CI/CD pipelines that improved release velocity and deployment reliability
- Drove adoption of Kubernetes and OpenShift for scalable, resilient microservices architecture — migrating legacy services and improving deployment agility
- Enhanced platform observability using Splunk, AppDynamics, and SolarWinds, improving system uptime and incident response times
- Automated server patching and provisioning, significantly reducing routine manual workload and human error
- Implemented IAM governance, secret rotation policies, and vulnerability management in collaboration with security teams
- Built infrastructure blueprints for onboarding new applications with built-in observability, alerting, and compliance standards
- Supported SOC2 readiness by enforcing infrastructure security compliance and maintaining comprehensive documentation
- Conducted regular disaster recovery drills across multiple business units, ensuring minimal service disruption during outages
Wells Fargo
Remote / Charlotte, NC
Senior Analyst — IT Infrastructure
Oct 2015 – Apr 2017
- Managed mission-critical banking infrastructure under Community Banking Services, ensuring high availability and 24x7 resilience
- Led large-scale IT infrastructure deployments, improving system performance by 20% and reliability by 26%
- Collaborated with cross-functional teams to drive technical decision-making and operational efficiency across distributed environments
- Received "Achieving Excellence" award for outstanding contributions to infrastructure modernization initiatives
Infosys
Bangalore, India
Senior Systems Engineer
Aug 2013 – Sep 2015
- Led infrastructure transformation projects for BMW (Germany) focused on factory IT automation, logistics systems, and resilience engineering, improving system resilience by 30%
- Transitioned legacy infrastructure to scalable platforms, improving uptime by 20% while maintaining SLA compliance
- Provided global infrastructure support, troubleshooting distributed systems across multiple client environments
- Recognized with "Most Valuable Player" and "Bravo Award" for exceptional performance and client satisfaction
Systems Engineer
Jul 2012 – Aug 2013
- Modernized application hosting infrastructure for Baker Hughes (USA), including performance tuning and migration support
- Developed SOPs for Windows and Linux environments supporting global operations
- Delivered automation scripts for administrative tasks and implemented monitoring dashboards to improve operational visibility
Education
2021 — 2023
Master of Science (MS)
Information Technology Management
Indiana University, Kelley School of Business
2008 — 2012
Bachelor of Engineering (BE)
Computer Science & Engineering
Vinayaka Mission's Research Foundation University
Professional Credentials
Certifications
Industry-recognized certifications validating expertise across cloud platforms, orchestration, and infrastructure automation.
Gremlin Enterprise Chaos Engineering
Gremlin
2025
AWS Certified Cloud Practitioner
Amazon Web Services
2025
AWS Certified AI Practitioner
Amazon Web Services
2025
Certified Kubernetes Administrator (CKA)
The Linux Foundation
2023
HashiCorp Certified: Terraform Associate
HashiCorp
2022
PagerDuty Foundational Practitioner & Certified Incident Responder
PagerDuty
2021
ITIL 4 Foundation
AXELOS Global Best Practice
2021
Azure Infrastructure Solutions (Exam 533)
Microsoft
2018
Recognition
Awards & Honors
Recognition from global industry bodies and organizations for technical excellence and professional achievement.
Member of Jury
Business Intelligence Group
Selected as a subject matter expert to evaluate nominations for the BIG Innovation Awards, recognizing outstanding business and technology innovations.
Indian Achiever Award
Indian Achievers Forum
Recognized for outstanding professional achievement and contribution to the technology industry by the Indian Achievers Forum.
Member of Jury
Globee Awards
Appointed as a jury member for the Globee Business and Technology Excellence Awards, evaluating global innovation and leadership.
Achieving Excellence Award
Wells Fargo
Awarded for exceptional contributions to infrastructure modernization initiatives and outstanding performance within the technology organization.
Most Valuable Player Award
Infosys
Recognized as the Most Valuable Player for leading a high-performing operations team and delivering exceptional client satisfaction outcomes.
Bravo Award
Infosys
Received the Bravo Award for demonstrating exceptional teamwork, technical expertise, and commitment to excellence in client delivery.
Let's Connect
Get In Touch
Open to senior SRE roles, AI infrastructure opportunities, and technical leadership positions. Let's talk.
Direct Contact
I'm actively exploring opportunities at the intersection of platform engineering and AI. Whether it's a full-time role, consulting engagement, or just a conversation — reach out.