
Validation & Performance Automation Engineer (RARR Job 5418)
Job Skills
Job Description
Core Responsibilities:
-
Design & implement GitOps-compliant pipelines for validation across hardware, OS, Kubernetes, and platform layers.
-
Integrate Sonobuoy for Kubernetes conformance testing.
-
Orchestrate chaos engineering workflows via LitmusChaos for resilience testing.
-
Implement performance tests with k6 and system benchmarks in CI/CD.
-
Develop/maintain end-to-end test frameworks (pytest / Go) for cluster lifecycle, upgrades, and GPU workloads.
-
Ensure test coverage: conformance, performance, fault injection, post-upgrade checks.
-
Build dashboards/reports for test results, compliance, and drift detection.
-
Collaborate with Infra, SRE, and platform teams to embed validation early in deployments.
-
Own QA gates for all automation-driven releases.
Required Skills:
-
Mandatory: Ansible, Terraform, Puppet, Chef, Shell/PowerShell/Python scripting
-
Core: pytest, Go, k6, Sonobuoy, LitmusChaos, CI/CD (GitHub Actions, GitLab CI, Jenkins)
-
Kubernetes architecture & upgrade expertise
-
GitOps (ArgoCD, Flux)
-
Infra validation (GPU drivers, kernel modules, CNI, CRI)
-
Debugging, RCA, incident analysis skills
-
GPU infra knowledge (plus point)