Logo

DevOps & Infrastructure Specialist (RARR Job 6303)

For India'S Leading Diversified Group Of Manufacturing And Services
2 - 4 Years
Full Time
Immediate
Up to 7 LPA
1 Position(s)
Gurugram/ Gurgaon
Posted 7 Days Ago

Job Skills

Job Description

DevOps & Infrastructure Engineer Engineering 

  • CI/CD Automation: Design, implement, and maintain zero-downtime deployment pipelines using GitHub Actions for Django microservices, Go stream consumers, and FastAPI ML engines.
  • Container Orchestration: Architect, secure, and scale local and production container topologies using Docker, Docker Compose, and Kubernetes (EKS) networks.
  • Multi-Engine Infrastructure Tuning: Optimize cloud environments handling diverse technical stacks—including Python/Django, high-throughput Go stream parsers, and specialized legacy data processing components written in Java.
  • Data Layer Operations: Implement automated provisioning, clustering, and backup configurations for a hybrid database cluster featuring high-write ClickHouse time-series nodes and transactional PostgreSQL instances.
  • Edge Ingestion Monitoring: Configure, scale, and maintain cloud routing layers for MQTT brokers and Kafka messaging clusters processing sub-minute inverter packets.
  • Observability & Telemetry: Deploy comprehensive monitoring, logs collection, and real-time alerts across all system boundaries using Prometheus, Grafana, and the ELK stack.
  • Security & Guardrails: Own multi-tier network topologies, secure VPC configurations, IAM lease access controls, secret encryption policies, and routine vulnerability scanning blocks.

Required Skills

  • Cloud Orchestration: Deep production experience inside AWS ecosystems (EC2, RDS, S3, IAM, VPC networks, EKS, Route53).
  • Infrastructure as Code (IaC): Solid understanding of Terraform or Ansible for provisioning stable, immutable staging and production setups.
  • Container Fluency: Expert command over Dockerfile optimization, multi-stage builds, local container caching, and Kubernetes pod management.
  • Polyglot Runtime Awareness: Strong competency configuring runtime environments, memory budgets, and dependency trees across Python, Go, and Java systems.
  • Linux System Administration: Deep mastery over Bash scripting, network topology tracing, process isolation, SSH key management, and internal performance benchmarking.
  • Messaging Infrastructure: Solid foundational experience provisioning, scaling, or troubleshooting distributed message buses (Apache Kafka or RabbitMQ) and pub/sub brokers.

Bonus Skills (Good to Have)

  • Advanced ClickHouse/PostgreSQL Ops: Direct experience scaling time-series databases or configuring high-write replication setups.
  • Java Ecosystem Monitoring: Familiarity with JVM tuning, garbage collection profiling, and monitoring metrics for enterprise Java backends.
  • MQTT Scale Experience: Configuring clustering setups or load-balancer routes explicitly tailored for high-volume MQTT messaging traffic.

You'll Thrive Here If You

  • Believe that any configuration step executed manually twice should immediately be converted into code.
  • Prioritize highly observable, predictable, and resilient infrastructure choices over complex cutting-edge tooling.
  • Can systematically trace performance issues across physical computing layers, application code boundaries, and cloud networks.
  • Communicate with transparent, async-friendly precision within a cross-functional technical circle.

What You Get

  • Total architecture-level ownership over a high-scale industrial IoT and automated machine learning infrastructure layer.
  • Direct collaboration with core engineering leaders to establish the fundamental scaling policies of the brand.
  • A flat team configuration where infrastructure decisions directly enable rapid velocity across web, mobile, and machine learning teams.