
DevOps & Infrastructure Specialist (RARR Job 6303)
For India'S Leading Diversified Group Of Manufacturing And Services
2 - 4 Years
Full Time
Immediate
Up to 7 LPA
1 Position(s)
Gurugram/ Gurgaon
Posted By : RARR Technologies Pvt Ltd
Posted 7 Days Ago
Job Skills
Job Description
DevOps & Infrastructure Engineer Engineering
- CI/CD Automation: Design, implement, and maintain zero-downtime deployment pipelines using GitHub Actions for Django microservices, Go stream consumers, and FastAPI ML engines.
- Container Orchestration: Architect, secure, and scale local and production container topologies using Docker, Docker Compose, and Kubernetes (EKS) networks.
- Multi-Engine Infrastructure Tuning: Optimize cloud environments handling diverse technical stacks—including Python/Django, high-throughput Go stream parsers, and specialized legacy data processing components written in Java.
- Data Layer Operations: Implement automated provisioning, clustering, and backup configurations for a hybrid database cluster featuring high-write ClickHouse time-series nodes and transactional PostgreSQL instances.
- Edge Ingestion Monitoring: Configure, scale, and maintain cloud routing layers for MQTT brokers and Kafka messaging clusters processing sub-minute inverter packets.
- Observability & Telemetry: Deploy comprehensive monitoring, logs collection, and real-time alerts across all system boundaries using Prometheus, Grafana, and the ELK stack.
- Security & Guardrails: Own multi-tier network topologies, secure VPC configurations, IAM lease access controls, secret encryption policies, and routine vulnerability scanning blocks.
Required Skills
- Cloud Orchestration: Deep production experience inside AWS ecosystems (EC2, RDS, S3, IAM, VPC networks, EKS, Route53).
- Infrastructure as Code (IaC): Solid understanding of Terraform or Ansible for provisioning stable, immutable staging and production setups.
- Container Fluency: Expert command over Dockerfile optimization, multi-stage builds, local container caching, and Kubernetes pod management.
- Polyglot Runtime Awareness: Strong competency configuring runtime environments, memory budgets, and dependency trees across Python, Go, and Java systems.
- Linux System Administration: Deep mastery over Bash scripting, network topology tracing, process isolation, SSH key management, and internal performance benchmarking.
- Messaging Infrastructure: Solid foundational experience provisioning, scaling, or troubleshooting distributed message buses (Apache Kafka or RabbitMQ) and pub/sub brokers.
Bonus Skills (Good to Have)
- Advanced ClickHouse/PostgreSQL Ops: Direct experience scaling time-series databases or configuring high-write replication setups.
- Java Ecosystem Monitoring: Familiarity with JVM tuning, garbage collection profiling, and monitoring metrics for enterprise Java backends.
- MQTT Scale Experience: Configuring clustering setups or load-balancer routes explicitly tailored for high-volume MQTT messaging traffic.
You'll Thrive Here If You
- Believe that any configuration step executed manually twice should immediately be converted into code.
- Prioritize highly observable, predictable, and resilient infrastructure choices over complex cutting-edge tooling.
- Can systematically trace performance issues across physical computing layers, application code boundaries, and cloud networks.
- Communicate with transparent, async-friendly precision within a cross-functional technical circle.
What You Get
- Total architecture-level ownership over a high-scale industrial IoT and automated machine learning infrastructure layer.
- Direct collaboration with core engineering leaders to establish the fundamental scaling policies of the brand.
- A flat team configuration where infrastructure decisions directly enable rapid velocity across web, mobile, and machine learning teams.
Matching Jobs
No matching jobs found.