
SRE (NCS/Job/3073)

For a French MNC IT Company
6 - 12 Years
Full Time
Up to 30 Days
Up to 25 LPA
1 Position(s)
Chennai, Coimbatore, Gurgaon / Gurugram, Hyderabad, Kolkata, Mumbai, Noida, Pune
Posted 10 Days Ago


Job Description

Job Description: SRE – Linux

Location: Chennai and Bangalore
Employment: Full‑time
Shift: General (24×7 rotational support)

Notice period: Up to 60 days

 


Key Responsibilities

  • Manage and administer Linux environments across RHEL and Ubuntu.
  • Perform OS‑level troubleshooting, performance tuning, and patch management.
  • Handle day‑to‑day Linux operations including system services, networking, storage and security.
  • Manage CI/CD tools and support pipeline automation and implementation.
  • Work with Ansible for configuration management and infra automation.
  • Support database environments with basic MySQL/PostgreSQL understanding.
  • Write and maintain automation using Shell and Python scripts.
  • Troubleshoot production issues, drive root‑cause analysis and maintain runbooks.
  • Collaborate with platform, cloud and infrastructure teams to ensure system reliability.
  • Work in ELK stack, Prometheus and Grafana environments.

Required Skills

  • 5+ years of Linux administration (RHEL family preferred).
  • Strong Linux networking experience (bonding, VLANs, NIC tuning).
  • Solid Kubernetes administration for upgrades and troubleshooting.
  • Hands‑on experience implementing and managing CI/CD pipelines.
  • Good experience with virtualization platforms (VMware / RHEV‑M).
  • Prometheus and Grafana knowledge (rules, alerts, dashboards).
  • ELK stack operations including upgrades and integrations.
  • Strong automation experience using Ansible and scripting in Bash/Python.

Nice to Have

  • RHCSA / RHCE
  • CKA
  • Note: Need profiles on a daily basis.

     

    Job Description: Senior SRE / Platform Engineer

    Location: Hybrid, PAN India (IST)

    Employment: Full‑time

    Experience: 6-9 Years

    Notice Period: Up to 60 days

    CTC: 18 LPA

    Shift: General (IST)

     

    Key Responsibilities

  • Build and maintain automation in Python and Go.
  • Operate and improve AWS and GCP environments.
  • Manage Kubernetes clusters, GitOps workflows and cluster upgrades.
  • Own Day‑2 operations for Kafka, Cassandra, Postgres and other data platforms.
  • Develop and maintain IaC using Terraform, Terragrunt, Ansible and Packer.
  • Implement and support CI/CD pipelines with Jenkins, CircleCI and GitHub Actions.
  • Maintain observability using Prometheus, Thanos, Vector and cloud monitoring tools.
  • Participate in on‑call rotations and lead incident investigations and postmortems.
  • Build tools, exporters and runbooks that improve reliability and operations.

    Required Skills

  • Strong programming skills in Python and Go for automation and tooling.
  • Deep Linux troubleshooting experience across networking, performance and system internals.
  • Solid AWS experience with EC2, VPC, IAM, ALB/NLB, RDS, S3 and EKS.
  • Hands‑on knowledge of Kubernetes, Helm and GitOps (Flux).
  • Strong understanding of Terraform, Terragrunt and Ansible.
  • Experience managing and upgrading distributed systems (Kafka, Cassandra, Postgres).
  • Strong skills in incident management, observability and SLO/alerting design.

    What This Role Is Not

  • Not an application development or UI engineering role.
  • Not a junior DevOps position.
  • Not focused only on scripting.
  • Not a data science or ML role.