Logo

Linux SRE (NCS/Job/ 2169)

For A French Mnc It Company
5 - 7 Years
Full Time
Up to 60 Days
Up to 20 LPA
1 Position(s)
Chennai
Posted 21 Days Ago

Job Skills

Job Description

Role Overview

As a member of the WiFi Systems Administration and SRE team, the candidate will be involved in the management, maintenance, security, and operational availability of the 130 servers in the Sky Business WiFi core estate. The ability to work as part of a larger team with good communication skills is essential to the role.


Responsibilities

  • Take part in ad-hoc on-call rota to action symptoms before they become outages.

  • Be responsible for the engineering and support of production and non-production environments, including automation of patches, upgrades, reliability, and performance improvements.

  • Develop assurance, monitoring, and management capabilities for the platform using Prometheus, Grafana, and ELK stack.

  • Monitor and manage Linux servers, containers, and applications.

  • Support and lifecycle management of various applications and services, including patching, upgrades, updates, and troubleshooting.

  • Day-to-day processing of request, change, and incident work tickets.

  • Routine internal and external changes.

  • Engagement with 3rd party vendors, raising support tickets and assisting engineers.

  • Documentation: responsible for maintaining clear documentation, peer review, and contributing to wider team updates.


What You’ll Bring

  • Linux administration of Ubuntu, and CentOS – we want to migrate our platform from CentOS to Ubuntu.

  • Shell / Python scripting and automation process work (including Ansible).

  • Contribute to improvements to existing systems.

  • Experience working with public cloud, or automating on-premise systems so that they are like using a public cloud.

  • Strong background automating the configuration and management of large-scale platforms: Linux, Git, any scripting language like Python, Go, Bash etc.

  • Experience in database deployment and management (SQL, NoSQL), e.g., Redis, PostgreSQL.

  • Experience of building and maintaining CI/CD pipelines.

  • Experience with automation/orchestration with tools such as Ansible and Chef.

  • Knowledge of Linux networking, e.g., nftables, policy routing.

  • Work as a member of the team to facilitate improvements and increase the efficiency of the department.

  • Good emotional skills, remaining calm under pressure in a crisis or incident.

  • Modern best practices to help level up the team.


Behaviours We’re Looking For

  • Act as a role model to set acceptable working standards, ethics, and practices.

  • Self-start; able to work under instruction and under own initiative.