Nilasu Consulting Services Pvt Ltd

ML Engineer – AI Ops & Model Infrastructure (NCS/Job/1873)

For an Indian MNC information technology services and consulting company

7 - 9 Years

Full Time

Up to 30 Days

Up to 32 LPA

1 Position(s)

Bangalore / Bengaluru, Hyderabad

Job Description

  • Build and maintain serving infrastructure for ONNX models, Augloop, SLM-based inference, and future LLM/SLM pipelines.
  • Integrate models into scalable APIs for online prediction and retrieval-augmented generation workflows.
  • Set up and run real-time A/B experiments on production Copilot features.
  • Implement alerting, logging, and telemetry tools to monitor model drift, latency, and regressions.
  • Develop dashboards for automated quality monitoring and error detection in inference traffic.
  • Optimize inference latency and cost across CPU/GPU environments.
  • Build internal tools for performance analysis, model comparison, and troubleshooting.
  • Work on batch and streaming inference frameworks, ensuring SLA adherence.
  • Implement resource orchestration and utilization tracking across CPU/GPU workloads.
  • Contribute to tools that monitor uptime, throughput, container health, and job scaling.
  • Ensure scalability and reliability of model APIs, with clear SLAs around latency, throughput, cost, and memory footprint.
  • Profile models and infra for cold start issues, load testing, and concurrency handling.
  • Integrate Responsible AI checks for fairness, explainability, and performance variance.
  • Address AI injection attacks, inference sandboxing, and privacy guardrails.
  • Contribute to regression pipelines for SLA, PII, and compliance validation across Copilot features.

Required Experience

  • 3–6 years of hands-on experience as an ML SWE or MLOps Engineer in production AI systems.
  • Strong coding skills in Python, C++, or Go, with experience in TensorRT, ONNX Runtime, or similar.
  • Experience with MLOps tools: Azure ML, Kubernetes, Prometheus, Grafana, MLflow, Airflow, etc.
  • Hands-on experience with monitoring systems, load testing tools, and infra debugging utilities.
  • Familiarity with model security, compliance frameworks, or Responsible AI practices is a plus.

Soft Expectations

  • Able to work independently and deliver high-quality infrastructure code within agile cycles.
  • Document architecture, assumptions, and SLA metrics clearly.
  • Comfortable collaborating with both AI scientists and infra/DevOps teams.
  • Availability to overlap working hours with the Prague or Redmond teams is preferred.
