AI Platform Administrator

To apply: Share your resumes on khushboo.a@mitrhr.com, muskan.d@mitrhr.com and rashmita.r@mitrhr.com

๐Ÿ“ Location: Bangalore
๐Ÿ’ผ Experience: 8โ€“10 Years


About the Role

We are seeking an experienced AI Platform Administrator to oversee and manage the infrastructure, operations, and reliability of enterprise AI systems. The ideal candidate will have a strong background in Databricks Mosaic AI, cloud AI/ML services, and infrastructure-as-code (IaC) tools. This role is crucial to ensure the seamless functioning of AI platforms supporting developers, ML engineers, and data scientists across the organization.


Key Responsibilities

  • Administer and maintain AI platforms such as Databricks Mosaic AI, Azure ML, AWS SageMaker, or other commercial AI systems.
  • Manage infrastructure provisioning, scaling, and monitoring for production-grade AI workloads.
  • Support AI Developers and ML Engineers with environment setup, access configuration, and technical troubleshooting.
  • Oversee API gateways, MCP servers, and other integrations that connect AI systems and applications.
  • Implement and maintain monitoring, alerting, and incident response processes for AI infrastructure.
  • Maintain comprehensive platform documentation, SOPs, and runbooks to ensure consistent and reliable operations.
  • Collaborate with DevOps and Security teams to enforce IAM, compliance, and platform security best practices.
  • Optimize platform performance and cost efficiency across multi-cloud environments.

Required Skills & Expertise

โœ… 8โ€“10 years of experience in Platform Administration, DevOps, or Cloud Infrastructure roles โ€” ideally within AI/ML environments.
โœ… Hands-on experience managing Databricks, Azure ML, AWS SageMaker, or GCP Vertex AI.
โœ… Strong proficiency in Infrastructure as Code (IaC) using Terraform, Bicep, CloudFormation, or ARM templates.
โœ… Deep understanding of cloud platforms (Azure, AWS, GCP) and their AI/ML ecosystems.
โœ… Experience with MCP servers, API gateways, and AI service orchestration.
โœ… Familiarity with CI/CD pipelines, containerization (Docker/Kubernetes), and monitoring tools (Prometheus, Grafana, Azure Monitor).
โœ… Working knowledge of IAM policies, data security, and governance best practices.
โœ… Excellent documentation, troubleshooting, and communication skills.


Good to Have

  • Exposure to LLM infrastructure, vector databases, or multi-agent frameworks.
  • Experience supporting LangChain or Mosaic AI implementations.
  • Certification in Azure, AWS, or GCP Cloud Architecture / DevOps.
  • Understanding of MLOps and AI lifecycle management tools (MLflow, Kubeflow).

Why Join Us

Join a dynamic and innovation-driven team building next-generation AI infrastructure and platforms that enable scalable, secure, and high-performing enterprise AI solutions. Work at the intersection of cloud, AI, and automation โ€” driving the backbone of intelligent digital transformation.


Comments

No comments yet. Why don’t you start the discussion?

Leave a Reply

Your email address will not be published. Required fields are marked *