- Company Name
- MUFG Investor Services
- Job Title
- Senior MLOps Engineer
- Job Description
-
**Job Title:** Senior MLOps Engineer
**Role Summary:**
Architect, deploy, and maintain AI agent infrastructure on Agent Core MCP servers and gateways. Responsible for building a secure, scalable, and cost‑optimized platform that supports AI and data science initiatives through robust DevOps practices and cloud automation.
**Expectations:**
- Deliver high‑visibility AI platform components on schedule.
- Collaborate cross‑functionally with AI researchers, backend and frontend engineers, and security teams.
- Ensure platform reliability, observability, compliance, and performance at scale.
**Key Responsibilities:**
- Design, deploy, and manage AI agents on MCP servers/gateways.
- Implement observability with OpenTelemetry, Datadog, and related tools.
- Optimize cost, security, and availability of AI platform components.
- Provide infrastructure and deployment support for AI teams, integrating new technologies into production.
- Conduct load testing, token cost measurement, and resource utilization optimization.
- Perform vulnerability assessments, compliance checks, and external audits.
- Resolve platform issues quickly to maintain uptime.
- Contribute to CI/CD pipelines, automation, and DevOps workflows for AI deployments.
- Evaluate third‑party products for hosting AI agents.
- Maintain documentation for architecture and processes.
- Develop automation scripts (AWS Boto3) and IaC (Terraform) for multi‑environment deployments.
**Required Skills:**
- 5+ years in platform engineering or DevOps.
- Deep knowledge of DevOps principles, workflows, and best practices.
- Proficient in API design, integration, and full‑stack development.
- Hands‑on experience with AWS services (EC2, EKS, Lambda, S3, etc.).
- Expertise in container orchestration (Kubernetes/EKS) and cloud‑native architectures.
- Strong coding in Python, Node.js, or Go.
- Familiarity with OpenTelemetry, Datadog, and observability stack.
- Proven security, cost‑optimization, and performance‑testing experience.
- Knowledge of MCP and AI agent frameworks (preferred).
**Required Education & Certifications:**
- Bachelor’s degree in Computer Science, Engineering, or related field.
- AWS certifications (e.g., Solutions Architect, DevOps Engineer) highly preferred.