- Company Name
- Altera
- Job Title
- Cloud Devops/Site Reliability Engineer
- Job Description
-
**Job Title:** Cloud DevOps / Site Reliability Engineer
**Role Summary:**
Design, develop, and maintain cloud‑native, containerized platforms for Altera hardware solutions across cloud, hybrid‑cloud, and on‑prem environments. Ensure high availability, scalability, and performance through automation, monitoring, and infrastructure‑as‑code practices.
**Expectations:**
- Deliver reliable, automated infrastructure supporting compute‑intensive workloads.
- Optimize resource utilization, autoscaling, and disaster recovery.
- Collaborate with firmware, driver, OS, and application teams to integrate full‑stack software.
- Maintain compliance with OCI standards and licensing requirements.
**Key Responsibilities:**
- Architect and manage Docker‑based containers orchestrated with Kubernetes.
- Create and maintain CI/CD pipelines and Helm chart deployments.
- Implement monitoring, alerting, and observability using Prometheus and Grafana.
- Apply infrastructure‑as‑code (IaC) for provisioning, configuration, and version control.
- Manage container registries, artifact lifecycle, and OCI compliance.
- Optimize system performance, autoscaling, and resource scheduling.
- Ensure platform high availability, fault tolerance, and disaster recovery.
- Support licensing systems (e.g., FlexLM) and software entitlement management.
**Required Skills:**
- Advanced Docker and Kubernetes orchestration (deployment, scaling, networking).
- Helm chart authoring and management.
- CI/CD tooling (e.g., Jenkins, GitLab CI, Argo CD).
- Scripting (Shell, Makefile) and IaC (Terraform, Ansible, or similar).
- Monitoring/observability with Prometheus, Grafana, and alerting frameworks.
- Performance tuning, autoscaling, and resource optimization.
- Container registry and OCI artifact handling.
- Knowledge of software licensing mechanisms (FlexLM).
**Required Education & Certifications:**
- Master’s degree in Computer Science, Computer Engineering, or a related technical field.
- 10+ years of relevant experience in cloud infrastructure, DevOps, and SRE practices.
- Relevant certifications (e.g., CKAD/CKA, AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer) are a plus.