- Company Name
- CLS
- Job Title
- Ingénieur sre - exploitation systèmes F/H
- Job Description
-
**Job Title**
SRE Systems Operations Engineer (M/F)
**Role Summary**
Responsible for ensuring the reliability, availability, and performance of critical 24/7 services in a hybrid cloud/on‑premise environment. Integrates SRE principles, operational engineering, and system mastery to maintain service resilience, reduce toil, and improve incident response and automation.
**Expectations**
* Minimum 3 years of experience in operations, reliability engineering, or systems/SRE roles.
* Proven ability to work independently and collaboratively across disciplines.
* Strong communication skills for coordinating with level‑2/3 operators, developers, and business stakeholders.
* Commitment to continuous improvement, automation, and adherence to security and compliance standards.
**Key Responsibilities**
1. **Incident Management** – Lead major incident resolution, coordinate technical teams, conduct post‑incident reviews, identify root causes, and implement preventive actions.
2. **Observability & Monitoring** – Sustain and evolve dashboards, metrics, logs, and traces; define SLI/SLO metrics with product and business owners; pinpoint architectural weaknesses and operational risks.
3. **Automation & Industrialization** – Automate repetitive tasks, reduce manual toil; infrastructure as code and CI/CD pipeline development; standardize configurations and operational processes.
4. **Multidisciplinary Coordination** – Act as level‑2/3 support; collaborate with 24/7 operators, IT, developers, and security teams to ensure solutions meet contractual, economic, and scheduling requirements; enforce security posture.
**Required Skills**
* Linux administration, Bash, scripting.
* Containerization & orchestration: Docker, Kubernetes.
* Deployment & versioning: Git/GitLab, Ansible.
* Cloud platforms: AWS, Azure (basic knowledge).
* IT fundamentals: storage, virtualization, databases, networking protocols.
* Observability tools: Zabbix, Grafana, CloudWatch, Prometheus, ELK, OpenSearch.
* Analytical problem‑solving, proactive mindset, and strong sense of accountability.
* Team orientation, knowledge sharing, and clear communication.
**Required Education & Certifications**
* Bachelor’s or Master’s degree in Computer Science, Systems Engineering, or equivalent.
* Relevant certifications (e.g., AWS Solutions Architect, CKA, Certified Kubernetes Administrator, or SRE certifications) are advantageous.
Ramonville-saint-agne, France
On site
Junior
25-11-2025