- Company Name
- Aquent
- Job Title
- Site Reliability Engineer
- Job Description
-
Job Title: Site Reliability Engineer
Role Summary: Lead engineering teams to design, build, and maintain highly available, cloud‑native applications on GCP, ensuring reliability, scalability, and continuous delivery for millions of users.
Expectations: Deliver projects on schedule with high quality, mentor multiple teams, and drive the adoption of agile practices, DevOps automation, and AI/ML enhancements. Must be U.S. W2 employee for a 12‑month, full‑time contract.
Key Responsibilities:
- Lead end‑to‑end delivery of engineering initiatives using Scrum and robust release practices.
- Translate high‑level architecture into detailed, low‑level designs, providing technical oversight for cloud‑native app development and deployment on GCP.
- Design, build, and operate highly available, scalable systems, ensuring optimal performance and resilience.
- Manage data solutions across MongoDB, Aerospike, SQL Server, and PostgreSQL, optimizing data architecture and maintenance.
- Implement containerization (Docker, Kubernetes) and CI/CD pipelines to streamline development and deployment workflows.
- Integrate AI/ML models for data analytics, visualization, and automation to solve complex business problems.
- Mentor and coach cross‑functional teams, fostering technical growth and collaborative culture.
- Present architectural solutions and data‑backed proposals to governance boards and stakeholders, influencing strategic decisions.
Required Skills:
- Strong experience leading engineering teams with Scrum and release management.
- Proven ability to convert designs into detailed specifications and supervise implementation.
- Expertise in GCP, including compute, storage, networking, and monitoring services.
- Proficient with Docker, Kubernetes, and CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions).
- Deep knowledge of MongoDB, Aerospike, SQL Server, and PostgreSQL.
- Ability to leverage AI‑Driven development tools and incorporate ML models into production systems.
- Excellent communication, presentation, and stakeholder engagement skills.
- Analytical mindset with a bias for action and creative problem solving.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or related field (equivalent experience acceptable).
- Relevant certifications such as Google Cloud Professional Cloud Architect, Kubernetes Administrator, or equivalent are highly desirable.