cover image
AceStack

DevOps SRE with AI Ops____Montreal, QC (Hybrid) Need F2F in Final Round _____Full Time Permanent

Hybrid

Montreal, Canada

Full Time

12-02-2026

Share this job:

Skills

Communication Python Java Go Incident Response DevOps Docker Kubernetes Monitoring Ansible Networking Programming TCP/IP Terraform Prometheus Grafana

Job Specifications

Role- DevOps SRE with AI Ops

Location- Montreal, QC (Hybrid) Need F2F in Final Round

Full Time Permanent

Need 12+ yrs of experience

Skills Required :

Production experience in SRE / Infrastructure / ops for large-scale systems
Strong programming/scripting skills (Python, Go, Java, or equivalent)
Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)
Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)
Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architectures
Experience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)
Production experience in SRE / Infrastructure / ops for large-scale systems
Strong programming/scripting skills (Python, Go, Java, or equivalent)
Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)
Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)
Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architectures
Experience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)
Networking & systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)
Solid experience in capacity planning, performance tuning, scaling, and incident response
Demonstrated ability to lead RCAs, deploy fixes, and drive reliability improvements
Experience in regulated environments (financial services, compliance, audit, security) is a strong plus
Excellent communication, documentation, and cross-team collaboration skills
Proven track record of reducing operational toil via automation

About the Company

AceStack is a global IT consulting and technology solutions company, founded in 2017 in New Jersey, USA. Since inception, we have demonstrated consistent growth and an unwavering commitment to delivering exceptional value to our clients. Our consultants are strategically located across the USA, Canada, and Asia, enabling us to provide localized support with a global perspective. In addition to our headquarters in New Jersey, we maintain offices in Canada, Noida, and Ahmedabad, empowering our teams to collaborate effectively... Know more