DeepRec.ai

linktr.ee

1 Job

13 Employees

About the Company

We are your Deep Tech recruitment specialists, driven by a mission to power progress in the world’s most exciting industries.

We build high-quality connections through a dedicated community of AI specialists, bridging the gap between Deep Tech pioneers and world-class talent.

Our consultants specialise in:

- Computer Vision
- NLP
- Data Science
- Machine Learning
- C++
- Blockchain

We are part of Trinnovo Group – united by a shared mission to build diversity, create inclusion and encourage workplace innovation. Our community groups are:

• Ex-Military Careers – Bridging the gap between the military and a meaningful civilian career
• Women in DevOps – Closing the DevOps gender gap and inspiring future leaders in tech
• Pride in Tech – Creating a kinder and safer space for queer people in technology
• Ethnicity Speaks – Championing an equitable workplace for people of all ethnic backgrounds

If you’re looking for Deep Tech insights from VCs, founders, and innovators, check out our podcast, the Leadership Lab on Spotify.

Our Group awards and accreditations include:

• B Corp
• Investors in People Platinum
• Recruiter Award for Diversity, Equality, and Inclusivity Service Excellence 2024
• SIA Top 100 Europe Staffing Leaders, 2024
• TIARA's 'Best Company to Work for (£20-£50m)' Award, 2022, 2021 & 2020
• TIARA’s ‘Diversity, Equity & Inclusion’ Award 2022
• APSCo 'Diversity & Inclusion' Award for Excellence, 2018
• SIA 'Best Staffing Firms to Work for across the US' 2022
• Best Companies' 3-Star Accreditation 2022 & 2021
• TIARA's 'Growth Award' 2021 & 2020
• TIARA's 'Recruitment Leader of the Year' 2021

No more generic searches. This is Deep Tech recruitment, evolved.

Visit our website to find out more: www.deeprec.ai

Listed Jobs

Company Name
DeepRec.ai
Job Title
Senior ML Infra Engineer
Job Description
Job title: Senior Machine Learning Infrastructure Engineer

Role Summary: Own, build, and scale end‑to‑end ML infrastructure for physics‑based foundation models in a fast‑moving AI startup. Drive production‑grade training, fine‑tuning, serving, and data pipelines across cloud and on‑prem environments, partnering closely with customers and executive leadership.

Expectations: Minimum 3 years of experience designing and deploying scalable ML infrastructure, with proven proficiency in AWS/GCP/Azure, Kubernetes, Docker, IaC, and distributed training frameworks. Strong Python, debugging, and execution skills.

Optional: a background in physics or simulation, experience with regulated deployments, GPU optimization, and open‑source contributions.

Key Responsibilities:

- Architect and manage multi‑GPU/multi‑node distributed training and fine‑tuning clusters.
- Design low‑latency, highly reliable inference and model serving systems.
- Build secure, automated fine‑tuning pipelines for customer data workflows.
- Deploy ML solutions across cloud and on‑prem (including enterprise/air‑gapped) environments.
- Construct data pipelines for large‑scale simulation and CFD datasets.
- Implement observability, monitoring, and debugging across training, serving, and data pipelines.
- Collaborate directly with customers on deployment, integration, and scaling.
- Rapidly transition prototypes to production‑grade infrastructure.

Required Skills:

- Distributed training frameworks (PyTorch Distributed, DeepSpeed, Ray, etc.)
- Cloud platforms (AWS, GCP, Azure) and hybrid deployment environments
- Kubernetes, Docker, and infrastructure‑as‑code tools (e.g., Terraform, Helm)
- Python programming and an understanding of the end‑to‑end ML lifecycle
- Distributed systems, networking, and security fundamentals
- Debugging, performance tuning, and scaling experience
- Strong communication, collaboration, and independent ownership capability

Required Education & Certifications:

- Bachelor’s (or higher) in Computer Science, Engineering, or related STEM field (or equivalent experience)
San Francisco, United States
On site
Senior
16-03-2026