cover image
Scale AI

Scale AI

scale.com

11 Jobs

4,684 Employees

About the Company

At Scale, our mission is to accelerate the development of AI applications. We believe that to make the best models, you need the best data.

The Scale Generative AI Platform leverages your enterprise data to customize powerful base generative models to safely unlock the value of AI. The Scale Data Engine consists of all the tools and features you need to collect, curate and annotate high-quality data, in addition to robust tools to evaluate and optimize your models. Scale powers the most advanced LLMs and generative models in the world through world-class RLHF, data generation, model evaluation, safety, and alignment.

Scale is trusted by leading technology companies like Microsoft and Meta, enterprises like Fox and Accenture, Generative AI companies like Open AI and Cohere, U.S. Government Agencies like the U.S. Army and the U.S. Airforce, and Startups like Brex and OpenSea.

Listed Jobs

Company background Company brand
Company Name
Scale AI
Job Title
ML Research Engineer, ML Systems
Job Description
**Job Title** ML Research Engineer, ML Systems **Role Summary** Design, develop, and optimize a distributed framework for large‑language‑model (LLM) training and inference. Work cross‑functionally with research and engineering teams to accelerate ML research, improve system performance, and enable next‑generation LLM training, inference, and data curation. **Expectations** - Deliver a high‑performance, scalable ML platform that supports fast, automatic LLM training and evaluation. - Continuously profile, benchmark, and enhance system efficiency for multi‑node operations. - Integrate state‑of‑the‑art technologies and research advances into production systems. - Communicate progress, challenges, and solutions clearly across cross‑functional teams. **Key Responsibilities** - Build and maintain the core training and inference framework for large‑scale LLM workloads. - Profile and optimize GPU/CPU utilization, memory footprint, and network traffic in distributed settings. - Collaborate with ML scientists and data engineers to provide tooling that accelerates model development and data curation pipelines. - Research, evaluate, and integrate cutting‑edge system components (e.g., flash attention, transformer optimizations, custom CUDA kernels). - Document architecture, performance metrics, and best practices for internal usage. **Required Skills** - Strong passion for system optimization and performance engineering. - Proven experience with multi‑node LLM training and inference pipelines. - Hands‑on experience building large‑scale distributed ML systems (e.g., cluster scheduling, fault tolerance). - Advanced software engineering skills; proficiency with CUDA, PyTorch, Hugging Face Transformers, and related libraries. - Ability to write clean, maintainable code and to create reproducible benchmarks. - Excellent written and verbal communication; comfortable working in a cross‑functional environment. **Nice to Have** - Expertise in post‑training methods (instruction tuning, RLHF), next‑generation LLM use cases (tool use, reasoning, agents, multimodal). **Required Education & Certifications** - Bachelor’s degree or higher in Computer Science, Electrical Engineering, or a related technical field. - Relevant certifications (e.g., GPU programming, distributed systems) are a plus but not mandatory.
New york city, United states
Hybrid
24-11-2025
Company background Company brand
Company Name
Scale AI
Job Title
Engineering Manager, International Public Sector
Job Description
**Job Title:** Engineering Manager, International Public Sector **Role Summary:** Lead and grow a high‑performing engineering team delivering backend services for AI‑driven public sector applications. Drive technical delivery from concept through production, ensure quality at scale, and partner cross‑functionally to align engineering solutions with business goals. **Expectations:** - Manage and mentor engineers, fostering a collaborative culture. - Deliver high‑velocity experiments and production‑grade features. - Translate business/product ideas into robust engineering solutions. - Operate effectively in a fast‑paced, international environment. **Key Responsibilities:** - Oversee end‑to‑end development of backend services for AI agents, evaluation tools, and automation. - Design, build, debug, test, and optimize scalable systems. - Coordinate with internal stakeholders and external public‑sector clients throughout the product lifecycle. - Influence engineering processes, standards, and team values. - Ensure high‑quality, reliable releases and maintain platform stability. **Required Skills:** - ≥5 years of software engineering experience, ≥2 years in engineering management (preferred). - Proven track record of shipping large‑scale consumer or enterprise products. - Strong expertise in backend architecture, APIs, and cloud infrastructure. - Ability to analyze product engagement data and drive feature decisions. - Excellent problem‑solving, communication, and cross‑functional collaboration skills. - Experience with AI/ML platforms or generative AI is a plus. **Required Education & Certifications:** - Bachelor’s degree in Computer Science, Engineering, or related technical field (Master’s preferred). - No specific certifications required; leadership or project‑management credentials (e.g., PMP, Scrum Master) are advantageous.
London, United kingdom
On site
Mid level
26-12-2025
Company background Company brand
Company Name
Scale AI
Job Title
DevOps Engineer, IPS
Job Description
**Job title** DevOps Engineer, IPS **Role summary** Design, build, and maintain secure, scalable cloud‑native backend systems and infrastructure for AI applications serving the public sector. Own services, implement IaC, CI/CD, networking, and disaster‑recovery pipelines while ensuring compliance with privacy and security standards. **Expectations** - 5+ years of engineering experience post‑graduation (or equivalent) - Proven ownership of end‑to‑end engineering projects - Strong collaborative mindset across cross‑functional teams - Continuous improvement of tooling, processes, and engineering standards **Key responsibilities** - Design and implement secure, scalable backend services using Python, TypeScript, JavaScript, or C++ - Own system health, define long‑term goals, and drive reliability improvements - Write and maintain Terraform/CloudFormation IaC for automated cloud provisioning - Manage networking: VPCs, VPNs, load balancers, and firewalls in AWS/Azure environments - Build and optimise CI/CD pipelines with CircleCI, GitHub Actions, or similar tools - Deploy containerised applications on Kubernetes, ensuring high availability and scalability - Develop disaster‑recovery plans, backups, failover mechanisms, and hybrid/multi‑cloud strategies - Enhance engineering standards, tooling, and processes across the organization **Required skills** - Proficiency in Python, TypeScript, JavaScript, or C++ (backend development) - Experience with AWS or Azure public cloud platforms - Hands‑on use of containerisation (Kubernetes, Docker) and IaC (Terraform, CloudFormation) - Strong grasp of CI/CD tooling (CircleCI, GitHub Actions, etc.) - Knowledge of network engineering fundamentals (VPC, VPN, load balancing, firewalls) - Familiarity with distributed systems concepts - Ability to work independently and own complex projects **Required education & certifications** - Bachelor’s degree in Computer Science, Mathematics, or a related quantitative field (or equivalent practical experience) - Relevant cloud certifications (AWS Certified Solutions Architect, Azure Administrator, etc.) preferred but not mandatory.
London, United kingdom
On site
Mid level
25-12-2025
Company background Company brand
Company Name
Scale AI
Job Title
Machine Learning Research Engineer - Robotics
Job Description
**Job Title:** Machine Learning Research Engineer – Robotics **Role Summary:** Lead applied research and development of machine‑learning pipelines for robotics data, model training, fine‑tuning, and evaluation. Partner with customers and cross‑functional teams to advance robotic perception and policy models, and integrate outcomes into Scale’s platform. **Expectations:** - Drive research initiatives in VLA (Vision‑Language‑Action) models and embodied AI. - Build and maintain scalable ML pipelines using proprietary robotics data. - Publish and present research findings; communicate results to stakeholders. - Operate autonomously while collaborating across product, engineering, and customer teams. **Key Responsibilities:** - Design and implement training/fine‑tuning pipelines for VLA models on large‑scale robotics datasets. - Conduct research on data collection strategies, cross‑embodiment training, and policy refinement. - Create novel evaluation metrics and industry benchmarks for VLA models. - Partner with customers to improve data acquisition and model performance. - Work with product teams to deploy ML outcomes onto Scale’s robotics platform. - Document research, publish papers, and present at internal/external forums. **Required Skills:** - 3+ years of experience in robotics, computer vision, embodied AI, sim‑to‑real, imitation learning, reinforcement learning, or vision‑language‑action models. - Proven ability to build and train VLA or similar models; experience with large‑scale data pipelines. - Strong research record with peer‑reviewed publications in robotics or related fields. - Proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow). - Experience in data collection, annotation, and model evaluation. - Excellent written and verbal communication; ability to work with customers and cross‑functional teams. - High intellectual curiosity, autonomy, and collaborative mindset. **Required Education & Certifications:** - Ph.D. in Machine Learning, Robotics, Computer Science, or equivalent practical experience. - Equivalent industry experience may be considered in lieu of formal degree.
San francisco bay, United states
Hybrid
Junior
29-12-2025