cover image
SRM Digital LLC

AI/ML Architect

On site

Philadelphia, United states

Senior

Freelance

18-02-2026

Share this job:

Skills

Leadership Incident Response CI/CD Monitoring Training Architecture Machine Learning Organization Azure AWS GCP

Job Specifications

Description:

• Seeking a senior AI/ML platform leader to design and operationalize a scalable production-ready AI/ML architecture across multiple business units.

• This role is responsible for moving the organization from proof of concept AI efforts to repeatable, governed delivery of models in production.

• The ideal candidate has built and deployed machine learning systems at scale, understands both ML development and platform engineering, and can define the environments, pipelines, and architectural standards that enable the team to safely and efficiently ship AI.

• This is a hands-on architectural leadership role with significant influence across product, engineering, data, and security teams.

Key responsibilities:

• Include AIML architecture and platform design define the end-to-end AI/ML reference architecture from Data injection through model serving and monitoring. Establish standards for data storage, access patterns, and lineage, including separation of raw, curate,d and feature ready data. Assess and define the need for shared platform capabilities such as

o feature stores

o model registries

o AIML catalogues

o experiment tracking

o design for scale across multiple business units with differing data sensitivity, regulatory, and operational needs

• Environment and delivery pipeline: Define standard development, validation, and production environments for AIML workloads. Design a repeatable ML delivery pipeline covering

o model development and training,

o validation, approval, and promotion,

o deployment (batch and/or real time)

o monitoring drift detection and retraining

o establish CI/CD (and continuous training where appropriate) best practices for ML systems

• MLOps governance and production readiness: Define what production readiness means for AIML models, including:

o testing and validation requirements,

o monitoring and alerting

o rollback and incident response patterns,

o partner with security, legal, and compliance team to integrate governance without slowing delivery

o ensure models are discoverable, auditable, and traceable across the environment and business units

• Enablement and operating model: Create a paved road for AI development

o shared standards, templates, and tooling thatthe business unit can use against

o advise and enable product teams and engineering teams moving models from POC to production

o help define the long-term operating model for AI/ML ownership across central platform teams and federated BU teams

Required qualifications:

• 15+ years of experience in software engineering, data platforms or ML engineering

• 5+ years of hands-on experience deploying machine learning systems into production

• proven experience in designing AI/ML platform or MLOps architectures (not just individual models)

• strong understanding of:

o ML life cycle management data,

o Data pipelines and feature engineering

o model serving patterns (batch, real-time, API’s )

o experience working across organizational boundaries (multiple business teams and units)

o Ability to communicate architectural designs and decisions clearly to both technical and non-technical stakeholders

Preferred qualification: Experience designing or operating

• features stores

• model registries and experiment tracking platforms,

• AI governance and risk framework

• familiarity with cloud, native ML platforms, and infrastructure (AWS, GCP, Azure, or similar)

• Experience in monitoring teams and establishing standards at scale

• background in a regulated or data-sensitive environment

What does success look like in 12 to 18 months:

• A documented adopted AIML reference architecture used across business units

• Clear standardized environment and pipelines enabling faster promotion from POC to production

• Reduced the duplication of feature engine engineering and model deployment efforts

• consistent visibility into which models are running in production and how they perform

• Increase confidence from leadership and the organization’s ability to deliver AI responsibly and scale.

About the Company

We at SRM Digital are focused towards connecting businesses with top talent across various industries. With a Commitment to excellence, we specialize in providing tailored recruitment services that meet the unique needs of our clients & Candidates. Know more