Job Specifications
Title: AI Optimization Engineer
Duration: 6 Months
Location: NYC, NY(Onsite)
Long Term Contract
Only W2 (USC OR GC)
Qualifications
Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.
Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.
Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.
Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.
Experienced in scalable infrastructure for deploying and managing large language models (LLMs),
HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.
Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.
Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.
Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.
Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.
Exploratory Data Analysis - Plotly, Seaborn, matplotlib
Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP
Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace
Flask API Development and Security
Container Runtimes: Enroot, Pyxis, Podman
Linux (RHEL/CentOS) System Administration
Model Optimization techniques using Triton with TRTLLM
Desired Qualifications
Experience with data cleaning, feature scaling, and normalization
Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript
Creating vector embeddings
Tools and Platforms like - AWS (SageMaker, Lambda, EC2)
Database Technologies – Oracle, MS-SQL, MongoDB, Redis and MySQL
SQL and PL/SQL Scripting
karthik@itminds.net
About the Company
IT Minds LLC provides the resources for long and short-term contracts. It also offers various product services.
Information Technology , Pharmaceutical , Regulatory Affairs and Health Care Staffing, Product Development, On-site Customer Services, etc. are some of our services.
We have over 15 years of experience placing information technology professionals in permanent positions and consulting assignments. At any level, on any platform, we provide quality professionals in quality positions.
We always look forward to long-...
Know more