← Serch more jobs

Senior AI/ML Lead Engineer

LinkedIn Toyota North America Plano, TX
Not Applicable Posted March 29, 2026 Job link
Thinking about this job
Not Met Priorities
What still needs stronger evidence
Requirements
  • 7+ years of software engineering experience, including 3–5 years focused specifically on ML/AI in production, with a track record of operating at a principal or staff engineer level
  • Deep understanding of machine learning fundamentals: supervised and unsupervised learning, deep learning architectures (transformers, CNNs, RNNs), optimization techniques, and evaluation methodologies
  • Hands-on experience with large language models: prompt engineering, fine-tuning (LoRA, QLoRA), RAG pipelines, embedding models, vector databases, and agent frameworks (LangChain, LlamaIndex, or similar)
  • Production experience with AWS AI/ML services, including:
  • Amazon Bedrock for foundation model access, fine-tuning, and knowledge bases or
  • Amazon SageMaker for custom model training, hosting, and MLOps pipelines
  • Lambda and Step Functions for orchestrating inference workflows
  • S3 for data lakes and model artifact storage
  • EventBridge, SQS, or SNS for event-driven ML pipelines
  • OpenSearch or similar for vector search and semantic retrieval
  • Strong proficiency in Python or Typescript — you write production-quality ML code, not just notebooks
  • Experience with core ML frameworks: PyTorch, TensorFlow, or JAX, and libraries like Hugging Face Transformers, scikit-learn, and XGBoost
  • Solid understanding of MLOps practices: experiment tracking (MLflow, W&B), model registries, CI/CD for ML, A/B testing, and canary deployments for models
  • Experience with data engineering fundamentals: ETL pipelines, feature stores, data validation, and working with structured and unstructured data at scale
  • Strong understanding of Infrastructure as Code using AWS CDK, CloudFormation, or Terraform for ML infrastructure
  • Experience with observability and monitoring for ML systems: model performance tracking, data drift detection, and alerting
  • Deep experience debugging complex issues across ML systems — from training instabilities to inference latency to data pipeline failures
  • Strong written and verbal communication — you can write a clear RFC, lead a design review, and explain model tradeoffs to a non-technical stakeholder Added bonus if you have
  • Experience in the financial services, banking, or insurance industry
  • Experience with responsible AI: fairness metrics, bias detection, explainability (SHAP, LIME), and model governance frameworks
  • Familiarity with computer vision or NLP beyond LLMs (named entity recognition, document understanding, OCR)
  • Experience with real-time inference at scale: model optimization (quantization, distillation, ONNX), GPU/accelerator management, and latency-sensitive serving
  • Experience with multi-modal models and architectures that combine text, image, and structured data
  • Hands-on experience with GraphQL federation or API gateway patterns for exposing ML services
  • Experience with containerized ML workloads (ECS Fargate, Docker, Kubernetes) for training and serving
  • AWS certifications (Machine Learning Specialty, Solutions Architect, Developer Associate)
  • Published research or conference presentations in ML/AI
  • Experience contributing to or maintaining open-source ML projects
  • Experience defining engineering standards, writing ADRs, or leading org-wide technical initiatives
Preferred Skills
  • Production experience with AWS AI/ML services, including:
  • Amazon SageMaker for custom model training, hosting, and MLOps pipelines
  • EventBridge, SQS, or SNS for event-driven ML pipelines
  • OpenSearch or similar for vector search and semantic retrieval
  • Strong proficiency in Python or Typescript — you write production-quality ML code, not just notebooks
  • Experience with core ML frameworks: PyTorch, TensorFlow, or JAX, and libraries like Hugging Face Transformers, scikit-learn, and XGBoost
  • Solid understanding of MLOps practices: experiment tracking (MLflow, W&B), model registries, CI/CD for ML, A/B testing, and canary deployments for models
  • Strong understanding of Infrastructure as Code using AWS CDK, CloudFormation, or Terraform for ML infrastructure
  • Experience with observability and monitoring for ML systems: model performance tracking, data drift detection, and alerting
  • Deep experience debugging complex issues across ML systems — from training instabilities to inference latency to data pipeline failures
  • Strong written and verbal communication — you can write a clear RFC, lead a design review, and explain model tradeoffs to a non-technical stakeholder Added bonus if you have
  • Experience in the financial services, banking, or insurance industry
  • Experience with responsible AI: fairness metrics, bias detection, explainability (SHAP, LIME), and model governance frameworks
  • Familiarity with computer vision or NLP beyond LLMs (named entity recognition, document understanding, OCR)
  • Experience with real-time inference at scale: model optimization (quantization, distillation, ONNX), GPU/accelerator management, and latency-sensitive serving
  • Experience with multi-modal models and architectures that combine text, image, and structured data
  • Hands-on experience with GraphQL federation or API gateway patterns for exposing ML services
  • Experience with containerized ML workloads (ECS Fargate, Docker, Kubernetes) for training and serving
  • AWS certifications (Machine Learning Specialty, Solutions Architect, Developer Associate)
  • Published research or conference presentations in ML/AI
  • Experience contributing to or maintaining open-source ML projects
  • Experience defining engineering standards, writing ADRs, or leading org-wide technical initiatives
Education
  • (Not required) – Bachelor's degree in Computer Science, Machine Learning, Statistics, or related field, or equivalent practical experience
  • (Not required) – Master's or PhD in Machine Learning, AI, Computer Science, Statistics, or related field
  • (Not required) – AWS certifications (Machine Learning Specialty, Solutions Architect, Developer Associate)