← Serch more jobs

Engineering Manager - Forward Deployed Engineering (LLM)

LinkedIn Baseten San Francisco, CA
Not Applicable Posted April 5, 2026 Job link
Thinking about this job
Not Met Priorities
What still needs stronger evidence
Requirements
  • 4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity.
  • Strong programming skills in Python, with production experience in building or optimizing ML inference systems.
  • Proven experience with LLMs, inference optimization, or serving frameworks (e.g., vLLM, TensorRT, Triton, Hugging Face, Ray Serve).
  • Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
  • Excellent communication and collaboration skills—able to lead cross-functional efforts and drive outcomes in ambiguous, fast-paced environments.
Preferred Skills
  • Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
  • Experience leading customer-facing engineering teams or working directly with enterprise partners.
  • Deep understanding of GPU infrastructure, distributed inference, or model compression techniques.
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Education
  • (Not required) – Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field.