← Serch more jobs

Lead Associate — Generative AI & Applied Data Science

LinkedIn Fannie Mae Washington, DC
Not Applicable Posted April 4, 2026 Job link
Thinking about this job
Not Met Priorities
What still needs stronger evidence
Requirements
  • 4 years of experience.
  • Experience working at a large financial institution and demonstrable familiarity with financial accounting, capital, or mortgage/loan data and workflows.
  • 2+ years of relevant industry experience building large-scale machine learning or deep learning models/systems (for Lead Associate level, 3+ years is preferred).
  • Hands-on programming experience in Python (3+ years recommended) and familiarity with Linux-based environments.
  • Experience working in cloud environments (e.g., AWS) and comfortable with tools such as SageMaker, Jupyter, Spark.
  • Practical experience with NLP, NLG and Large Language Models (LLMs) and GenAI tools (for example: GPT-4, OpenAI APIs, LLaMA, Claude, etc.).
  • Demonstrated experience with model development and MLOps workflows (data prep, training, evaluation, CI/CD, model deployment, monitoring).
  • Familiarity with Git, build/deploy tools (Jenkins/GitHub Actions/GitLab CI), and container workflows (Docker/Kubernetes) is expected.
  • SQL skills and experience with relational and analytics databases (e.g., Redshift, Postgres, Oracle, Hive, EMR).
  • Excellent written and verbal skills and the ability to proactively communicate and collaborate with stakeholders across business, engineering, and controls teams.
  • Experience building reproducible, productionized data workflows and visualizations (Tableau, Kibana, QuickSight, etc.).
Preferred Skills
  • PhD (preferred) or MS in Finance, Economics, Computer Science, Statistics, Math, or a related field.
  • 3+ years building large-scale ML/DL systems in production, with at least 1 year of focused deep-learning / LLM/GenAI work.
  • Prior experience developing and deploying LLM agents or agentic systems.
  • Experience with MLOps platforms (Domino, Sagemaker, or similar) and CI/CD for ML.
  • Deep learning frameworks: TensorFlow, Keras, PyTorch.
  • Applied NLP/GenAI frameworks: Hugging Face transformers, LangChain, RAG architectures, LoRA, PEFT, LLM fine-tuning approaches.
  • Vector search / retrieval: Vector DBs, FAISS, Milvus, Pinecone, or cloud equivalents; experience implementing retrieval-augmented generation (RAG).
  • NLP toolkits and techniques: spaCy, NLTK, topic modeling, sentiment analysis, NER, POS tagging, TF-IDF, text embedding workflows.
  • Familiarity with LLM-centric architectures and patterns: agentic programming / LLM Agents, Chain-of-Thought, Tree-of-Thought, Human-in-the-Loop (HITL) design.
  • Experience with search and indexing technologies (Elasticsearch / OpenSearch / Solr) and knowledge graphs/ontologies (OWL, RDF, SPARQL) is a plus.
  • Image model knowledge (e.g., ResNet, CLIP) is a plus for multimodal use cases.
  • Experience building reproducible, productionized data workflows and visualizations (Tableau, Kibana, QuickSight, etc.).
  • Demonstrated success translating analytics into business impact in a fast-paced, cross-functional environment.
  • Strong scripting skills (Shell, Python) and familiarity with Spark / PySpark for large-scale data processing.
  • Comfortable working with ambiguous problems, imperfect data, and evolving requirements.
Education
  • (Not required) – A PhD in finance, economics, computer science (or strongly related field) is preferred.
  • (Not required) – Bachelor’s degree in Computer Science, Data Science, Engineering, Finance, Mathematics, Physics, Statistics, Business Analytics, or a related field.
  • (Not required) – PhD preferred (see desired).
  • (Not required) – PhD (preferred) or MS in Finance, Economics, Computer Science, Statistics, Math, or a related field.
  • (Not required) – Education:
  • (Required) – Bachelor's Level Degree (Required), Master's Level Degree