Data Engineer Intern

LinkedIn GoodRx San Francisco, CA
Posted April 4, 2026
Requirements
  • Experience with at least one cloud data platform, such as AWS, Azure, or GCP.
  • Experience engineering data pipelines with big data technologies (Python, PySpark, and real-time data platforms such as ActiveMQ, Kafka, or Kinesis) on large-scale data sets.
  • Experience writing complex SQL and developing ETL processes, including processing extremely large data sets.
  • Demonstrated ability to analyze large data sets to identify gaps and inconsistencies, provide data insights, and advance effective product solutions.
  • Familiarity with AWS services (S3, EventBridge, Glue, EMR, Redshift, Lambda).
  • Ability to quickly learn complex domains and new technologies.
  • Innately curious and organized, with the drive to analyze data to identify deliverables, anomalies, and gaps, and to propose solutions that address these findings.
  • Experience using Jira, GitHub, Docker, Codefresh, and Terraform.
  • Experience contributing to full lifecycle deployments with a focus on testing and quality.
  • Experience with data quality processes: data quality checks, validations, and the definition and measurement of data quality metrics.
Preferred Skills
  • Experience with at least one cloud data platform, such as AWS, Azure, or GCP.
  • Familiarity with AWS services (S3, EventBridge, Glue, EMR, Redshift, Lambda).
  • Experience using Jira, GitHub, Docker, Codefresh, and Terraform.
  • Experience with data quality processes: data quality checks, validations, and the definition and measurement of data quality metrics.
Education
  • (Not required) – Bachelor’s degree in analytics, statistics, engineering, math, economics, science or related discipline.