Data engineer healthcare data Job at Techmasters, San Francisco, CA

Y1A4Z1hGL0doNEt6NXliT0dBSEg0T09mZVE9PQ==
  • Techmasters
  • San Francisco, CA

Job Description

Join to apply for the Data Engineer (Healthcare Data) role at Pangaea Data .

About Pangaea Data

Pangaea Data (Pangaea) is a South San Francisco and London based business founded by Dr Vibhor Gupta and Prof Yike Guo (Director Data Science Institute at Imperial College London; Provost, Hong Kong University of Science and Technology). They have worked in medicine and computing for over 20 years and have raised over $300 million through their academic research, including a $110 million grant focused on development work on large language models in medicine. Pangaea’s AI platform, PALLUX, is configured on clinical guidelines to find more untreated (undiagnosed, miscoded, at-risk) and under-treated patients with hard-to-diagnose conditions for screening and treatment at the point of care.

The Role

As Data Engineer (Healthcare Data), you will join Pangaea’s team to lead and support the development of reliable, scalable, and secure data solutions. The ideal candidate will be experienced with healthcare data standards (e.g. FHIR, OMOP), possess a strong understanding of data privacy regulations (e.g., HIPAA, GDPR), and have technical expertise to design and implement data pipelines, storage systems, and integrations. A strong software engineering background and knowledge in AI, especially Machine Learning and Natural Language Processing, is essential. For the right candidate, this is a senior technical position with scope to grow into a leadership role.

Key Technical Responsibilities Will Include


  • Design, implement, and maintain ETL pipelines to collect, clean, and transform healthcare data from various sources such as EHR systems, APIs, and databases.
  • Ensure data quality and integrity through robust testing and validation processes.
  • Optimize storage solutions for structured and unstructured healthcare data using databases (e.g., MongoDB) and cloud-based data warehouses (e.g., Azure Cosmos, Azure Fabric).
  • Maintain strict compliance with data privacy regulations such as HIPAA, GDPR, and other local healthcare policies.
  • Work closely with the clinical team to understand data requirements and translate them into technical solutions.
  • Collaborate with the AI team to provide clean, well-structured datasets for research, and AI/ML models.
  • Stay up-to-date with the latest data engineering technologies and best practices.

Mandatory Requirements



Technical Skills


  • Experience working with Electronic Health Records (EHR) systems (e.g. Epic, Cerner).
  • A university qualification (Bachelors, Masters, Doctorate) with at least two years of university study in Computer Science, Informatics, Data Science, Engineering, or related.
  • Experience in data engineering, with a focus on healthcare data preferred.
  • Familiarity with NoSQL databases (e.g., MongoDB) and relational databases (e.g., PostgreSQL, MySQL).
  • 5+ years in Python and SQL work.
  • Knowledge of ETL tools (e.g., Apache Airflow) and cloud platforms (e.g., AWS, Azure, GCP).
  • Understand data modelling concepts and best practices. Experience with healthcare data standards (e.g., HL7, FHIR, ICD, SNOMED, DICOM) preferred.
  • Excellent problem-solving and communication skills.

Personal Traits


  • Ability to communicate complex ideas effectively, both verbally and written.
  • Ability to engage all levels of the company and the customers’ organizations.
  • Ability to work collaboratively in a team environment.

Nice to Have


  • 3-5 years experience of managing teams.
  • Experience working on large-scale, commercial software development projects is a plus.
  • Experience with research communities and/or efforts, including having published papers (being listed as author) at AI/ML/NLP/CV conferences (e.g. Bio-IT, NeuraIPS, ICML, ICLR, ACL, CVPR and KDD) and journals.
  • Experience and knowledge of deploying AI and Data solutions for healthcare and pharmaceuticals at scale is desirable.

Perks and Benefits


  • Flexible working hours.
  • Salary dependent on experience.
  • Package of attractive benefits including private medical insurance and monthly travel card.
  • You will join a dedicated highly renowned team offering you the opportunity to grow and develop your professional skills and profile.
  • You will have the opportunity to learn about building a startup business from experienced professionals and serial entrepreneurs.

Application Contact Information

Your application should include a CV and cover letter highlighting your relevant experiences and motivations. Please send this to careers@pangaeadata.ai.

General Information

Pangaea Data is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity or expression, religion, national origin or ancestry, age, disability, marital status, pregnancy, protected veteran status, protected genetic information, political affiliation, or any other characteristics protected by local laws, regulations, or ordinances.

Seniority level

  • Mid-Senior level

Employment type

  • Full-time

Job function

  • Information Technology

#J-18808-Ljbffr

Job Tags

Full time, Local area, Flexible hours,

Similar Jobs

GradePower Learning Centers

English & Social Studies Teacher/Tutor Job at GradePower Learning Centers

 ...Part-Time English & Social Studies Teacher/Tutor Wanted Passionate about puns? Excited by literature? Eager to help students succeed in English and also learn about their role in the community? Gain valuable teaching experience! GradePower Learning Cary is currently... 

Get It - Healthcare

Medical Scribe - Remote Job at Get It - Healthcare

 ...scribe experience required) Additional $1/hour for fluent Spanish-speaking candidates Opportunities for experienced scribes to earn up to $16/hour Position Overview: As a remote medical scribe, you'll be an essential part of the healthcare team, working side-by-... 

Riverside Community College District

Assistant Professor, Ethnic Studies (Moreno Valley College) - 2 Positions | Riverside Community College District Job at Riverside Community College District

 ...well as an off-campus site, the Ben Clark Training Center, located approximately 11 miles from the main campus. MVC is committed to educating and empowering our students, providing equitable access to education, and serving our communities. MVC's core mission can be... 

Verizon

Senior Client Executive - SLED Sales Job at Verizon

 ...When you join Verizon You want more out of a career. A place to share your ideas freely...  ...for all offerings, products, and services applicable to the government space. Collaborating...  ...Building trusting relationships with customers. Identifying and qualifying... 

Aflac, Incorporated

Sr International Affairs Analyst Job at Aflac, Incorporated

Salary Range: $105,000 - $115,000 Job Posting End Date: May 31, 2025 Weve Got You Under Our Wing We are the duck. We develop and empower our people, cultivate relationships, give back to our community, and celebrate every success along the way. We do it all...