Senior Data Engineer
C2i Genomics
At Veracyte, we offer exciting career opportunities for those interested in joining a pioneering team that is committed to transforming cancer care for patients across the globe. Working at Veracyte enables our employees not only to make a meaningful impact on the lives of patients, but also to learn and grow within a purpose-driven environment. This is what we call the Veracyte way – it’s about how we work together, guided by our values, to give clinicians the insights they need to help patients make life-changing decisions.
Our Values:
- We Seek A Better Way: We innovate boldly, learn from our setbacks, and are resilient in our pursuit to transform cancer care
- We Make It Happen: We act with urgency, commit to quality, and bring fun to our hard work
- We Are Stronger Together: We collaborate openly, seek to understand, and celebrate our wins
- We Care Deeply: We embrace our differences, do the right thing, and encourage each other
The Position:
The Senior Data Engineer will contribute to Veracyte’s success by designing, developing, and maintaining scalable cloud data infrastructure and pipelines to support the company’s data engineering needs. This role involves hands-on work with data lakes, meshes, and catalogs, collaborating with cross-functional teams in a Scrum environment to deliver high-quality data solutions. The Senior Data Engineer will support the implementation of data management frameworks and align with Veracyte’s global data strategy, policies, and digital transformation initiatives, including the Veracyte Lakehouse built on AWS and Snowflake.
The position is based in our San Diego office (hybrid); we are also open to US-remote candidates working Pacific Time hours.
Key Responsibilities:
- Design and Develop Data Infrastructure:
  - Build and maintain scalable, efficient data pipelines and infrastructure for Lakehouse systems, including bronze, silver, and gold data layers.
  - Work with technologies such as Amazon S3, Snowflake, AWS Glue, Lake Formation, and SageMaker for data storage, processing, and analytics.
- Collaborate Across Teams:
  - Partner with the Technical Program Manager (TPM), data scientists, and stakeholders to understand business requirements and translate them into technical data solutions.
  - Participate in Scrum processes, including backlog grooming, sprint planning, and handling data set requests via Jira.
- Optimize and Secure Data:
  - Optimize data retrieval, processing, and ELT workflows for improved performance and reliability.
  - Implement data security measures, governance policies, and compliance with PHI, consent, and regulatory requirements.
- Support Data Management Initiatives:
  - Assist in identifying and assessing internal and external data sources for the data catalog.
  - Contribute to the evaluation, development, or integration of user-friendly data catalog applications aligned with best practices.
  - Help provide training and support to users of the data catalog.
- Contribute to Data Strategy:
  - Provide technical input to support the development and implementation of Veracyte’s data strategy and policies.
  - Collaborate on defining user stories, data quality levels (e.g., Medallion architecture), and access controls for datasets.
  - Support data acquisition, curation, and delivery for use cases like AI model training, clinical decision support, and operational efficiency.
- Mentorship and Knowledge Sharing:
  - Mentor junior data engineers and foster a culture of continuous learning.
  - Share expertise in data engineering best practices, emerging technologies, and tools like Apache Parquet, Iceberg, and Zero-ETL integrations.
Who You Are:
- Education: Bachelor’s or Master’s degree in Engineering, Computer Science, or a related field.
- Experience:
  - 6+ years of experience (BS) or 3+ years (MS) in data engineering or a similar role.
  - Hands-on experience with designing and deploying data pipelines in cloud environments, preferably AWS and/or GCP.
- Technical Skills:
  - Proficiency in programming languages such as Python, Java, or Scala.
  - Experience with AWS services (S3, Glue, Lake Formation, SageMaker) and Snowflake for data warehousing, ELT processes, and data modeling.
  - Familiarity with data cataloging tools, data lakes, and governance best practices.
  - Knowledge of open formats like Apache Parquet and Iceberg is a plus.
- Soft Skills:
  - Strong problem-solving and analytical skills.
  - Excellent communication and collaboration abilities to work effectively in a cross-functional Scrum team.
  - Ability to thrive in a fast-paced, dynamic environment focused on data as a service (DaaS).
The final salary offered to a successful candidate will be dependent on several factors that may include but are not limited to the type and length of experience within the job, type and length of experience within the industry, education, etc. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. Veracyte is a multi-state employer, and this salary range may not reflect positions that work in other states.
What We Can Offer You
Veracyte is a growing company that offers significant career opportunities if you are curious, driven, patient-oriented and aspire to help us build a great company. We offer competitive compensation and benefits, and are committed to fostering an inclusive workforce, where diverse backgrounds are represented, engaged, and empowered to drive innovative ideas and decisions. We are thrilled to be recognized as a 2024 Certified™ Great Place to Work® in both the US and Israel - a testament to our dynamic, inclusive, and inspiring workplace where passion meets purpose.
About Veracyte
Veracyte (Nasdaq: VCYT) is a global genomic diagnostics company that improves patient care by providing answers to clinical questions, informing diagnosis and treatment decisions throughout the patient journey in cancer and other diseases. The company’s growing menu of genomic tests leverages advances in genomic science and technology, enabling patients to avoid risky, costly diagnostic procedures and quicken time to appropriate treatment. The company’s tests in lung cancer, prostate cancer, breast cancer, thyroid cancer, bladder cancer and idiopathic pulmonary fibrosis are available to patients, and its lymphoma subtyping and renal cancer tests are in development. With Veracyte’s exclusive global license to a best-in-class diagnostics instrument platform, the company is positioned to deliver its tests to patients worldwide. Veracyte is based in South San Francisco, California. For more information, please visit www.veracyte.com and follow the company on X (formerly Twitter).
Veracyte, Inc. is an Equal Opportunity Employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status or disability status. Veracyte participates in E-Verify in the United States. View our CCPA Disclosure Notice.
If you receive any suspicious alerts or communications through LinkedIn or other online job sites for any position at Veracyte, please exercise caution and promptly report any concerns to careers@veracyte.com.