With a track record of global leadership in bone marrow transplantation, HIV/AIDS prevention, immunotherapy and COVID-19 vaccines, Fred Hutch has earned a reputation as one of the world’s leading cancer, infectious disease and biomedical research centers. Fred Hutch operates eight clinical care sites that provide medical oncology, infusion, radiation, proton therapy and related services, and network affiliations with hospitals in five states. Together, our fully integrated research and clinical care teams seek to discover new cures to the world’s deadliest diseases and make life beyond cancer a reality.
At Fred Hutch we value collaboration, compassion, determination, excellence, innovation, integrity and respect. These values are grounded in and expressed through the principles of diversity, equity and inclusion. Our mission is directly tied to the humanity, dignity and inherent value of each employee, patient, community member and supporter. Our commitment to learning across our differences and similarities make us stronger. We seek employees who bring different and innovative ways of seeing the world and solving problems. Fred Hutch is in pursuit of becoming an anti-racist organization. We are committed to ensuring that all candidates hired share our commitment to diversity, anti-racism and inclusion.
The Data Scientist II/III supports Fred Hutch's clinical data ecosystem for translational research via clinical natural language processing (NLP) to promote the use of textual information in electronic health records (EHRs) and other real-world data (RWD) for observational studies. This position involves implementing rule-based and statistical/machine-learning NLP systems for electronic phenotyping and cohort identification, transformation of the output of NLP tools into structured fields and common data models, collaborations with translational researchers to guide development NLP systems and use of RWD, and building data stewardship and management skills across Fred Hutch.
This role will have the opportunity to work partially at our campus and remotely.
- Develop, evaluate, and productionalize NLP models for mining written clinical text from oncology patient medical records, such as progress notes and pathology reports.
- Work closely with data engineers to design and deploy robust NLP pipelines and ensure high data quality.
- Collaborate with translational researchers and other internal customers to understand desired outcomes and identify new features for NLP systems.
- Work with other clinical Data Scientists to develop templates, workflows, and packages to support use of data generated from NLP models in research.
- BS, BA in information technology, computer science, business analytics, statistics, data science, public health or other scientific or health related field.
- Minimum 5 years of experience in data analytics, data management, governance, security, privacy, or related disciplines.
- Graduate degree in statistics, data science, informatics, computer science, or a related field
- Experience with software development best practices: version control, unit testing, continuous integration, and containers (Docker)
- A functional understanding of medical oncology, cancer epidemiology, or immunotherapy
- Experience with data governance processes for working with regulated data (HIPAA, GDPR, etc.)
- Graduate degree in statistics, data science, informatics, computer science, or a related field
- Experience with software development best practices: version control, unit testing, continuous integration, and containers (Docker)
- A functional understanding of medical oncology, cancer epidemiology, or immunotherapy
- Experience with data governance processes for working with regulated data (HIPAA, GDPR, etc.)
Data Scientist III
- Master’s degree in information technology, computer science, business analytics, statistics, data science, public health or other scientific or health-related field or equivalent combination of education and experience.
- Minimum 8 years of experience in data analytics, data management, governance, security, privacy, or related disciplines.
- Graduate degree in statistics, data science, informatics, computer science, or a related field
- Experience with software development best practices: version control, unit testing, continuous integration, and containers (Docker)
- A functional understanding of medical oncology, cancer epidemiology, or immunotherapy
- Experience with data governance processes for working with regulated data (HIPAA, GDPR, etc.)
- Experience leading the development of coding standards or best practices for statistical code review
- Graduate degree in statistics, data science, informatics, computer science, or a related field
- Experience with software development best practices: version control, unit testing, continuous integration, and containers (Docker)
- A functional understanding of medical oncology, cancer epidemiology, or immunotherapy
- Experience working with standardized medical ontologies such as ICD10, RxNorm, SNOMED CT
- Experience with data governance processes for working with regulated data (HIPAA, GDPR, etc.)
- Experience in community building, group communication, and/or mentoring
A statement describing your commitment and contributions toward greater diversity, equity, inclusion, and antiracism in your career or that will be made through your work at Fred Hutch is requested of all finalists.
Fred Hutchinson Cancer Center offers employees a comprehensive benefits package designed to enhance health, well-being, and financial security. Benefits include medical/vision, dental, flexible spending accounts, life, disability, retirement, family life support, employee assistance program, onsite health clinic, tuition reimbursement, paid vacation (12-22 days per year), paid sick leave (12-25 days per year), paid holidays (13 days per year), paid parental leave (up to 4 weeks), and partially paid sabbatical leave (up to 6 months).