Remote
Duties:
- As a Data Curation Engineer in the Genomics Research Center, you will be responsible for building and running workflows to load and manage high-value datasets in a centralized environment.
- You will work closely with Bioinformatics Engineering and bioinformatics research scientists to identify data sources and requirements for loading and querying.
- Your expertise in PostgreSQL for database management, and in Python and R for scripting and automation, will be crucial to developing and maintaining ETL processes that ensure data quality and integrity.
Responsibilities:
- Develop and implement workflows to load and manage genomic data.
- Work with researchers and data scientists to identify data sources and requirements for loading new datasets.
- Maintain existing data models for storing and querying genomic data.
- Develop and maintain ETL processes to ensure data quality and integrity.
- Build and maintain scripts for automation of data loading and processing.
Qualifications:
- Bachelor's degree in computer science, bioinformatics, or a related field, plus 3 years of experience.
- Experience with building and running workflows for RDBMS data loading and ETL processes.
- Proficiency in SQL and the ability to write complex queries for data extraction and analysis.
- Strong programming skills in Python for scripting and automation. Additional experience with R is preferred.
- Familiarity with genomic data formats and databases commonly used in bioinformatics research.
- Knowledge of data modeling concepts and implementing common data models in a relational database.
- Familiarity with data cleaning, normalization, and quality control processes.
- Excellent communication skills and ability to collaborate with researchers and stakeholders.
Job Type: Contract
Pay: $42.00 - $45.00 per hour
Schedule:
- 8-hour shift
- Day shift
- Monday to Friday
Education:
- Bachelor's (Preferred)
Experience:
- R programming: 4 years (Preferred)
- Bioinformatics: 4 years (Preferred)
- SQL: 4 years (Preferred)
- Developing and maintaining ETL processes: 4 years (Preferred)
- Writing complex queries for data extraction and analysis: 4 years (Preferred)
- Python: 4 years (Preferred)
- Genomic data: 4 years (Preferred)
Work Location: Remote