Job Description
Data Engineer
Malvern, PA
FTE
Provides advanced data solutions by using software to process, store, and serve data to others. Tests data
quality and optimizes data availability. Ensures that data pipelines are scalable, repeatable, and secure.
Utilizes a deep dive analytical skillset on a variety of internal and external data.
Job Description
Core Responsibilities
1. Writes ETL (Extract / Transform / Load) processes, designs database systems, and develops tools for
real-time and offline analytic processing.
2. Troubleshoots software and processes for data consistency and integrity. Integrates large scale data
from a variety of sources for business partners to generate insight and make decisions.
3. Translates business specifications into design specifications and code. Responsible for writing complex
programs, ad hoc queries, and reports. Ensures that all code is well structured, includes sufficient.
documentation, and is easy to maintain and reuse.
4. Partners with internal clients to gain an enhanced understanding of business functions and informational
needs. Gains expertise in tools, technologies, and applications/databases in specific business areas and
company-wide systems.
5. Leads all phases of solution development. Explains technical considerations at related meetings,
including those with internal clients and less experienced team members.
6. Tests code thoroughly for accuracy of intended purpose. Reviews end product with the client to ensure
adequate understanding. Provides data analysis guidance as required.
7. Designs and conducts training sessions on tools and data sources used by the team and self
provisioners. Provides job aids to team members and business users.
8. Tests and implements new software releases through regression testing. Identifies issues and engages
with vendors to resolve and elevate software into production.
9. Participates in special projects and performs other duties as assigned.
10. Strong experience in Python core, Pandas, NumPy and SQL. PySpark would be an advantage
11. Within AWS : Glue, Athena, Redshift, IAM, CloudFormation, S3, Basics of Sagemaker and Serverless (Lamda, Dynamo DB). Understanding of AWS RDS would be an advantage.
12. Understanding of CI/CD pipeline
13. Able to process various filetypes like csv, excel and parquet (good to have) using Python and have basic understanding of Tableau.
14. Understanding of Investment Management would be an advantage.
15. Familiarity with Agile/Scrum Framework.