About the team:
Klaviyo operates a real-time data analytics platform coded primarily in Python that is built for massive scale and hosted on Amazon Web Services (AWS). Engineers come to Klaviyo with experience in a variety of languages and from a number of disciplines.
At Klaviyo, we love tackling tough engineering problems and look for employees who specialize in certain areas but are passionate about building, owning & scaling features end to end from scratch and breaking through any obstacle or technical challenge in their way. We push each other to move out of our comfort zone, learn new technologies, and work hard to ensure each day is better than the last. Learn more about our engineering culture at https://klaviyo.tech
About the role:
As a Machine LearningEngineer II, you will be a key contributor to the DS Platform team's efforts to build and improve the tools, systems, and software services that Data Scientists depend on to create cutting edge models that power Klaviyo's most advanced features.
You will be responsible for developing tools to train and develop models, serve models in production, and monitor models' long term performance. You'll work with a modern software stack built on Kubernetes, Sagemaker, and Ray, helping to support models running on technologies such as PyTorch, SKLearn, Huggingface, and more.
You will learn from senior team members and level up your software engineering, dev ops, and DS/ML skills in a collaborative hybrid environment surrounded by engineers and data scientists passionate about producing high quality and high value models and features.
How you'll have an impact:
30 days
- You will have set up your local environment and contributed your first PR.
- You will be participating in team meetings and processes, and have met several members of the wider Data Science team.
60 days
- You will have a firm understanding of at least one of the systems the team owns, and will be actively and consistently contributing code on a regular cadence.
- You will be actively reviewing teammates' code.
90 days
- You will have developed at least one major feature.
- You will be familiar with all of the team's systems and have joined the team's on-call rotation, demonstrating ownership and mastery.
- You will be an active contributor to team discussions on technical decisions.
Up to 1 year
- You will be a key member of the team's technical and social fabric, shipping impactful code and systems, responding to incidents, monitoring and ensuring uptime, and collaborating with teammates effectively to solve problems.
What we're looking for:
- Prior industry experience as software engineer or Machine Learning engineer
- Python
- AWS
- Unix/Linux
- Networking
Nice to have:
- Kubernetes
- Terraform
- Go
- Sagemaker
- ML frameworks such as Huggingface, PyTorch, Tensorflow, Keras
- Distributed training frameworks such as Spark, Ray, etc
#LI-Onsite #LI-CB2