Company

Machine Learning And Ai

Address: Cupertino, CA
Category: Information Technology

Job description

In this role, you will build ultra-large-scale batch and streaming datasets to support analytics, experimentation, and machine learning, and you will help drive our self-serve reporting strategy on behalf of data scientists and product engineers as we collectively make the product better. You will help design the instrumentation required to log data from the device and server side, and validate that data is flowing into the Data Warehouse in the correct shape, frequency, and quality. You will curate a high-performance, easy-to-understand data model that meets the needs of the many. You will identify common patterns, build self-serve tools to scale Data Engineering, and automate the lifecycle of datasets to the highest standards of data quality. You will also educate your consumers on how to access your products, ensuring transparency and understanding of logic definitions and enabling self-service.

Requirements

  • 7+ years of technical experience designing, building, and maintaining distributed data processing platforms.
  • 5+ years of industry experience working with batch or streaming distributed data processing technologies (e.g., Hadoop, MapReduce, Spark, Flink, Kafka, Presto) for building efficient, large-scale data pipelines.
  • 3+ years of data modeling experience designing data warehouse table schemas and logging schemas.
  • Proficiency in at least one high-level programming language (Java, Scala, Python, Go or equivalent).
  • Experience with large, complex, highly dimensional data sets; hands-on experience with SQL.
  • Experience working with cross-functional teams to collect business requirements, build consensus, and manage expectations.
  • You are self-directed and capable of operating amidst ambiguity.
  • You are humble, are continually growing in self-awareness, and possess a growth mindset.
  • You are curious and have excellent written and verbal communication as well as problem-solving skills.
  • You are excited about digging into massive petabyte-scale semi-structured datasets.
Reference code: 9138294. Machine Learning And Ai - 2024-04-26 10:48
