Company

Focal SystemsSee more

addressAddressSan Francisco, CA
type Form of workFull-Time
CategoryInformation Technology

Job description

Job Description

Company Description

Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail using deep learning computer vision. Focal Systems has been deployed at scale with the top retailers in the world. We are looking for smart, creative and passionate people who want to help build a great and enduring company and deploy Deep Learning to the world! 
Mission of the role:
To enable us to scale from 200k to 1 million cameras.
Job Summary:
As a Sr. DevOps/Site Reliability Engineer (SRE) at our company, you will play a pivotal role in ensuring the smooth operation and continuous improvement of our infrastructure, deployment processes, and overall system reliability.

Responsibilities:
  • Set up and manage blue/green and canary deployments to ensure smooth launches without downtime.
  • Operate multiple large GCP Kubernetes clusters and fine tune for reliability vs cost.
  • Manage the various distributed services of the company, ensuring to always provide graceful updates, comprehensive test coverage, tracking of logs, and 99.9% uptime.
  • Work with Backend, Frontend and Deep Learning teams and write infrastructure automation code for their needs.
  • Identify scalability bottlenecks through load testing and plan infrastructure architecture.
  • Create tools to provide transparency/ease of access into the company's rich datasets stored across varying geographic locations and data formats.
  • Design, build, and manage a robust Continuous Integration and Continuous Deployment (CI/CD) pipeline.
Requirements:
  • 4+ years experience in an infrastructure or Site Reliability Engineer (SRE)  role.
  • 3+ years of experience with containerization (Docker) and orchestration platforms (Kubernetes) required.
  • Great understanding of SQL, networking, distributed systems, operating systems (debian), data structures, algorithms, and software engineering practices.
  • Experience operating Kafka (or other Pub/Sub) clusters at terabyte scale.
  • Terraform or other Infrastructure as Code automation solution.
  • Operating Relational SQL databases and Redis at terabyte scale. 
  • Proven experience with setting up monitoring/alerting and reliability engineering.
  • Scriptings skills in Python.
Nice to have experience:
  • GitOps.
  • Setting up automation for complex load testing scenarios.
  • Tuning Deep Learning pipelines with Python, Pytorch and Multiprocessing.
  • Backend programming with Python.
Why Focal Systems:
Strong Values and Mission - We are a tightly-knit team with an ambitious mission and a strong set of core values, which define our approach to business and have successfully guided us since inception.
Exceptional Team - We are a team of hard-working, fun-loving professionals from some of the most eminent universities, research labs, and tech companies of our time. We pride ourselves on recruiting exceptional individuals to help us redefine the state-of-the-art.
Outstanding Partners - We work with 10+ of the largest retailers in the world and have a world-class roster of investors, advisors and partners to support & advise us in our endeavors.
Benefits:
We care deeply about the health, happiness, and wellbeing of all of our employees. We offer:
  • Competitive Salary & Attractive Stock.
  • Health Insurance.
  • Catered lunches.
  • Paid Time Off.
  • Quarterly Team Retreats.
  • Education grants.
Refer code: 8728565. Focal Systems - The previous day - 2024-03-25 15:01

Focal Systems

San Francisco, CA
Jobs feed

Guest Services Associate

Pacific Mobile Structures, Inc.

Freeport, TX

Director, Employee Experience – Communication & Engagement Solutions

Willis Towers Watson

Texas, United States

RN Float Full Time Days $8,000 Bonus

Cbs17

Olin, NC

Library Shelver - Part Time - 17 hours - Lake Jackson

Brazoria County, Tx

Angleton, TX

Driver - Roanoke Airport

Enterprise Holdings, Inc.

Roanoke, VA

Senior Caregiver - Now Hiring

Care.com

Las Vegas, NV

RN - Float (Full-Time/Days)

Cbs17

Olin, NC

RN Registered Nurse

Cbs17

Olin, NC

Share jobs with friends

Senior DevOps Software Engineer

Johnson & Johnson

Redwood City, CA

5 days ago - seen

Senior DevOps Engineer

Talentmovers

$60 - $68 an hour

Pleasanton, CA

3 weeks ago - seen

Senior DevOps Engineer - Flight Software

Astranis

$150,000 - $200,000 a year

San Francisco, CA

4 weeks ago - seen

Senior DevOps Engineer

Canepa Associates

Palo Alto, CA

a month ago - seen

Senior Infrastructure DevOps Engineer

Teksystems

Oakland, CA

a month ago - seen

Senior DevOps Engineer

Skan

Menlo Park, CA

a month ago - seen

Senior Software Engineer - Integration Automation/DevOps

Intuitive Surgical

Sunnyvale, CA

a month ago - seen

Senior/Staff DevOps Engineer

Betterhelp

Mountain View, CA

a month ago - seen

Senior Staff Software Engineer, Autonomy DevEx & DevOps (R2416)

Shield Ai

San Diego, CA

a month ago - seen

Senior DevOps Engineer

Ciitizen

San Francisco, CA

a month ago - seen

Senior AWS DevOps Engineer (REMOTE)

Embtel, Inc.

Palo Alto, CA

a month ago - seen

Senior Data Engineer (DevOps Automation) - Alameda, CA (Hybrid)

Georgia It Inc.

Alameda, CA

2 months ago - seen

Senior Software Engineer, DevOps

Mixhalo

San Francisco, CA

2 months ago - seen

Senior DevOps Engineer

Elixir Technologies

Ojai, CA

2 months ago - seen

Senior Software Engineer, DevOps

Ixl Learning

San Mateo, CA

2 months ago - seen

Senior DevOps Engineer

Primer.ai

San Francisco, CA

2 months ago - seen

Senior DevOps Engineer - Remote

Pipl

San Francisco, CA

2 months ago - seen

Senior Engineer, DevOps

Chipotle

$111,000 - $155,000 a year

Newport Beach, CA

2 months ago - seen