Site Reliability Engineer - Observability (Palo Alto) Job at Rivian and Volkswagen Group Technologies, Palo Alto, CA

MnVWOGNmRU5oL1pqTkRJMkJjU2F2YWlQOHc9PQ==
  • Rivian and Volkswagen Group Technologies
  • Palo Alto, CA

Job Description

Overview

We are seeking a Senior Site Reliability Engineer (SRE) specializing in Observability to join RivianVW's Data Platform - Production Engineering team. In this role, you will design, implement, and scale robust observability systems to ensure the health, performance, and reliability of our production environment. You will collaborate closely with cross-functional teams to create telemetry solutions that provide actionable insights into our distributed systems.

Responsibilities

  • Observability Platform Design: Architect, implement, and maintain observability systems, leveraging tools like Datadog, LGTM stack, OpenTelemetry, and Vector to enable real-time performance monitoring, logging, and alerting.
  • Telemetry Optimization: Evolve and scale telemetry pipelines to ensure low latency and high availability for metrics, logs, and traces across multi-cloud environments.
  • Performance Engineering: Proactively identify performance bottlenecks, optimize systems, and provide recommendations for reliability improvements.
  • Scalable Automation: Implement automation solutions to scale systems sustainably while driving improvements in reliability and deployment velocity.
  • Incident Management: Collaborate with the incident response team to establish data-driven debugging and troubleshooting processes using observability data.
  • Tooling Development: Create and maintain self-service observability tools and dashboards to empower teams across the organization.
  • Cross-functional Collaboration: Partner with development, DevOps, and infrastructure teams to define SLOs/SLIs and ensure observability is embedded throughout the software lifecycle.

Qualifications

  • Educational Background: Bachelors degree in Computer Science, Engineering, or equivalent practical experience.
  • Experience: 5+ years in Site Reliability Engineering or a related role with a strong emphasis on observability.
  • Technical Expertise:
    • Proficiency in designing and operating observability platforms with tools like Prometheus, Grafana, Loki, Jaeger, or Datadog.
    • Experience with OpenTelemetry and distributed tracing in microservices architectures.
    • Deep knowledge of Kubernetes (e.g., EKS), ArgoCD, and Crossplane.
  • Programming Skills: Strong proficiency in Python, Go, or similar languages for building automation and custom telemetry solutions.
  • Cloud & Systems: Familiarity with multi-cloud setups, containerization (Docker), and Linux system fundamentals.
  • Soft Skills: Exceptional problem-solving, communication, and a data-driven approach to decision-making.

Pay Disclosure

Salary Range/Hourly Rate for California Based Applicants: $146,900 - $194,610 USD

Actual Compensation will be determined based on experience, location, and other factors permitted by law.

Benefits Summary

Rivian and Volkswagen Group Technologies provides robust medical, prescription, dental and vision insurance packages for full-time employees, their spouse or domestic partner, and their children up to age 26. Coverage is effective on the first day of employment.

Equal Opportunity

Rivian and Volkswagen Group Technologies is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, sex, sexual orientation, gender, gender expression, gender identity, genetic information or characteristics, physical or mental disability, marital/domestic partner status, age, military/veteran status, medical condition, or any other characteristic protected by law. We are also committed to ensuring compliance with all applicable fair employment practice laws regarding citizenship and immigration status.

Accommodations

Rivian and Volkswagen Group Technologies is committed to ensuring that our hiring process is accessible for persons with disabilities. If you have a disability or limitation, such as those covered by the Americans with Disabilities Act, that requires accommodations to assist you in the search and application process, please email us at [email protected].

Candidate Data Privacy

Rivian and Volkswagen Group Technologies (Rivian and Volkswagen Group Technologies) may collect, use and disclose your personal information or personal data (within the meaning of the applicable data protection laws) when you apply for employment and/or participate in our recruitment processes. This data includes contact, demographic, communications, educational, professional, employment, social media/website, network/device, recruiting system usage/interaction, security and preference information. Rivian and Volkswagen Group Technologies may use your Candidate Personal Data for the purposes of (i) tracking interactions with our recruiting system; (ii) carrying out, analyzing and improving our application and recruitment process, including assessing you and your application and conducting employment, background and reference checks; (iii) establishing an employment relationship or entering into an employment contract with you; (iv) complying with our legal, regulatory and corporate governance obligations; (v) recordkeeping; (vi) ensuring network and information security and preventing fraud; and (vii) as otherwise required or permitted by applicable law. Rivian and Volkswagen Group Technologies may share your Candidate Personal Data with internal personnel, Rivian and Volkswagen Group Technologies affiliates, and service providers including background checks, staffing services, and cloud services. They may transfer or store internationally your Candidate Personal Data, including to or in the United States, Canada, and the European Union, and this data may be subject to the laws and accessible to authorities of such jurisdictions. Please see our Candidate Data Privacy Notice (English) and Candidate Data Privacy Notice (Serbian) for more information.

Please note that we are currently not accepting applications from third party application services.

Seniority level

  • Not Applicable

Employment type

  • Full-time

Job function

  • Engineering and Information Technology

Industries

  • Software Development
#J-18808-Ljbffr

Job Tags

Hourly pay, Full time, Contract work,

Similar Jobs

WELLS FARGO BANK

Senior Premier Banker Mesquite, NV Job at WELLS FARGO BANK

Why Wells Fargo: Are you looking for more? Find it here. At Wells Fargo, we're more than a financial services leader - we're a global trailblazer committed to driving innovation, empowering communities, and helping our customers succeed. We believe that a meaningful career... 

CAPPS, Inc.

DPS - DLD - License Specialist, Field Ops - 171 Job at CAPPS, Inc.

 ...this position; Driving requirements: Occasional (up to 50%). State of Texas Benefits and Retirement Information: Current DPS employees who submit applications for posted DPS positions shall notify their immediate supervisor in writing. A DPS employee who... 

MedStar Health

SEO Manager Job at MedStar Health

 ...everywhere in the organization, and we know the next big idea could be yours!The OpportunityAdobe is in search of a strategic SEO Manager to oversee organic search efforts for Adobe Acrobat's content strategy. We are dedicated to unlocking growth opportunities, refining... 

ShiftCode Analytics

Cloud Engineer Job at ShiftCode Analytics

 ...Money Laundering (AML). Our automation platform utilizes Automation Anywhere, a third-party RPA solution, to run bots in AWS Workspaces. However, due to security, cost, and scalability challenges, we are migrating from AWS Workspaces to AWS ECS (Elastic Container Service).

Morgan Murphy Media

Multi-Media Journalist Job at Morgan Murphy Media

 ...KOAM News Now is seeking a dedicated and passionate Full-Time Multi-Media Journalist to join our team. If you have a strong interest in local news and a drive to tell compelling stories, we want to hear from you! What You'll Do: Report daily on local news stories...