SanDiegoRecruiter Since 2001
the smart solution for San Diego jobs

Manager Site Reliability Engineering

Company: PlayStation Global
Location: San Diego
Posted on: May 26, 2023

Job Description:


A
s a Site Reliability Manager and engineering leader within the Account & Identity services domain, you will lead team and take ownership for keeping key user experiences on the platform available, resilient, performant, and secure, while continually enabling our engineering teams to efficiently deliver new and engaging products and adopt new tools and technology.
You will collaborate across engineering and operations teams, and leaders to lead and platform wide standardization and technical initiatives, proactively identify and drive improvements in people, process, and technology that enable our teams to deliver, run and operate their services, champion a culture of continuous improvement and operational excellence, and provide amazing customer experiences for millions of users. You should have a strong technical ability, demonstrated interpersonal skills with the ability to guide and inspire your team to achieve outstanding results in a fast-paced environment.
Responsibilities


  • Lead a team of site reliability engineers (SREs), provide for and support critical applications and services supporting our platform and be directly responsible for maintaining PlayStation's stellar uptime record.
  • Responsible for end-to-end availability and reliability of critical PSN experiences such as sign-in, account, and game play - provide our customers & players with always-available, high-performing, amazing, and secure Play experiences.
  • Continually refine and improve existing processes through structured decision making and collaborative execution.
  • Drive adoption of industry standard methodologies and practices for Infrastructure as Code, deployment, observability, load & resiliency testing, and reliability.
  • Reduce toil and enable teams to operate and run their services thru automation and tooling.
  • Define and maintain standards for Reliability, Availability, Serviceability (RAS) for onboarding of new features, products, and services.
  • Lead by example, care for your team, and establish credibility with the quality of your team's technical and operational execution.
  • Lead day-to-day team activities using the Agile/Scrum methodology.
  • Support and scheduling on-call rotations.

    Minimum Qualifications

    • Proven track record of leading successful engineering teams
    • Breadth and depth of experience building and running sophisticated software systems and highly scalable/available software.
    • 2+ years in a technical leadership or engineering manager role.
    • Bachelor's degree in computer science or equivalent practical experience.
    • Excellent written and verbal communication skills.
    • Ability to present technical information in a clear and concise manner to executives and non-technical leaders.

      Preferred Qualifications

      • 5 years of work experience in a technical leadership role with a minimum of 1+ years in SRE/DevOps.
      • Hands-on experience in triaging and tuning Java cloud applications with integration into AWS managed services.
      • Knowledge of Linux, Scripting, automation, Cloud, infrastructure, and application monitoring tools (Prometheus, Datadog, Dynatrace); previous exposure to Kubernetes, EKS, or similar container orchestration systems.
      • Experience in deploying, operating, and running services in AWS or other cloud environments.
      • Experience with distributed systems / micro service architecture.

        #LI-GM1

Keywords: PlayStation Global, San Diego , Manager Site Reliability Engineering, Engineering , San Diego, California

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest California jobs by following @recnetCA on Twitter!

San Diego RSS job feeds