Principal Site Reliability Operations Engineer

Company: Roblox Corporation
Location: San Mateo
Posted on: January 23, 2025

Job Description:

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.

At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.

A career at Roblox means you'll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

We're looking for a Principal Site Reliability Operations Engineer with a passion for problem-solving to join our Reliability Response team. The ideal candidate will have demonstrated expertise in handling incidents and thrive in a dynamic, complex, and ever-evolving distributed environment. Your ability to identify and address root causes will be crucial for driving sustainable, long-term solutions and achieving success in this role.

As a Principal Site Reliability Operations Engineer on the Reliability Team, you will handle production incidents and improve Roblox's incident processes. You will maintain reliability service-level objectives, lead incident resolution with determination, and collaborate with service teams to identify and implement actionable improvements during the incident postmortem process. If you are passionate about maintaining uptime in a sophisticated distributed environment full of continuous change, you'll be right at home with our Reliability team.

This role will report to the Senior Manager, Reliability and requires 3 in-office days per week.

You Will:

- Lead and manage production incidents.
- Collaborate cross-functionally to troubleshoot and resolve sophisticated technical challenges.
- Guide the implementation of incident management processes and procedures, ensuring fast and effective responses to minimize impact.
- Continually monitor system health, performance and capacity, proactively addressing potential issues.
- Conduct comprehensive post-mortem analysis to ascertain the root cause of incidents and formulate corrective measures.
- Contribute substantially to the design and improvement of system architecture to boost reliability and performance.
- Leverage coding skills to automate daily routine tasks and improve system efficiency.
- Serve in the Incident Manager On-Call rotation.
- Mentor junior team members.

You Have:
- At least 8 years of experience in a comparable role within a Site Reliability Team.
- Advanced knowledge of systems and network infrastructure protocols.
- Demonstrated ability in managing, troubleshooting, and resolving incidents in distributed environments.
- Experience solving problems.
- An ability to distill complex technical issues into clear and concise language.
- Familiarity with at least one scripting or programming language to automate routine tasks (Python, Golang, or similar languages preferred).
- A Bachelor's degree, or equivalent experience, in Computer Science, Computer Engineering, or a similar technical field.

You Are:
- A great communicator; you are able to explain complex systems clearly to stakeholders and fellow engineers.
- Able to operate in potentially ambiguous circumstances during a production incident.
- Familiar with the interactions of services in a distributed system.
- Tenacious towards driving challenging production incidents to resolution.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.

Annual Salary Range: $226,450 - $262,150 USD

Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).

You'll Love:
- Excellent medical, dental, and vision coverage
- A rewarding 401k program
- Flexible vacation policy (varies by exemption status)
- Roflex - Flexible and supportive work policy
- At Roblox HQ:
  - Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  - Onsite fitness center and fitness program credit
  - Annual CalTrain Go Pass

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.
