Senior Manager, Site Reliability Engineering (FedRamp)
Company: BlackLine
Location: Pleasanton
Posted on: January 23, 2025
Job Description:
Get to Know Us:It's fun to work in a company where people truly
believe in what they're doing!At BlackLine, we're committed to
bringing passion and customer focus to the business of enterprise
applications.Since being founded in 2001, BlackLine has become a
leading provider of cloud software that automates and controls the
entire financial close process. Our vision is to modernize the
finance and accounting function to enable greater operational
effectiveness and agility, and we are committed to delivering
innovative solutions and services to empower accounting and finance
leaders around the world to achieve Modern Finance.Being a
best-in-class SaaS Company, we understand that bringing in new
ideas and innovative technology is mission critical. At BlackLine
we are always working with new, cutting edge technology that
encourages our teams to learn something new and expand their
creativity and technical skillset that will accelerate their
careers.The successful applicant will be performing work in FedRAMP
environments, and therefore, must be a U.S. Person (i.e. U.S.
citizen, U.S. national, lawful permanent resident, asylee, or
refugee). This position may also perform work that the U.S.
government has specified can only be performed by a U.S. citizen on
U.S. soil.Make Your Mark:We are seeking an experienced Senior
Manager, Site Reliability Engineering to lead the team overseeing
the FedRamp operation, performance and reliability of the
Multi-Tenant BlackLine Accounts Receivable SAAS products. These are
hosted in Microsoft Azure using serverless technologies for PAAS,
IAAS, and SAAS components.This position plays a key role in
ensuring that Blackline's Accounts Receivable products, services,
infrastructure and public cloud are carefully planned and deployed
in a time, place, and configuration which is ideal for serving BL's
users. Your role encompasses aspects of capacity planning,
technical project execution, performance monitoring, site
reliability, security and software engineering. You must be equally
at home explaining analyses and project recommendations to senior
management as you are discussing the technical findings to
engineers or building tools to automate and scale their impact.You
will manage a team of 24/7 SRE staff managing day to day operations
& monitoring, incident engagement, and disaster recovery
activities. The candidate must possess solid critical thinking
skills and have experience supporting 24x7 High Availability
mission-critical traffic-intensive web infrastructures and be
familiar with public cloud hosting.You'll Get To:
- Improves the BlackLine SaaS service experience by discovering
and highlighting optimization opportunities with existing code or
architectural design to address application availability,
performance, observability, efficiency, and security
challenges.
- Develops tools and systems to automate the identification,
analysis, and remediation of application events, infrastructure
issues, or requests.
- Manages Incident Response and delivers Root Cause analyses
- Manages Production Operations, including day-to-day
administration of running processes, security and vulnerability
management, 24x7 initial response to system alerts and requests
from the Customer team
- Adhere to the change management and other established processes
and procedures.
- Support our continued certification to ISO 27001, ISO 9001 and
SOC2 standards
- Advocates for change across the organization. Ensures the
implementation of change with appropriate communications, goals,
resources, metrics, and reviews.
- Partners with internal organizations and vendors to develop
multi-year roadmaps influencing the direction and evolution of the
operating environment and support protocols.
- Establish and maintain Key Performance Indicators for the
overall health of the service and build tools to exercise and
evaluate if these KPI's are being met.
- Leads cross-functionally with other teams to surface common
pain points, architect solutions, establish conventions, and
evangelize application development and operations best
practices.
- Maintains and evolves the BlackLine trust site to include real
time availability and performance information.
- Monitor and plan for capacity and growth.
- Maintain documentation and operational knowledge base.What
You'll Bring:
- Years of Experience in Related Field: Minimum 8+ years of
industry experience. 3+ years of leadership experience
- Education: bachelor's degree in information technology,
Business or related field or equivalent experience.
- Expertise in reliable and repeatable web application deployment
and architecture.
- Someone energized by a fast-paced, iterative approach.
- An ability to balance urgent needs along with long term
strategy.
- Strong ownership, pride in work, and ability to take things
across the finish line.
- Of particular interest is a specialty in one or more of the
following: multi-page web apps, API integrations,
monitoring/alerting, Public Cloud infrastructure management,
distributed systems, cloud networking, or application
security.
- Hands-on problem-solving skills and Root Cause Analysis,
technical leadership and mentoring qualities.
- Strong written and oral communication skills.
- Manage end-to-end availability and performance of mission
critical services and build automation to prevent problem
recurrence.
- Lead by example, care for your team, and establish credibility
with the quality of the teams' technical execution.
- Participate in and manage on-call rotation for the SRE
Team
- Design, write and deliver software to improve the availability,
scalability, latency, and efficiency of Blackline's services.
- Cross-system and full-stack architecture experience and
awareness.
- Ability to communicate well with both business owners,
Executives and technical staff, at the appropriate levels.
- A minimum of 5 years of experience with a significant subset of
the following technologies: HTML, CSS, XML, SOAP, Ajax, JavaScript,
IIS, MSSQL, Jenkins, Chef, PowerShell, WMI, Java, SSL, Docker,
Kubernetes, Azure DevOps.
- Prior C#, ASP.NET, Ruby, Go or Java development experience,
preferably in an agile SaaS environment.
- Working knowledge of cloud platforms (Microsoft Azure strongly
preferred).Thrive at BlackLine Because You Are Joining:
- A technology-based company with a sense of adventure and a
vision for the future. Every door at BlackLine is open. Just bring
your brains, your problem-solving skills, and be part of a winning
team at the world's most trusted name in Finance Automation!
- A culture that is kind, open, and accepting. It's a place where
people can embrace what makes them unique, and the mix of cultural
backgrounds and varying interests cultivates diverse thought and
perspectives.
- A culture where BlackLiner's continued growth and learning is
empowered. BlackLine offers a wide variety of professional
development seminars and inclusive affinity groups to celebrate and
support our diversity.Salary Range:USD $186,000.00 - USD
$248,000.00Pay Transparency Statement:Placement within this range
depends upon several factors, including the applicant's prior
relevant job experience, skill set, and geographic location. In
addition to base pay, BlackLine also offers short-term and
long-term incentive programs, based on eligibility, along with a
robust offering of benefit and wellness plans.
#J-18808-Ljbffr
Keywords: BlackLine, Santa Rosa , Senior Manager, Site Reliability Engineering (FedRamp), Professions , Pleasanton, California
Didn't find what you're looking for? Search again!
Loading more jobs...