Senior Service Reliability Manager

Senior Service Reliability Manager

23 Oct
|
Scotiabank
|
Toronto

23 Oct

Scotiabank

Toronto

Requisition ID: 205554
Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.
The Role:
As a member of the Global Payments and Core Banking Systems (PCBE) Systems Reliability team, the Senior Manager, Systems Reliability will lead and collaborate with a team that will work with engineering teams, infrastructure teams, and business partners to continuously improve the stability and reliability of PCBE systems through Site Reliability Engineering (SRE) based practices that will include continuous people, process and technology enhancements in support of our rapidly changing technology product portfolio.
The Team:






You will work cross-functionally amongst a variety of teams and be a contributor in all significant deliverables to the Systems Reliability Office stakeholders. You will also have an understanding of 'what could go wrong', solve complex problems and have a flair for communicating and leading discussions with technical and business partners.
You will leverage your deep experience with IT Service Delivery and IT Service Management to standardize and improve operations, analysis and service levels across the Global Payments and Core Banking Systems portfolio.
You will contribute to the overall success of the PCBE SRO globally, ensuring specific individual and team goals, plans, initiatives are executed/delivered in support of the SRO team's strategies and objectives. You will ensure that all activities conducted are in compliance with governing regulations, internal policies and procedures.
Is this role right for you? In this role you will:
Champion a customer focused culture to deepen client relationships and leverage broader Bank relationships,





systems and knowledge.
Be accountable for creating, maintaining and distributing SLOs data and reports/dashboards of our technology portfolio to various stakeholders across the organization.
Champion Stability and Reliability across a portfolio of applications and services by working closely with service owner team to continuously improve the MTTR metrics and reduce downtime, leading troubleshooting of our most severe incidents and participating in incident root cause analysis to prevent recurrence.
Contribute to prioritization of reliability features with service owners and engineering teams.
Contribute to the design, development and delivery of effective tooling, alerts,





and automated responses to identify and address reliability risks and automation of SLOs.
Participate in incident calls and when required lead communications on impact and recovery status.
Ensure information on incidents and problems is complete, accurate and the action items are being worked on by the assigned individuals.
Produce weekly/monthly/quarterly status reporting or dashboards on incidents and problems for distribution to business and technology stakeholders.
Participate and drive post incident activities as per organizational governance and requirements.
Interface with teams across technology and business partners on stability and reliability concerns, system disruptions and providing incident details and root causes.






Do you have the skills that will enable you to succeed in this role? We'd love to work with you if you have:
Degree in Computer Science, Engineering, or equivalent experience.
8+ years experience in the industry (Software development, DevOps, Service Management) with at least 3 years in a leadership capacity.
Experience with creating and maintaining system performance dashboards.
Experience with analyzing and troubleshooting systems.
Excellent communication (both verbal and written). The ability to communicate confidently and clearly on conference calls, in meetings, via email, etc. at all levels of the organization is essential.
Ability to quickly and clearly communicate incident status via email in business friendly language.
Experience with ITSM tools (ServiceNow, a plus)





with strong understanding of SRE and service management principles.
Nice to Haves:
Experience or familiarity with the Financial industry.
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
Experience with Performance and Capacity Management (PCM) tools (ie. Dynatrace, Splunk).
Well-rounded broad knowledge of OS platforms (Linux/UNIX), Networking, Web Systems and IT Ops.
ITIL Foundation Certification.
What's in it for you?
Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and are embraced through bias-free practices and inclusive values across Scotiabank.






Accessibility and Workplace Accommodations - We value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone.
Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
Community Engagement - no matter where you choose to work from; we offer opportunities for community engagement & belonging with our various programs such as hackathons,





contests, cooking with friends, Humans of Digital and much more!
Work arrangements: Hybrid
Location(s): Canada : Ontario : Toronto

#J-18808-Ljbffr

▶️ Senior Service Reliability Manager
🖊️ Scotiabank
📍 Toronto

Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: senior service reliability manager

Senior Service Reliability Manager

Senior Service Reliability Manager

Requisition ID: 205554 Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Global Payments and Core Banking Systems (PCBE) Systems Reliability team, the Senior M [...]
Toronto
23 Oct
    Toronto
    23 Oct

Senior Service Reliability Manager

Senior Service Reliability Manager

Requisition ID: 205554 Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Global Payments and Core Banking Systems (PCBE) Systems Reliability team, the Senior Manag [...]
Toronto
21 Oct
    Toronto
    21 Oct

Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Application Deadline: 12/30/2024 Address: 4100 Gordon Baker Road Job Family Group: Technology Senior Site Reliability Engineer (SRE) We are looking for a talented Site Reliability Engineer to enhance our observability capabilities. This role i [...]
Toronto
10 Oct
    Toronto
    10 Oct

Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Application Deadline: 12/30/2024 Address: 4100 Gordon Baker Road Job Family Group: Technology Senior Site Reliability Engineer (SRE) We are looking for a talented Site Reliability Engineer to enhance our observability capabilities. This role i [...]
Toronto
10 Oct
    Toronto
    10 Oct
Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: senior service reliability manager