
Site Reliability Engineering I

Scotiabank

We’re looking for an SRE with deep experience in production observability and incident response to raise the reliability and transparency of our customer-facing services. You will own the end-to-end observability stack across Dynatrace, Splunk, Power BI, and Google Cloud (GCP) Monitoring, drive proactive detection and toil reduction, and lead major incident response. This role focuses on operational excellence and service health, not on platform engineering or DevOps provisioning.

Design and maintain end-to-end monitoring for critical services using Dynatrace (APM, Real User Monitoring, Synthetic, Davis AI, Smartscape) and GCP Cloud Monitoring (metrics, alerting policies, SLOs/SLIs, uptime checks, dashboards).

Tune Davis AI problem rules and reduce alert noise through thresholds, baselining, and tagging.

Continuously review burn rates and align alerting to customer impact.
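For readers unfamiliar with the term, a burn rate compares the observed error rate with the error rate the SLO allows. The sketch below is illustrative only and is not taken from the posting: the function names, the two-window check, and the 14.4 threshold (a commonly cited fast-burn value for a 30-day SLO) are all assumptions.

```python
def burn_rate(bad_events: int, total_events: int, slo_target: float) -> float:
    """Observed error rate divided by the error rate the SLO allows.

    A value of 1.0 means the error budget is being spent exactly at the
    rate that would exhaust it at the end of the SLO period.
    """
    if total_events == 0:
        return 0.0  # no traffic, so no budget is consumed
    allowed_error_rate = 1.0 - slo_target
    return (bad_events / total_events) / allowed_error_rate


def should_page(short_window_rate: float, long_window_rate: float,
                threshold: float = 14.4) -> bool:
    """Hypothetical paging rule: both windows must exceed the threshold.

    Requiring a short and a long window to agree filters out brief spikes
    that do not reflect sustained customer impact.
    """
    return short_window_rate >= threshold and long_window_rate >= threshold


# Example: 150 failed requests out of 1,000 against a 99% SLO.
rate = burn_rate(150, 1000, 0.99)
print(round(rate, 2), should_page(rate, rate))
```

Tying the page to a burn rate rather than a raw error count is one way to align alerting with customer impact, since the same error count matters more on a low-traffic service.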

Create and optimize Splunk data models, indexes, sourcetypes, ingestion pipelines, and SPL searches; build actionable dashboards for NOC/SRE/Engineering.

Develop operational analytics and executive reporting in Power BI (data modeling, DAX/Measures, scheduled refresh) to track reliability KPIs, incident trends, MTTR/MTTD, SLO compliance, and capacity signals.
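As a rough illustration of the MTTD and MTTR figures mentioned above, the sketch below computes both from a list of incident records. The `Incident` fields and the definitions used (detection and restoration both measured from the start of impact) are assumptions; teams define these metrics differently, and the posting does not specify a method.

```python
from dataclasses import dataclass
from datetime import datetime
from statistics import mean


@dataclass
class Incident:
    """Hypothetical incident record with three timestamps."""
    started: datetime   # when customer impact began
    detected: datetime  # when monitoring or a person noticed
    resolved: datetime  # when service was restored


def mttd_minutes(incidents: list[Incident]) -> float:
    """Mean time to detect: average of (detected - started), in minutes."""
    return mean((i.detected - i.started).total_seconds() for i in incidents) / 60


def mttr_minutes(incidents: list[Incident]) -> float:
    """Mean time to restore: average of (resolved - started), in minutes."""
    return mean((i.resolved - i.started).total_seconds() for i in incidents) / 60


sample = [
    Incident(datetime(2024, 1, 1, 10, 0), datetime(2024, 1, 1, 10, 5),
             datetime(2024, 1, 1, 10, 45)),
    Incident(datetime(2024, 1, 2, 12, 0), datetime(2024, 1, 2, 12, 15),
             datetime(2024, 1, 2, 13, 15)),
]
print(mttd_minutes(sample), mttr_minutes(sample))
```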

Establish governance for data quality, field extractions, and retention to ensure fast, accurate investigations.

Lead incident response (Sev1/Sev2): run bridges, coordinate SMEs, communicate status/timelines, drive mitigation and customer updates.

Maintain runbooks, decision trees, and standard operating procedures.

Use light coding/scripting to automate recurring tasks: alert tuning, data enrichment, log parsing, playbook triggers, service health checks.

auto-triage, context gathering, incident timelines).

Contribute to observability standards and best practices (naming, tags, SLIs, alert policies), and mentor teams on instrumenting for reliability.

Cloud Monitoring, Cloud Logging, Alerting Policies, Uptime Checks, SLOs/SLIs.

Power BI proficiency: data modeling, DAX measures, row-level security, and scheduled refresh for operational and executive reporting.

  • Coding/scripting for automation and data manipulation (e.g., Python or PowerShell).
  • Solid understanding of service reliability concepts: golden signals, SLOs/error budgets, capacity and saturation, graceful degradation.
  • Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and embraced through bias-free practices and inclusive values across Scotiabank. We embrace diversity and provide opportunities for all employees to learn, grow & participate through our various Employee Resource Groups (ERGs) that span diverse gender identities, ethnicity, race, age, ability & veterans.

Upskilling through online courses, cross-functional development opportunities, and tuition assistance.

Competitive Rewards program, including bonus, flexible vacation, personal and sick days, and benefits that start on day one.

Dynamic Ecosystem - Free tea & coffee, universal washrooms, and lots of space for team collaboration.

Community Engagement - No matter where you choose to work from, we offer opportunities for community engagement & belonging through our various programs.

Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.

If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. If you require technical assistance, please click here. Candidates must apply directly online to be considered for this role.

Vacancy posted 6 days ago
