Site Reliability Engineering I
Scotiabank
We’re looking for an SRE with deep experience in production observability and incident response to raise the reliability and transparency of our customer-facing services. You will own the end-to-end observability stack across Dynatrace, Splunk, Power BI, and Google Cloud (GCP) Monitoring, drive proactive detection and reduction of toil, and lead major incident response. This role focuses on operational excellence and service health and NOT platform engineering or DevOps provisioning.
Design and maintain end-to-end monitoring for critical services using Dynatrace (APM, Real User Monitoring, Synthetic, Davis AI, Smartscape) and GCP Cloud Monitoring (metrics, alerting policies, SLOs/SLIs, uptime checks, dashboards).
tune Davis AI problem rules and reduce alert noise through thresholds, baselining, and tagging.
continuously review burn rates and align alerting to customer impact.
Create and optimize Splunk data models, indexes, sourcetypes, ingestion pipelines, and SPL searches; build actionable dashboards for NOC/SRE/Engineering.
Develop operational analytics and executive reporting in Power BI (data modeling, DAX/Measures, scheduled refresh) to track reliability KPIs, incident trends, MTTR/MTTD, SLO compliance, and capacity signals.
Establish governance for data quality, field extractions, and retention to ensure fast, accurate investigations.
Lead incident response (Sev1/Sev2): run bridges, coordinate SMEs, communicate status/timelines, drive mitigation and customer updates.
Maintain runbooks, decision trees, and standard operating procedures; Use light coding/scripting to automate recurring tasks: alert tuning, data enrichment, log parsing, playbook triggers, service health checks.
auto-triage, context gathering, incident timelines).
Contribute to observability standards and best practices (naming, tags, SLIs, alert policies), and mentor teams on instrumenting for reliability.
Cloud Monitoring, Cloud Logging, Alerting Policies, Uptime Checks, SLOs/SLIs; Power BI proficiency: data modeling, DAX measures, role-level security, and scheduled refresh for operational/Exec reporting.
- Coding/scripting for automation and data manipulation (e.g., Python or PowerShell; Solid understanding of service reliability concepts: golden signals, SLOs/error budgets, capacity and saturation, graceful degradation.
- Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and are embraced through bias-free practices and inclusive values across Scotiabank. We embrace diversity and provide opportunities for all employee to learn, grow & participate through our various Employee Resource Groups (ERGs) that span across diverse gender identities, ethnicity, race, age, ability & veterans.
Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
Dynamic Ecosystem - Free tea & coffee, universal washrooms, and lots of space for team collaboration.
Community Engagement - No matter where you choose to work from; we offer opportunities for community engagement & belonging with our various programs.
Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.
If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. If you require technical assistance, please click here. Candidates must apply directly online to be considered for this role.
$100.9k - $131.1k per year
...initiatives, contributing directly to the code base, guiding services to production readiness, and building common tooling for all of engineering. We are all curious folks and strive to be constantly learning! This role follows a hybrid schedule, with in-office work required...SuggestedLong term contractPermanent employmentTemporary workManual laborWork at officeRemote workFlexible hours- ...Support/SRE will be responsible for the supporting the application stack and spearheading the development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with...SuggestedFull timeFlexible hours
- ...Job Description: Skills: Dynatrace, Observability, Monitoring Engineering, SRE Practices Experience: 6-8 years Job... ...are seeking a highly skilled Dynatrace Monitoring Engineer / Site Reliability Engineer (SRE) responsible for designing, implementing, and maintaining...SuggestedPermanent employmentContract work
- ...best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment. Being a Site Reliability Engineer – Data Services at iManage Means… You are an engineer, a builder, and a systems thinker. You ensure data durability, optimize...SuggestedFull timeWork at officeLocal areaRemote workWorldwideMonday to fridayFlexible hours
$141k - $191k per year
...and develop your career. As an SRE Manager, you will lead a team of 10+ engineers, oversee their development and ensure operational excellence. About the Role: In this opportunity as Site Reliability Engineering Manager , you will be responsible for: Team Leadership...SuggestedWork at officeLocal areaFlexible hours2 days per week3 days per week- ...Job Description: Site Reliability Engineer (SRE) – Observability Toronto - Hybrid (1-2 days office) Role Summary We are looking for a Observability Engineer to help implement, operate, and improve observability capabilities across our applications and platforms...Contract workWork at office
- ...San Francisco and founded in 2014, Tubi is part of Tubi Media Group, a division of Fox Corporation. About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that applies a developer's mindset...RemplacementFull timeContract workTemporary workFlexible hours
- ...Description What is the opportunity? This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization. As the Engineering arm of the Digital...Full timeFlexible hours
$72k - $138k per year
...mentoring and on the job coaching Summary As a Senior Site Reliability Engineer – Production Management, you will design, deliver, and support... ...Kubernetes, GitOps, and high‑availability systems to ensure reliable, scalable production services. The role requires end‑to‑end...Temporary workFixed term contractFlexible hours- ...Title: Site Reliability Engineer (Production Support & Incident Management) Role: Site Reliability Engineer Location: Toronto Work Mode: Hybrid (4 Days WFO) Primary Skills: Production Support and Incident Management Experience Required: 6-8 years...Contract work
- ...Site Reliability Engineer - Dynatrace & Ansible Required Skills & Experience (Mandatory) ~5–8 years of experience in SRE | DevOps | or Platform Engineering roles ~ Strong hands-on experience with Dynatrace for observability and monitoring ~ Strong hands-...Full time
- ~ University degree in an Engineering discipline or technologist course relevant to job/equipment function. ~3 to 5 years working as... ...record of at least 2 to 3 years demonstrating application of Reliability methods and analysis ~ Experience in controlling and being accountable...Local area
- ...Production Support Engineer / SRE Work Mode: 4 days Onsite Production Support & Operations · Manage day‑to‑day production... ...DevOps & SRE Practices · Experience with DevOps and Site Reliability Engineering tools such as: Helios, UCD (UrbanCode Deploy), Jenkins...Full time
- ...Job Title: Rotating Engineer – Offshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Offshore) Work Location : Saudi Arab Job Description: The Rotating Engineer – Offshore...Permanent employmentFull time
- ...Job Title: Rotating Engineer – Onshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering... ...ensure safe and efficient plant operations. # Ensure reliable operation and optimal performance of rotating equipment including...Permanent employmentFull time
- ...Job Title: Mechanical Engineer – Onshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering... ..., and optimize maintenance activities to ensure safe, reliable, and efficient plant operations. # Develop and implement...Permanent employmentFull time
- ...Job Title: Mechanical Engineer – Offshore Reliability Experience: Minimum 12 Years Qualification: Bachelor’s Degree in Mechanical Engineering Industry: Oil & Gas / Refinery (Offshore) Work Location : Saudi Arab Job Description: The Mechanical Engineer...Permanent employmentFull time
$112k - $162k per year
...that welcomes you—because when you feel valued, you’re empowered to do your best work. About the Role We are seeking a Site Planning Engineer to join our Network & Infrastructure Engineering (N&IE) team. In this role, you will be responsible for supporting the...Full time$133k - $199.6k per year
...communications. Our team collaborates closely with engineering teams across Stripe and internal... ...an ability to establish priorities and reliably execute on solutions (often with hard deadlines... ...office for team/business meetings, on-sites, meet-ups, and events, our expectation is...Full timeWork at officeLocal areaRemote workWork from homeRelocation- ...Protecnium is an international consulting firm specializing in engineering and technical services ( . We are currently looking for a Site Engineer to join our team. -Project: subway/tunnel project -Estimated length : 18 months- with the possibility of being extended...Contract workTemporary workLocal areaMonday to fridayNight shift
- ...you have: ~ Degree in Computer Science, Engineering, or related field. ~15+ years of... ...operations, production support, or service reliability roles, preferably within financial services... ...not limited to, an accessible interview site, alternate format documents, ASL Interpreter...Contract workFlexible hours
- ...now for the following position: Senior Project Manager Product Reliability. Overview: As the Senior Project Manager Product... ...Reliability strategy, and Reporting to the Director Sustaining Engineering, this role is integral to achieving our must-win priority of delivering...Full timeFlexible hours
$90k - $100k per year
...Efficiency and reliability are the pillars of our success. Amentum is looking for a Facilities Maintenance Site Manager to drive our preventive maintenance programs and optimize... ...Qualifications: Bachelor’s degree in engineering, Business Administration, Facility Management...Hourly payDaily paidLong term contractRemplacementContract workWork at officeLocal areaShift workWeekend work$70k - $80k per year
...Amentum is seeking a Reliability Planning Analyst I to support our team of multi-skilled technicians... ...with OSHA, EPA and Company and Site-Specific rules and regulations always.... ...accomplish work. Identifies work requiring engineering and design and reviews with proper entities...Hourly payContract workWork at officeLocal areaShift workWeekend workDay shift$70k - $105k per year
...rewarding career working with the #1 globally ranked water treatment engineering firm, and with the industry’s best and most innovative engineers, then Jacobs is where you belong. This role is fully on-site at our Pickering site location, therefore, only candidates...Long term contractFull timeContract workRelocation$20 per hour
...Allied Universal is seeking Security Professional - Corporate Site in Downtown Toronto, Ontario. Job Title : Security Professional... ...clearance Overview : We are currently seeking a reliable and dedicated Security Professional for a Corporate Site in...Hourly payFull timePart timeWork at officeImmediate startShift workWeekend work$22.5 - $25.5 per hour
...Canada, with over 28,000 employees across 40 sites in North America, Europe, and Asia.... ...full suite of services – from design and engineering, to manufacturing and supply chain... ...this Opportunity The Global Quality & Reliability Intern position is ideal for a person interested...Hourly payFull timeCasual workInternshipWork at officeLocal areaRemote workWorldwide$60k - $65k per year
...Job Responsibility: Assistant Site Manager Job Overview: Reporting to the Site Manager, the Assistant Site Manager will be an employee of the Aurum Group of Companies and will be working with one of the biggest clients of Aurum. The Assistant Site Manager (ON) is responsible...Permanent employmentFull timeContract workShift work- ...Application Deadline: Rolling Salary Range: TBD Key Responsibilities Provide day-to-day administrative support to the site team. Maintain organized site files, drawings, and documentation. Assist with general office duties such as printing, scanning, and filing...For subcontractorWork at office
- We are seeking a highly organized and experienced Site Supervisor to oversee construction projects and ensure their successful completion. The ideal candidate will manage daily operations on-site, coordinate with teams, and ensure adherence to safety, quality, and project specifications...For contractorsFor subcontractorLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineering I. Be the first to apply!
- site reliability engineer intern Toronto, ON
- site reliability engineer remote Toronto, ON
- senior site reliability engineer Toronto, ON
- site reliability engineer Toronto, ON
- site safety Toronto, ON
- site carpentry Toronto, ON
- site maintenance Toronto, ON
- website developer Toronto, ON
- site reliability engineer intern
- site reliability engineer remote
