Azure Site Reliability Engineer- Remote
Astra North Infoteck Inc.
Site Reliability Engineer (SRE) – Azure AKS, Observability & Terraform
Remote Role
Key Responsibilities
Observability, SRE, DevOps roles with expertise in infrastructure and application reliability
Dynatrace, ELK, Splunk, PagerDuty
SLI/SLO frameworks
Azure Kubernetes Service (AKS), Terraform, Azure managed services
What will you do
Design and implement observability-as-code solutions using Terraform for monitoring pipelines, dashboards, and alerting across distributed systems
Drive observability improvements using Dynatrace, ELK, Splunk, PagerDuty for real-time performance insights and system visibility
Instrument applications for end-to-end observability including distributed tracing, metrics collection, and log aggregation across Node.js and .NET microservices and event-driven architectures
Troubleshoot complex production incidents across service layers, databases, caches, and APIs using SLI/SLO frameworks
Investigate and resolve Azure Kubernetes Service (AKS) infrastructure issues ensuring reliability and scalability of containerized workloads using Terraform and Azure services (SQL MI, Redis, Functions, Event Grid)
Translate business requirements into observable, resilient systems aligned to SLIs/SLOs
Automate operational tasks using Infrastructure-as-Code and CI/CD to reduce toil and improve resilience
Lead incident response and remediation for critical systems, including blameless postmortems and chaos engineering practices
Collaborate with development, platform, and business teams to improve availability, scalability, and operational excellence
What do you need to succeed
Must-have
- 8+ years experience in SRE, DevOps, or Observability roles focused on infrastructure and application reliability
- Strong expertise in Dynatrace, ELK, Splunk, PagerDuty and observability principles (instrumentation, correlation IDs, SLIs/SLOs)
- Advanced proficiency in Azure Kubernetes Service (AKS), Terraform, and Azure managed services (SQL MI, Redis, Functions, Event Grid)
- Hands-on experience with observability instrumentation (distributed tracing, metrics, logs) across Node.js and .NET microservices and event-driven systems
- Strong troubleshooting skills across distributed systems (services, databases, caches, APIs) in production environments
- Incident management expertise using PagerDuty and ServiceNow, including high-severity incident resolution and RCA
- Knowledge of incident, problem, and change management, SRE principles, blameless postmortems, and chaos engineering
- Strong communication and leadership skills for cross-functional coordination and incident handling
- ...SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe... ...about belonging, collaboration, and accomplishment. Being a Site Reliability Engineer at iManage Means… You are an engineer, a builder, and a...SuggestedFull timeWork at officeLocal areaRemote workWorldwideMonday to fridayFlexible hours
- ...Title Suggestion: Azure Cloud Engineer | SRE & Observability Work Type: 100% Remote /Hybrid Experience Required: 5–10 years Education Requirements: Graduate Job Description: Design, implement, and maintain CI/CD pipelines in Azure DevOps for...SuggestedContract workRemote work
$141k - $191k per year
...at Thomson Reuters and develop your career. As an SRE Manager, you will lead a team of 10+ engineers, oversee their development and ensure operational excellence... .... About the Role: In this opportunity as Site Reliability Engineering Manager , you will be responsible for:...SuggestedWork at officeLocal areaFlexible hours2 days per week3 days per week$72k - $138k per year
...mentoring and on the job coaching Summary As a Senior Site Reliability Engineer - Production Management, you will design, deliver, and... ...including pipeline automation, Infrastructure-as-Code (e.g. Terraform, CloudFormation), and configuration management solutions....SuggestedTemporary workFixed term contractFlexible hours- ...Job Description WHAT IS THE OPPORTUNITY? We are seeking a highly skilled Senior Site Reliability Engineer (SRE)to join our dynamic team at RBC Investor Services. In this role, you will have the opportunity to automate and optimize current application support related...SuggestedFull timeFlexible hours
- ...Job Title: Senior Platform Engineer / Senior SRE Developer – Observability (Dynatrace) Location: Toronto, ON Work Style: Hybrid (2 days per week in-... ...Development experience with Python, AWS Lambda, ECS, and Azure Functions • Understanding of AI based system...Work at office2 days per week
- ...IS THE OPPORTUNITY? This role will be responsible for leading the design, development, implementation and support of Site Reliability Engineering (SRE) solutions for applications supported by the Commercial Payments Technology (CPT) SRE organization. The incumbent will need...Full timeFlexible hours
- ...driven decision-making. Technology is a strategic enabler, and reliability, security, and governance are foundational to how their... ..., risk, and analytics teams. Your new role As a Site Reliability Engineer (SRE) focused on User Access & Applications, you’ll sit at the...Contract work
- ...cloud infrastructure solutions on Microsoft Azure. • Implement and manage Infrastructure as Code (IaC) using Terraform or Bicep. • Build and maintain CICD pipelines... ...practices for scalability, security, and reliability. • Troubleshoot cloud platform issues and...
- ...data, portfolio management, risk, and investment operations. Reliability, controlled change, and clear operational readiness are... ...technology supports the business. Your new role As a Site Reliability Engineer (SRE) – Applications, you will focus on operational...Long term contractContract work
$136k - $187k per year
...millions of users worldwide. Our commitment to reliability is a key foundation of our product and our... ...customer availability expectations is a core engineering focus. As a Senior Site Reliability Engineer, you'll join our SRE team based in Europe to ensure our production...Local areaRemote workWorldwide- ...Title: Azure Cloud Infrastructure Engineer – Terraform / CDK Experience Required: 6-8 Required Skills: 1. Azure CDK 2. Terraform 3. DevOps Required Skills: Design and build Azure cloud environments using Infrastructure as...Contract work
- ...San Francisco and founded in 2014, Tubi is part of Tubi Media Group, a division of Fox Corporation. About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software engineering organization that applies a developer's...RemplacementFull timeContract workTemporary workFlexible hours
- ...Job Description SRE PM Toronto - Hybrid (4 Days WFO... ...error budgets| toil management| observability). Lead SRE maturity... ...the primary interfacebetween engineering teams| service management| leadership... ...).Track and report on reliability KPIs| toil reduction| and automation...Contract work
$135k - $185k per year
...Summary Yelp engineering culture is driven by our values : we’re a cooperative team... ...healing, globally-distributed systems? Our Site Reliability engineers keep Yelp fast, available,... ...everything with Python, Puppet, Git, Jenkins, Terraform and more! Develop custom tools, when...Full timeLocal areaRemote work- ...apply now. We are currently seeking a Azure DevOps Engineer to join our team in Toronto, Ontario (... ...in supporting Microsoft Azure and Terraform, and familiar with Cloud Cost Management... ...enhance the environment ensuring the reliability and performance of critical infrastructure...Full timeContract workWork at officeRemote workFlexible hoursShift workWeekend work
- ...Job Title: Kubernetes Engineer Location / Workstyle:... ...Deployment Cloud Platforms (GCP / AKS) Must-Have Requirements... ...Kubernetes Engine) and/or AKS (Azure Kubernetes Service) Strong written... ...to ensure high performance, reliability, and efficiency in virtualized...Contract workRemote work
$112.4k - $162.4k per year
...driving competitive advantage.As an Azure Developer, you are responsible... ..., and Azure Kubernetes Service (AKS), while integrating APIs,... ...collaborate with architects, DevOps engineers, and business stakeholders to ensure applications are reliable, cost-efficient, and aligned with...Full time- ...We are looking for a Database Reliability Engineer to join our team. This is not a traditional... ...think in code, manage infrastructure via Terraform, and treat database provisioning,... ...environments: provisioning, scaling, observability, and reliability — all through Infrastructure...Permanent employmentFull timeInternshipRemote workWorldwide
- ...Job Description: Job Title: Azure DevOps Engineer Architect / Azure Platform Engineer Location... ...: 8-10 Keywords: Azure, Devops, Terraform, Github Education: Bachelor'... ...services, ensuring high availability and reliability - Collaborate with development...Contract workRemote work
- ...Job Description: Job Title: AKS Engineer Location / Workstyle: Remote Skills: Digital : DevOps~Digital : Kubernetes~Digital: Terraform~Github Enterprise Experience Required: 8-10 Essential Skills: Azure devops| Github| Kubernetes| Docker |Helm| Terraform Responsibilities...Contract workRemote work
- ...Build and maintain reusable, versioned Terraform modules aligned with enterprise standards... ...controlled deployments. Establish IaC engineering standards , including: Module design... ...secrets handling Cloud platforms: Azure (preferred) ; AWS/GCP acceptable Core...Contract workRemote workFlexible hours
- ...: Requisition Number: 97456 Azure Administrator (contract ) Location... ...and provide scalable reliable solutions that leverage Azure's... ...Database, Azure Kubernetes Service (AKS), Azure Functions, and Azure... ...storage using Azure Backup, Azure Site Recovery (ASR), and Azure File...Contract workFixed term contractRemote work
- ...Role Overview We are looking for a Senior Azure Serverless Engineer with strong expertise in Azure API Management (APIM) and event-driven... ...authorizations, rate limiting, etc Strong coding skill – IAC – Terraform and Type Script. Python and PowerShell Required Skills...Full time
- ...looking for a Senior Cloud DevOps Engineer to join their growing team.... ...building platforms in Azure and GCP public clouds Promote... ...of experience with developing Terraform modules (preference Enterprise... ...in the office, at the client site, and virtually unless accommodations...Full timeWork at officeLocal areaFlexible hours
- ...Hiring: GCP Engineer – Remote We are looking for a GCP Engineer to support and develop Google Cloud Platform environments with a... ...Experience: 3–5 Years Skills: Google Cloud Platform (GCP), Terraform, Kubernetes, CI/CD Key Responsibilities: Design, deploy...Full timeRemote work
- ...DevOps Engineer – Public Cloud & Kubernetes (GCP/AWS/Azure) Toronto, ON - Hybrid Primary Skills: GCP,AWS, Azure... ...optimizing scalable, secure, and reliable platforms while ensuring all activities... ...workflows using tools such as Terraform and Ansible. Enforce version-...Contract work
- ...Job Description: Job Title: Azure/AWS Cloud DevOps Engineers Location: Toronto, ON Work Style... ...Infrastructure as Code (IaC) principles (Terraform, AWS CDK, CloudFormation). -... ...and implement robust monitoring and observability solutions. - Apply networking...Contract workWork at office2 days per week
- ...What's the opportunity? We’re looking for a Principal MLOps Engineer, Azure who will bring focus and subject-matter expertise around... ...optimizing cloud (Azure) infrastructure using Infrastructure as Code (Terraform) Designing and implementing best practices and standards for...Full timeInternshipLocal areaFlexible hours
$250k per year
...Role: Observability Engineer – Trading Client: Elite FinTech Compensation: $120,000 - $250,000 CAD + Bonus Location: Toronto... ...Working with multiple technical teams to ensure visibility and reliability across systems. Key Responsibilities Monitoring Tools...Permanent employmentImmediate start
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Azure Site Reliability Engineer- Remote. Be the first to apply!
- site reliability engineer Toronto, ON
- site reliability engineer remote Toronto, ON
- senior site reliability engineer Toronto, ON
- site reliability engineer intern Toronto, ON
- site safety Toronto, ON
- website developer Toronto, ON
- site maintenance Toronto, ON
- remote health information technology Toronto, ON
- angular developer remote Toronto, ON
- python developer remote Toronto, ON
