Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer (SRE) - Azure AKS, Observability & Terraform

Temporary

Astra North Infoteck Inc.

Site Reliability Engineer (SRE) – Azure AKS, Observability & Terraform

Remote Role

Key Responsibilities

  • Observability, SRE, DevOps roles with expertise in infrastructure and application reliability
  • Dynatrace, ELK, Splunk, PagerDuty
  • SLI/SLO frameworks
  • Azure Kubernetes Service (AKS), Terraform, Azure managed services

What will you do

  • Design and implement observability-as-code solutions using Terraform for monitoring pipelines, dashboards, and alerting across distributed systems
  • Drive observability improvements using Dynatrace, ELK, Splunk, PagerDuty for real-time performance insights and system visibility
  • Instrument applications for end-to-end observability including distributed tracing, metrics collection, and log aggregation across Node.js and .NET microservices and event-driven architectures
  • Troubleshoot complex production incidents across service layers, databases, caches, and APIs using SLI/SLO frameworks
  • Investigate and resolve Azure Kubernetes Service (AKS) infrastructure issues ensuring reliability and scalability of containerized workloads using Terraform and Azure services (SQL MI, Redis, Functions, Event Grid)
  • Translate business requirements into observable, resilient systems aligned to SLIs/SLOs
  • Automate operational tasks using Infrastructure-as-Code and CI/CD to reduce toil and improve resilience
  • Lead incident response and remediation for critical systems, including blameless postmortems and chaos engineering practices
  • Collaborate with development, platform, and business teams to improve availability, scalability, and operational excellence

What do you need to succeed

Must-have

  • 8+ years experience in SRE, DevOps, or Observability roles focused on infrastructure and application reliability
  • Strong expertise in Dynatrace, ELK, Splunk, PagerDuty and observability principles (instrumentation, correlation IDs, SLIs/SLOs)
  • Advanced proficiency in Azure Kubernetes Service (AKS), Terraform, and Azure managed services (SQL MI, Redis, Functions, Event Grid)
  • Hands-on experience with observability instrumentation (distributed tracing, metrics, logs) across Node.js and .NET microservices and event-driven systems
  • Strong troubleshooting skills across distributed systems (services, databases, caches, APIs) in production environments
  • Incident management expertise using PagerDuty and ServiceNow, including high-severity incident resolution and RCA
  • Knowledge of incident, problem, and change management, SRE principles, blameless postmortems, and chaos engineering
  • Strong communication and leadership skills for cross-functional coordination and incident handling
Vacancy posted a month ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer (SRE) - Azure AKS, Observability & Terraform in Vancouver, BC vacancy
  • $120k - $200k per year

     ...Coinbase Ventures, Uniswap Labs, Circle Ventures, Delphi Digital, and many more.   ABOUT THE ROLE At LayerZero, our Site Reliability Engineering (SRE) team is at the intersection of software and systems engineering, dedicated to crafting and maintaining large-scale,... 
    Suggested

    LayerZero Labs

    Vancouver, BC
    9 days ago
  • $80k - $100k per year

     ...you. Job Description We are looking for a talented Site Reliability Engineer to join our growing global team at Sectigo. The Site Reliability...  ...our critical products and services by meeting or exceeding SRE objectives.  Instantiate and maintain production... 
    Suggested
    Full time
    Internship
    Live out
    Remote work

    Sectigo

    Vancouver, BC
    10 days ago
  •  ...Saviynt   • Work on a mission-critical SaaS platform used by global enterprises • Solve complex reliability challenges at scale • Influence architecture and engineering culture at a company level • Competitive compensation, benefits, and growth opportunities... 
    Suggested

    saviynt

    Vancouver, BC
    5 days ago
  •  ...experience with PowerApps, Power Automate, Dataverse Proficiency in PowerShell and Kusto Query Language Experience with SharePoint and Azure services Knowledge of Azure DevOps and CI/CD practices A problem-solver who thrives in a fast-paced environment If you’re... 
    Suggested

    HCLTech

    Vancouver, BC
    22 hours ago
  • $90k - $110k per year

     ...Job Responsibility: Netskrt is seeking a talented and motivated Software Engineer (Observability) based in Vancouver, BC to be part of our platform team. If you are passionate about building metrics and log collection systems, this is an exciting opportunity to make a... 
    Suggested
    Full time
    Work at office

    Netskrt Systems Inc.

    Vancouver, BC
    7 days ago
  • $65k - $130k per year

     ...encourages employees to collaborate and learn from each other, completely free of barriers. Your role: As a DevOps/SRE, you will be responsible for the reliability and smooth operation of your service in both production and test environments. At Global Relay we use leading... 
    Full time

    Global Relay

    Vancouver, BC
    10 hours ago
  • $115k - $125k per year

     ...LNG) export facility on the previous Woodfibre pulp mill site, which would have a storage capacity of 250,000 m3 and would...  ...Pacific Energy Corporation Limited. Position The Reliability & Integrity (R&I) Engineer is an integral part of Woodfibre LNG ’s dynamic team.... 
    For contractors
    Work at office

    Pacific Energy Canada

    Vancouver, BC
    26 days ago
  • $125k - $150k per year

     ...unified network security and observability platform to prevent, detect and...  ...Role As a Senior TestOps Engineer within our Delivery...  ...Golang, Docker/Kubernetes, and Terraform/Pulumi. Eliminate Toil: Identify...  ...Software Test Tooling, DevOps, or Site Reliability Engineering (SRE)... 
    Flexible hours

    Tigera

    Vancouver, BC
    2 days ago
  • $120k - $140k per year

     ...motivated Senior Infrastructure Engineer to join our dynamic IT...  ...key role in ensuring system reliability, scalability, and performance...  ...infrastructure solutions across Azure and hybrid on-premises environments...  ...PowerShell, ARM, Bicep, or Terraform Implement security... 
    Full time

    Global Relay

    Vancouver, BC
    10 hours ago
  • $176.26k - $220.32k per year

     ...a Senior Principal Software Engineer, you will be a technical leader...  ...the highest standards of reliability, performance, and maintainability...  ...operating systems on AWS, Azure, or Google Cloud at scale. Deep...  ...Knowledge and proficiency in Terraform, CloudFormation, or Ansible.... 
    Long term contract
    Remote work

    Boomi

    Vancouver, BC
    20 days ago
  •  ...ideal for someone who takes pride in hard work, shows initiative on site, and is eager to be part of a construction team delivering high-...  ...to follow instructions and work effectively within a team • Reliable, punctual, and motivated • Good attitude and willingness to learn... 
    Full time
    Temporary work
    For subcontractor

    FMI

    Vancouver, BC
    21 days ago
  • $85k - $100k per year

     ...is seeking a skilled DevOps Engineer to join our team in Vancouver...  ...lifecycle and overall system reliability. You will be instrumental in...  ...consistency. CI/CD and System Observability : You will be responsible...  ...hands-on experience in a DevOps, SRE, or similar role. ~ Strong... 
    Work at office

    WOW 1 DAY PAINTING

    Vancouver, BC
    3 days ago
  • TRS Staffing Solutions has an exciting opportunity for a Site Health & Safety Administrator to support our clients’ team. This is an on...  ...TRS Staffing Solutions (Canada) Inc. specializes in supplying engineers, designers, project managers, and other technical and professional... 
    Contract work
    Work at office
    Worldwide
    Vancouver, BC
    13 days ago
  • $150k - $200k per year

     ...products, ensuring scalability, reliability, and long-term...  ...infrastructure, including orchestration, observability, safety, and integration...  ...technical multiplier, setting engineering standards, mentoring...  ...cloud infrastructure (AWS, Azure, or GCP) and building reliable... 
    Long term contract
    Direct hire

    Jobright.ai

    Vancouver, BC
    21 days ago
  • $122.3k - $170.7k per year

     ...our studio partners to improve reliability, scalability, and efficiency...  ...CI/CD pipelines (Jenkins, Azure DevOps, GitLab) Write automation...  ...infrastructure using Terraform, Packer, Ansible, or Chef...  ...with us to improve reliability, observability, and developer workflows Contribute... 
    Full time
    Local area

    Electronic Arts (EA)

    Vancouver, BC
    7 days ago
  • $37.5 - $39 per hour

     ...Languages English Education ~ Secondary (high) school graduation certificate Experience 2 years to less than 3 years On site Work must be completed at the physical location. There is no option to work remotely. Responsibilities Tasks Prepare... 
    Permanent employment
    Full time
    Apprenticeship
    Remote work

    QKD Construction Ltd.

    Vancouver, BC
    3 days ago
  •  ...leading AI-powered Quality Engineering Company? Ready to advance your...  ...logging (CloudWatch, basic observability tools) * Automate build, deployment...  ...(preferred) or Terraform-Containers: Docker-Kubernetes...  ...organizations ensure systems perform reliably in real-world conditions.... 
    Full time
    For contractors
    Local area
    Remote work

    Qualitest Group

    Vancouver, BC
    5 days ago
  • $28 per hour

     ...apply for this role. TRS Staffing Solutions has an exciting opportunity for a Site Administrative Assistant role to work for one of Canada's top employers and a global leader in the Engineering, Procurement, Construction Management industry. Responsibilities: Maintain... 
    Contract work
    Work at office
    Worldwide
    Vancouver, BC
    25 days ago
  •  ...Job Title: Senior Site Supervisor – Commercial Construction Company: SAWW Developments Location: Williams Lake, British Columbia, Canada About SAWW Developments SAWW Developments is an established commercial construction firm specializing in complex interior construction... 
    For contractors
    For subcontractor
    Internship
    Relocation

    SAWW Developments

    Vancouver, BC
    12 days ago
  •  ...Join INTRALOT as a DevOps Engineer At INTRALOT, we shape the future...  ...industry with scalable, reliable, and cutting-edge systems. Here...  ...using GitOps, IaC, and advanced observability to keep our systems stable,...  ...optimize infrastructure using Terraform, Helm, AWS CloudFormation and... 
    Permanent employment
    Full time
    Worldwide

    BALLY’S INTRALOT SA

    Vancouver, BC
    11 days ago
  • $120k - $145k per year

     ...unified network security and observability platform to prevent, detect and...  ...will be joining our Delivery Engineering team. You will work closely...  ...with many of the following: Terraform, Ansible, Jenkins, SemaphoreCI...  ...Ginkgo, GCP, AWS, GKE, EKS, AKS, Rancher, and OpenShift.... 
    Flexible hours

    Tigera

    Vancouver, BC
    10 hours ago
  • $120k - $145k per year

     ...Site Superintendent – Vancouver Novacom is a commercial and multi-family General Contractor based in Surrey, BC. We value people, relationships, innovation and culture, and we believe in constantly improving ourselves to provide the best possible service to our clients... 
    Full time
    For contractors
    Summer work
    Immediate start

    Novacom Building Partners

    Vancouver, BC
    3 days ago
  • $117k - $167k per year

     ...looking for an Application Security Engineer to join the Agentic Platform pillar,...  ...Partner with engineering teams to harden Azure Kubernetes Service (AKS) workloads, identity and access (...  ...Infrastructure-as-code: Bicep or Terraform for security configuration, policy-as... 
    Permanent employment
    Full time
    Worldwide
    Flexible hours
    Shift work

    IFS

    Vancouver, BC
    4 days ago
  •  ...Overview Job Skills / Requirements The Site Supervisor oversees and coordinates the security operations at Rocky Mountaineer and other RM properties as assigned; identifies potential security problems and develops corresponding plans and protocols to ensure safety of... 
    Remplacement
    Full time
    Contract work
    Immediate start
    Vancouver, BC
    10 days ago
  • $120k - $140k per year

     ...We’re looking for a DevOps Engineer to help support and grow Later...  ..., AWS, Kubernetes, CI/CD, Terraform , while also helping support...  ...MLOps. You’ll help maintain reliable infrastructure, improve deployment...  ...improve system efficiency, observability, developer experience, model... 
    Long term contract
    Permanent employment
    Local area
    Remote work

    Later

    Vancouver, BC
    17 days ago
  •  ...an experienced Senior DevOps Engineer to join our engineering team at...  ...infrastructure, ensuring system reliability, and implementing disaster...  ...Collaboration:  ~ Work closely with SRE team on infrastructure...  ...code tools, particularly Pulumi, Terraform or Cloud Formation. ~ Strong... 
    Full time
    Internship
    Work at office
    Day shift

    RAZR Marketing, Inc.

    Vancouver, BC
    10 days ago
  • $63k - $81k per year

     ...Vancouver, Canada . This is a full-time, on-site position where you'll support critical...  ...and asset tracking Collaborate with engineering and operations teams Participate in capacity...  ...hardware and networking knowledge Reliable and professional with strong... 
    Permanent employment
    Full time
    Freelance
    Local area
    Flexible hours
    Shift work
    Night shift
    Weekend work

    RM Staffing B.V.

    Vancouver, BC
    23 days ago
  • $120k - $150k per year

     ...Job Title: Director of Engineering Department: Technology/Engineering...  ...Monetization The Coaches Site is an education and social...  ...decisions that support scalability, reliability, security, and speed of...  ...data infrastructure. Drive observability, reliability, and operational... 
    Long term contract
    Permanent employment
    Full time
    Internship
    Remote work
    Worldwide
    Flexible hours

    The Coaches Site

    Vancouver, BC
    13 hours ago
  •  ...Downtown Vancouver (On-Site) • Full-Time   Here's Your Chance...  ...touched by modern software engineering, let alone AI. EviSmart is...  ...on compliance, security, and reliability so the quality bar isn't a conversation...  ...:  Docker, Kubernetes, Terraform CI/CD:  GitHub Actions... 
    Long term contract
    Full time
    Temporary work
    Internship
    Work at office
    Immediate start

    Evismart

    Vancouver, BC
    10 hours ago
  •  ...Join INTRALOT as a Platform Engineer - Cloud Networking (AWS) - Powering...  ...industry with scalable, reliable, and cutting-edge systems. Here...  ...infrastructure using Terraform, Ansible, CloudFormation, and...  ...peer reviewed deployments. 🔎 Observe, Measure & Troubleshoot Traffic... 
    Permanent employment
    Full time
    Worldwide

    BALLY’S INTRALOT SA

    Vancouver, BC
    11 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer (SRE) - Azure AKS, Observability & Terraform. Be the first to apply!