Site Reliability Engineer (SRE) - Azure AKS, Observability & Terraform
Astra North Infoteck Inc.
Site Reliability Engineer (SRE) – Azure AKS, Observability & Terraform
Remote Role
Key Responsibilities
- Observability, SRE, DevOps roles with expertise in infrastructure and application reliability
- Dynatrace, ELK, Splunk, PagerDuty
- SLI/SLO frameworks
- Azure Kubernetes Service (AKS), Terraform, Azure managed services
What will you do
- Design and implement observability-as-code solutions using Terraform for monitoring pipelines, dashboards, and alerting across distributed systems
- Drive observability improvements using Dynatrace, ELK, Splunk, PagerDuty for real-time performance insights and system visibility
- Instrument applications for end-to-end observability including distributed tracing, metrics collection, and log aggregation across Node.js and .NET microservices and event-driven architectures
- Troubleshoot complex production incidents across service layers, databases, caches, and APIs using SLI/SLO frameworks
- Investigate and resolve Azure Kubernetes Service (AKS) infrastructure issues ensuring reliability and scalability of containerized workloads using Terraform and Azure services (SQL MI, Redis, Functions, Event Grid)
- Translate business requirements into observable, resilient systems aligned to SLIs/SLOs
- Automate operational tasks using Infrastructure-as-Code and CI/CD to reduce toil and improve resilience
- Lead incident response and remediation for critical systems, including blameless postmortems and chaos engineering practices
- Collaborate with development, platform, and business teams to improve availability, scalability, and operational excellence
What do you need to succeed
Must-have
- 8+ years experience in SRE, DevOps, or Observability roles focused on infrastructure and application reliability
- Strong expertise in Dynatrace, ELK, Splunk, PagerDuty and observability principles (instrumentation, correlation IDs, SLIs/SLOs)
- Advanced proficiency in Azure Kubernetes Service (AKS), Terraform, and Azure managed services (SQL MI, Redis, Functions, Event Grid)
- Hands-on experience with observability instrumentation (distributed tracing, metrics, logs) across Node.js and .NET microservices and event-driven systems
- Strong troubleshooting skills across distributed systems (services, databases, caches, APIs) in production environments
- Incident management expertise using PagerDuty and ServiceNow, including high-severity incident resolution and RCA
- Knowledge of incident, problem, and change management, SRE principles, blameless postmortems, and chaos engineering
- Strong communication and leadership skills for cross-functional coordination and incident handling
$153k - $187k per year
...clear signal owners can use to run stronger, more resilient businesses. We’re looking for an incredible Senior Site Reliability Engineer to join our SRE team. We aim to make reliability, security, and speed reinforce one another so that the platform becomes the engine...SuggestedFull timeInternship$120k - $200k per year
...Coinbase Ventures, Uniswap Labs, Circle Ventures, Delphi Digital, and many more. ABOUT THE ROLE At LayerZero, our Site Reliability Engineering (SRE) team is at the intersection of software and systems engineering, dedicated to crafting and maintaining large-scale,...Suggested$80k - $100k per year
...you. Job Description We are looking for a talented Site Reliability Engineer to join our growing global team at Sectigo. The Site Reliability... ...our critical products and services by meeting or exceeding SRE objectives. Instantiate and maintain production...SuggestedFull timeInternshipLive outRemote work- ...Saviynt • Work on a mission-critical SaaS platform used by global enterprises • Solve complex reliability challenges at scale • Influence architecture and engineering culture at a company level • Competitive compensation, benefits, and growth opportunities...Suggested
- ...experience with PowerApps, Power Automate, Dataverse Proficiency in PowerShell and Kusto Query Language Experience with SharePoint and Azure services Knowledge of Azure DevOps and CI/CD practices A problem-solver who thrives in a fast-paced environment If you’re...Suggested
$65k - $130k per year
...encourages employees to collaborate and learn from each other, completely free of barriers. Your role: As a DevOps/SRE, you will be responsible for the reliability and smooth operation of your service in both production and test environments. At Global Relay we use leading...Full time$90k - $110k per year
...Job Responsibility: Netskrt is seeking a talented and motivated Software Engineer (Observability) based in Vancouver, BC to be part of our platform team. If you are passionate about building metrics and log collection systems, this is an exciting opportunity to make a...Full timeWork at office$115k - $125k per year
...LNG) export facility on the previous Woodfibre pulp mill site, which would have a storage capacity of 250,000 m3 and would... ...Pacific Energy Corporation Limited. Position The Reliability & Integrity (R&I) Engineer is an integral part of Woodfibre LNG ’s dynamic team....For contractorsWork at office$125k - $150k per year
...unified network security and observability platform to prevent, detect and... ...Role As a Senior TestOps Engineer within our Delivery... ...Golang, Docker/Kubernetes, and Terraform/Pulumi. Eliminate Toil: Identify... ...Software Test Tooling, DevOps, or Site Reliability Engineering (SRE)...Flexible hours$120k - $140k per year
...motivated Senior Infrastructure Engineer to join our dynamic IT... ...key role in ensuring system reliability, scalability, and performance... ...infrastructure solutions across Azure and hybrid on-premises environments... ...PowerShell, ARM, Bicep, or Terraform Implement security...Full time$176.26k - $220.32k per year
...a Senior Principal Software Engineer, you will be a technical leader... ...the highest standards of reliability, performance, and maintainability... ...operating systems on AWS, Azure, or Google Cloud at scale. Deep... ...Knowledge and proficiency in Terraform, CloudFormation, or Ansible....Long term contractRemote work- ...ideal for someone who takes pride in hard work, shows initiative on site, and is eager to be part of a construction team delivering high-... ...to follow instructions and work effectively within a team • Reliable, punctual, and motivated • Good attitude and willingness to learn...Full timeTemporary workFor subcontractor
$85k - $100k per year
...is seeking a skilled DevOps Engineer to join our team in Vancouver... ...lifecycle and overall system reliability. You will be instrumental in... ...consistency. CI/CD and System Observability : You will be responsible... ...hands-on experience in a DevOps, SRE, or similar role. ~ Strong...Work at office$150k - $200k per year
...products, ensuring scalability, reliability, and long-term... ...infrastructure, including orchestration, observability, safety, and integration... ...technical multiplier, setting engineering standards, mentoring... ...cloud infrastructure (AWS, Azure, or GCP) and building reliable...Long term contractDirect hire$37.5 - $39 per hour
...Languages English Education ~ Secondary (high) school graduation certificate Experience 2 years to less than 3 years On site Work must be completed at the physical location. There is no option to work remotely. Responsibilities Tasks Prepare...Permanent employmentFull timeApprenticeshipRemote work- TRS Staffing Solutions has an exciting opportunity for a Site Health & Safety Administrator to support our clients’ team. This is an on... ...TRS Staffing Solutions (Canada) Inc. specializes in supplying engineers, designers, project managers, and other technical and professional...Contract workWork at officeWorldwide
$122.3k - $170.7k per year
...our studio partners to improve reliability, scalability, and efficiency... ...CI/CD pipelines (Jenkins, Azure DevOps, GitLab) Write automation... ...infrastructure using Terraform, Packer, Ansible, or Chef... ...with us to improve reliability, observability, and developer workflows Contribute...Full timeLocal area$28 per hour
...apply for this role. TRS Staffing Solutions has an exciting opportunity for a Site Administrative Assistant role to work for one of Canada's top employers and a global leader in the Engineering, Procurement, Construction Management industry. Responsibilities: Maintain...Contract workWork at officeWorldwide- ...leading AI-powered Quality Engineering Company? Ready to advance your... ...logging (CloudWatch, basic observability tools) * Automate build, deployment... ...(preferred) or Terraform-Containers: Docker-Kubernetes... ...organizations ensure systems perform reliably in real-world conditions....Full timeFor contractorsLocal areaRemote work
- ...Job Title: Senior Site Supervisor – Commercial Construction Company: SAWW Developments Location: Williams Lake, British Columbia, Canada About SAWW Developments SAWW Developments is an established commercial construction firm specializing in complex interior construction...For contractorsFor subcontractorInternshipRelocation
- ...Join INTRALOT as a DevOps Engineer At INTRALOT, we shape the future... ...industry with scalable, reliable, and cutting-edge systems. Here... ...using GitOps, IaC, and advanced observability to keep our systems stable,... ...optimize infrastructure using Terraform, Helm, AWS CloudFormation and...Permanent employmentFull timeWorldwide
$117k - $167k per year
...looking for an Application Security Engineer to join the Agentic Platform pillar,... ...Partner with engineering teams to harden Azure Kubernetes Service (AKS) workloads, identity and access (... ...Infrastructure-as-code: Bicep or Terraform for security configuration, policy-as...Permanent employmentFull timeWorldwideFlexible hoursShift work$120k - $145k per year
...unified network security and observability platform to prevent, detect and... ...will be joining our Delivery Engineering team. You will work closely... ...with many of the following: Terraform, Ansible, Jenkins, SemaphoreCI... ...Ginkgo, GCP, AWS, GKE, EKS, AKS, Rancher, and OpenShift....Flexible hours$120k - $140k per year
...We’re looking for a DevOps Engineer to help support and grow Later... ..., AWS, Kubernetes, CI/CD, Terraform , while also helping support... ...MLOps. You’ll help maintain reliable infrastructure, improve deployment... ...improve system efficiency, observability, developer experience, model...Long term contractPermanent employmentLocal areaRemote work$120k - $145k per year
...Site Superintendent – Vancouver Novacom is a commercial and multi-family General Contractor based in Surrey, BC. We value people, relationships, innovation and culture, and we believe in constantly improving ourselves to provide the best possible service to our clients...Full timeFor contractorsSummer workImmediate start- ...experienced Senior AWS Cloud Engineer to join our engineering team at... ...infrastructure, ensuring system reliability, and implementing disaster... ...Collaboration: ~ Work closely with SRE team on infrastructure... ...code tools, particularly Pulumi, Terraform or Cloud Formation. ~ Strong...Full timeInternshipWork at officeDay shift
- ...Overview Job Skills / Requirements The Site Supervisor oversees and coordinates the security operations at Rocky Mountaineer and other RM properties as assigned; identifies potential security problems and develops corresponding plans and protocols to ensure safety of...RemplacementFull timeContract workImmediate start
$63k - $81k per year
...Vancouver, Canada . This is a full-time, on-site position where you'll support critical... ...and asset tracking Collaborate with engineering and operations teams Participate in capacity... ...hardware and networking knowledge Reliable and professional with strong...Permanent employmentFull timeFreelanceLocal areaFlexible hoursShift workNight shiftWeekend work$120k - $150k per year
...Job Title: Director of Engineering Department: Technology/Engineering... ...Monetization The Coaches Site is an education and social... ...decisions that support scalability, reliability, security, and speed of... ...data infrastructure. Drive observability, reliability, and operational...Long term contractPermanent employmentFull timeInternshipRemote workWorldwideFlexible hours$122k - $170k per year
...and major cloud providers (AWS, Azure, GCP) Calculates and... ...payments experiences and the reliability and scalability of our backend systems We’re a group of engineers who care deeply about code quality, correctness, and observability, and who are comfortable collaborating...Full timeRemote workWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer (SRE) - Azure AKS, Observability & Terraform. Be the first to apply!
- site reliability engineer remote Vancouver, BC
- senior site reliability engineer Vancouver, BC
- site reliability engineer Vancouver, BC
- site reliability engineer intern Vancouver, BC
- site carpenter Vancouver, BC
- website developer Vancouver, BC
- site safety Vancouver, BC
- site maintenance Vancouver, BC
- site reliability engineer remote
- senior site reliability engineer
