Site Reliability Engineer, Metal
$100k per year
Tenstorrent
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
Tenstorrent is building large-scale AI systems across internal clusters and customer deployments. This role sits at the intersection of site reliability, infrastructure operations, and customer engineering, ensuring our systems are reliable, observable, and production-ready.
This role is hybrid, based out of Toronto, ON; Austin, TX; or Santa Clara, CA.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are
- Experienced in site reliability, infrastructure, or systems engineering in distributed environments.
- Strong Linux systems knowledge with the ability to troubleshoot complex multi-layer issues.
- Proficient with observability tools such as Prometheus, Grafana, and alerting systems.
- Comfortable with scripting and automation using Python, Go, or similar languages.
- Solid understanding of networking fundamentals and how systems behave at scale.
What We Need
- Ensure reliability and operational health of Tenstorrent systems across internal and customer environments.
- Troubleshoot complex issues across compute, networking, and software layers.
- Partner with engineering teams and customers to resolve production incidents.
- Design and improve monitoring, observability, and alerting systems.
- Build automation to reduce operational toil and improve system reliability.
What You Will Learn
- How large-scale AI infrastructure is operated across internal clusters and customer deployments.
- How distributed systems behave under real-world production conditions.
- How observability and automation drive reliability at scale.
- How hardware, networking, and software systems interact in AI environments.
- How customer-facing AI infrastructure is deployed, supported, and optimized.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
