ML Performance Benchmarking Engineer

Cerebras Systems

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

About The Role

The Inference Core Platform group is at the heart of Cerebras' mission to deliver the world’s fastest AI inference. Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE). We are responsible for the full stack—from model compilation and scheduling down to custom hardware kernels and driver development.

The ML Performance Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most advanced computing systems ever built. We drive the bring-up of core inference capabilities and deliver performance improvements at every stage of development – from early prototyping to production deployment.

We're looking for passionate engineers to join us in redefining the limits of AI inference. If you thrive on building systems that measure, analyze, and optimize performance at scale, this is your opportunity to make a transformative impact on the future of AI.

Scope of the team includes:

Core Inference Observability – Design and implement end-to-end telemetry systems across the software stack, providing deep visibility into inference performance and enabling rapid iteration before and after deployment.
Benchmarking Infrastructure – Architect, build, and scale the automation that generates, analyzes, and visualizes performance data used to inform business decisions across engineering and leadership.
Performance Analysis – Dive deep into system behavior, dissect performance bottlenecks, and deliver actionable insights that directly influence which features ship and how they evolve.
Feature Integration – Partner closely with Core Platform teams to define rigorous testing methodologies that validate inference features for peak performance.

Skills & Qualifications

Bachelor’s or Master’s degree in Computer Engineering, Systems Engineering, or a related field.
Proficiency in Python and/or C++ programming.
Proven experience in building and scaling automated infrastructure.
Strong background in throughput and performance optimization techniques, especially in complex, large-scale systems.
Excellent problem-solving skills and a strong analytical mindset.
Demonstrated ability to dive deep into new domains.
Ability to work in a fast-paced, ambiguous, and collaborative environment.

Preferred Skills & Qualifications

Familiarity with problem-solving at the intersection of hardware and software.
Hands-on experience with AI workloads and architectures is a plus.

Location

On-site or hybrid at our Toronto office

#LI-WA1

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.
Publish and open source their cutting-edge AI research.
Work on one of the fastest AI supercomputers in the world.
Enjoy job stability with startup vitality.
Our simple, non-corporate work culture that respects individual beliefs.

Read our blog: Five Reasons to Join Cerebras in 2026.

Apply today and become part of the forefront of groundbreaking advancements in AI!

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Apply

Vacancy posted 10 hours ago

Similar jobs that could be interesting for youBased on the ML Performance Benchmarking Engineer in Toronto, ON vacancy

Performance Engineer
...learning users to effortlessly run large-scale ML applications, without the hassle of... ...About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our... ...and powerful x86 machines, has set new benchmarks in high-performance ML training and inference...
Performance
Local area
Cerebras Systems
Toronto, ON
10 hours ago
ML/AI Engineer
$110k - $150k per year
...you the space to grow. About the Role We are seeking ML/AI Engineers to contribute to major projects. The ML / AI Engineer design... ...automated deployment and rollback Monitor AI systems for performance, drift, bias, and reliability in production Optimize compute...
Performance
Permanent employment
Full time
Remote work
Flexible hours
Levio
Toronto, ON
10 hours ago
LLM Inference Performance & Evals Engineer
...learning users to effortlessly run large-scale ML applications, without the hassle of... ...will prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers... ...Key Responsibilities Prototype and benchmark cutting-edge ideas: new attentions, MoE,...
Performance
Cerebras Systems
Toronto, ON
10 hours ago
Manager, AI/ML Models - Financial Engineering & Modeling
$101k - $169k per year
...clients’ Generative AI offerings, including testing, evaluating and benchmarking different LLMs and toolchains; Significant experience and... ...when it comes to the salaries of our people. We regularly benchmark across a variety of positions, industries, sectors, targets, and...
Suggested
Permanent employment
Flexible hours
Deloitte
Toronto, ON
2 hours ago
Director, ML Engineering & Infrastructure
$188.2k - $268.9k per year
...We are seeking a Director of Machine Learning Engineering and Infrastructure to lead a hybrid team bridging advanced ML engineering with world-class infrastructure design... ...deliver both foundational ML systems and high-performance distributed services. This is a hybrid role...
Performance
Long term contract
Remplacement
Full time
Temporary work
Work at office
Local area
Flexible hours
Tubi - Canada
Toronto, ON
10 hours ago
ML Engineer (BFSI) - MLEBAS
ML Engineer (BFSI) Job Title: Machine Learning Engineer Position Overview: The ML Engineer will develop, deploy, and optimize machine... .... Deploy production-grade ML solutions. Optimize model performance and scalability. Collaborate with data scientists and...
Performance
NavitasPartners
Toronto, ON
8 days ago
ML/AI Research Engineer
$103.2k - $192k per year
...Master's or Ph.D. in Computer Science, Engineering, Mathematics, or a related quantitative field... ...-paced environment. Exemplifies high performance, integrity, and partnership.... ...Designs and develops machine learning (ML) and deep learning systems. Runs machine...
Performance
Full time
Contract work
Part time
Shift work
Toronto, ON
9 days ago
Staff AI / ML Engineer
$171k - $225k per year
...people learn from the world's best. This is not a side initiative. It is the direction of the company. We are looking for a Staff ML Engineer to join our AI engineering team and help define and deliver these products. This role is ultimately about delivery. You will be...
Local area
Remote work
Flexible hours
MasterClass
Toronto, ON
10 hours ago
performance engineer qa.
$55.55 - $61.93 per hour
Our client, a leading organization in the financial technology space, is seeking a highly experienced Load and Performance Lead Engineer to join their team. In this pivotal role, you will drive advanced performance testing capabilities for complex, high-throughput applications...
Performance
Long term contract
Full time
Contract work
Remote work
3 days per week
Randstad
Toronto, ON
11 days ago
Manager, ML/AI Engineer, Data & AI
$103k - $135k per year
...business-oriented Machine Learning / AI Engineer with a passion for building and scaling... ...hands-on engineer with deep experience in AI/ML engineering and AI/ML engineering... ...monitoring. Implement model monitoring, performance tuning, drift detection, and retraining strategies...
Performance
Full time
Internship
Toronto, ON
3 days ago
Senior Systems Engineer - Performance Engineer
$13k - $15.6k per year
...Most Innovative Companies, and Forbes World’s Best Bank. Visit our institutional page About the role Senior System Engineer - Systems Performance Team The Systems Performance team is part of the Computing Squad (Foundation / Runtime Platforms). You will be part of a team...
Performance
Long term contract
Remote work
Work from home
Relocation package
Flexible hours
Nubank
Toronto, ON
10 hours ago
ML Data Engineer
...looking for a Machine Learning Data Engineer to enable data and ML capabilities that directly power the... ...experimental approaches into reliable, high-performing systems Developing and deploying... ..., reliability, governance, and performance at scale while enabling hyper-...
Performance
Full time
Local area
Flexible hours
Royal Bank of Canada
Toronto, ON
a month ago
ML Engineer
$88k - $132k per year
...’re empowered to do your best work. We are looking for an ML Engineer who can design, build, and integrate AI-powered systems by combining... ...systems are production-ready—scalable, reliable, secure, and performant Continuously improve system performance through testing,...
Performance
Full time
Work at office
Equinix
Toronto, ON
a month ago
AWS ML Engineer
...Job Description: Role: AWS ML Engineer Location: Toronto Office Hybrid: 2 days a week in office Primary Skills: (AWS ML... ...with applications using APIs and cloud services Optimize model performance, scalability, and cost on AWS infrastructure Collaborate with...
Performance
Contract work
Work at office
2 days per week
Astra North Infoteck Inc.
Toronto, ON
a month ago
Engineering Manager, Performance & Resilience (Auth0)
$146k - $201.3k per year
...opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. The Engineering Opportunity Reporting to the Director of Quality & Performance, this role as Engineering Manager of Performance & Resilience will drive performance and...
Performance
Local area
Worldwide
Okta
Toronto, ON
10 hours ago
AI/ML Engineer (GenAI, LLM, Java)
...AI/ML Engineer (GenAI, LLM, Java) Location: Toronto, ON Experience Required: 5+ Role Overview We are seeking a highly skilled... ...requirements Optimize AI workflows for: Scalability Performance Cost-efficiency Leverage Oracle databases for data...
Performance
Contract work
Astra North Infoteck Inc.
Toronto, ON
17 days ago
Performance Quality Engineering Consultant
$80k - $130k per year
Performance Quality Engineering Consultant Position Description This role is hybrid and requires you to be at our client's and/or downtown... ...detailed performance test reports, dashboards, and performance benchmarking metrics. • Support root cause analysis and...
Performance
2 days per week
Toronto, ON
a month ago
Senior Performance Test Engineer (NFT - JMeter & ReadyAPI)
...Senior QA Performance Tester Location: Toronto, ON Work Model: Hybrid (3 Days WFO) Duration: 6–12 Months Mandatory... ...validation. Ensure applications meet defined performance benchmarks, scalability targets, and reliability standards before...
Performance
Contract work
Shift work
Astra North Infoteck Inc.
Toronto, ON
14 days ago
QA Engineer - Oracle, ETL & Performance Testing
...QA Engineer – Oracle, ETL & Performance Testing Role Description QA Engineer role with 8–10 years of experience in Oracle, SQL, and performance testing tools. Strong expertise in SQL (complex queries) and Oracle databases with performance optimization skills....
Performance
Contract work
Astra North Infoteck Inc.
Toronto, ON
16 days ago
Social Performance Analyst
...ROLE: SOCIAL PERFORMANCE ANALYST TEAM: THE KITCHEN NORTH AMERICA LOCATION: TORONTO (HYBRID) COMPANY OVERVIEW: The Kitchen brings... ...-based) ~ Partner with strategy teams to define KPIs, benchmarks, and measurement frameworks that reflect real success INNOVATION...
Performance
Full time
Live In
Shift work
SALT XC
Toronto, ON
10 hours ago
Performance Manager
...15 new stations and related works being performed by three key P3 Contractors and multiple... ...Consultant (PCSC). Position Summary The Performance Manager, reporting to the Production... ...systems to measure performance of engineering and construction projects. Experience...
Performance
Long term contract
Full time
Contract work
For contractors
Work at office
Local area
Remote work
Relocation
Bechtel
Toronto, ON
9 days ago
Senior Analyst, Performance Solutions
$76k - $90k per year
...powered StackAdapt Marketing Platform seamlessly connects brand and performance marketing to drive measurable results across the entire... ...productization. Define and track performance OKRs (e.g., data benchmarks, measurement accuracy, lower-funnel KPIs) and communicate strategic...
Performance
Local area
Remote work
Work from home
Home office
StackAdapt
Toronto, ON
10 hours ago
Investment Performance Analyst
...research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo ,... ...and asynchronously to meet deadlines and improve AI model performance . Qualifications Must-Have ~2+ years of experience...
Performance
Remote job
Hourly pay
Weekly pay
Contract work
For contractors
Summer work
Mercor
Toronto, ON
16 days ago
Principal Machine Learning Engineer, General AI, ML & Big Data
...just encouraged it's expected. The Role As one of our Principal ML Engineer’s, you'll be a key technical leader and thought leader, shaping our ML strategy and building intelligent, high-performance multi-agent systems that perceive, learn, and act in real time. What...
Performance
Shift work
C-Serv
Toronto, ON
14 days ago
Robotics ML Expert
..., and interact with the physical world? We're looking for Robotics ML Experts in Canada's vibrant AI research hub to design, build, and refine MuJoCo simulation environments that train AI systems to perform real-world tasks — from locomotion and dexterous manipulation to complex...
Hourly pay
Ongoing contract
Contract work
Freelance
Remote work
Flexible hours
Alignerr
Toronto, ON
24 days ago
Senior Technical Product Manager - ML & Analytics
$92k - $142k per year
...of automation tools, data pipelines, and ML models, empowering users to effectively and... ...that will help improve operations and performance across our business. Partner with executive... ...with a high-performing group of Engineers, Data Scientists, and Analysts acting as...
Performance
Full time
Local area
Zynga
Toronto, ON
10 hours ago
Senior Software Engineer, ML Pipelines, Data & Automation
$155k - $213k per year
...learn more visit: As a Senior Software Engineer embedded within our Autonomy & Algorithms... .... You will... Work across the ML development lifecycle, including dataset... ...provide a holistic understanding of model performance and enable the discovery of interesting scenarios...
Performance
Remote job
Full time
Work at office
Work from home
Flexible hours
Waabi
Toronto, ON
more than 2 months ago
Coordinator, Fitness & Performance
$72.12k per year
...through outstanding undergraduate and graduate education programs, cutting-edge research and the delivery of sport, recreation and high performance athletic opportunities for students, staff, faculty and community members across the three campuses. In achieving this vision, the...
Performance
Full time
Casual work
Day shift
Afternoon shift
University of Toronto
Toronto, ON
21 hours ago
Applied ML Researcher - Fully Remote | Upto $90/hr
$90 per hour
...in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel ,... ...Dorsey . Position: Machine Learning Engineer Expert Type: Contract Compensation... ...strategies, and evaluation metrics. Perform exploratory data analysis, feature...
Performance
Remote job
Contract work
Summer work
Mercor
Toronto, ON
2 days ago
Performance Marketing Specialist - Google Ads
usd60k - usd80k per year
We're looking for a sharp, strategic Performance Marketing Specialist- Google Ads to manage paid... ...marketers, Google Ads specialists, and engineers Conduct regular keyword research to... ...Analytics An understanding of search engine optimization (SEO) and search engine...
Performance
Full time
Internship
Local area
Silk & Snow
Toronto, ON
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Performance Benchmarking Engineer. Be the first to apply!