ML Performance Benchmarking Engineer
Cerebras Systems
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.
Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.
Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.
About The Role
The Inference Core Platform group is at the heart of Cerebras' mission to deliver the world’s fastest AI inference. Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE). We are responsible for the full stack—from model compilation and scheduling down to custom hardware kernels and driver development.
The ML Performance Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most advanced computing systems ever built. We drive the bring-up of core inference capabilities and deliver performance improvements at every stage of development – from early prototyping to production deployment.
We're looking for passionate engineers to join us in redefining the limits of AI inference. If you thrive on building systems that measure, analyze, and optimize performance at scale, this is your opportunity to make a transformative impact on the future of AI.
Scope of the team includes:
- Core Inference Observability – Design and implement end-to-end telemetry systems across the software stack, providing deep visibility into inference performance and enabling rapid iteration before and after deployment.
- Benchmarking Infrastructure – Architect, build, and scale the automation that generates, analyzes, and visualizes performance data used to inform business decisions across engineering and leadership.
- Performance Analysis – Dive deep into system behavior, dissect performance bottlenecks, and deliver actionable insights that directly influence which features ship and how they evolve.
- Feature Integration – Partner closely with Core Platform teams to define rigorous testing methodologies that validate inference features for peak performance.
Skills & Qualifications
- Bachelor’s or Master’s degree in Computer Engineering, Systems Engineering, or a related field.
- Proficiency in Python and/or C++ programming.
- Proven experience in building and scaling automated infrastructure.
- Strong background in throughput and performance optimization techniques, especially in complex, large-scale systems.
- Excellent problem-solving skills and a strong analytical mindset.
- Demonstrated ability to dive deep into new domains.
- Ability to work in a fast-paced, ambiguous, and collaborative environment.
Preferred Skills & Qualifications
- Familiarity with problem-solving at the intersection of hardware and software.
- Hands-on experience with AI workloads and architectures is a plus.
Location
- On-site or hybrid at our Toronto office
#LI-WA1
Why Join Cerebras
People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
- Build a breakthrough AI platform beyond the constraints of the GPU.
- Publish and open source their cutting-edge AI research.
- Work on one of the fastest AI supercomputers in the world.
- Enjoy job stability with startup vitality.
- Our simple, non-corporate work culture that respects individual beliefs.
Read our blog: Five Reasons to Join Cerebras in 2026.
Apply today and become part of the forefront of groundbreaking advancements in AI!
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.
- ...learning users to effortlessly run large-scale ML applications, without the hassle of... ...About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our... ...and powerful x86 machines, has set new benchmarks in high-performance ML training and inference...PerformanceLocal area
$110k - $150k per year
...you the space to grow. About the Role We are seeking ML/AI Engineers to contribute to major projects. The ML / AI Engineer design... ...automated deployment and rollback Monitor AI systems for performance, drift, bias, and reliability in production Optimize compute...PerformancePermanent employmentFull timeRemote workFlexible hours- ...learning users to effortlessly run large-scale ML applications, without the hassle of... ...will prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers... ...Key Responsibilities Prototype and benchmark cutting-edge ideas: new attentions, MoE,...Performance
$101k - $169k per year
...clients’ Generative AI offerings, including testing, evaluating and benchmarking different LLMs and toolchains; Significant experience and... ...when it comes to the salaries of our people. We regularly benchmark across a variety of positions, industries, sectors, targets, and...SuggestedPermanent employmentFlexible hours$188.2k - $268.9k per year
...We are seeking a Director of Machine Learning Engineering and Infrastructure to lead a hybrid team bridging advanced ML engineering with world-class infrastructure design... ...deliver both foundational ML systems and high-performance distributed services. This is a hybrid role...PerformanceLong term contractRemplacementFull timeTemporary workWork at officeLocal areaFlexible hours- ML Engineer (BFSI) Job Title: Machine Learning Engineer Position Overview: The ML Engineer will develop, deploy, and optimize machine... .... Deploy production-grade ML solutions. Optimize model performance and scalability. Collaborate with data scientists and...Performance
$103.2k - $192k per year
...Master's or Ph.D. in Computer Science, Engineering, Mathematics, or a related quantitative field... ...-paced environment. Exemplifies high performance, integrity, and partnership.... ...Designs and develops machine learning (ML) and deep learning systems. Runs machine...PerformanceFull timeContract workPart timeShift work$171k - $225k per year
...people learn from the world's best. This is not a side initiative. It is the direction of the company. We are looking for a Staff ML Engineer to join our AI engineering team and help define and deliver these products. This role is ultimately about delivery. You will be...Local areaRemote workFlexible hours$55.55 - $61.93 per hour
Our client, a leading organization in the financial technology space, is seeking a highly experienced Load and Performance Lead Engineer to join their team. In this pivotal role, you will drive advanced performance testing capabilities for complex, high-throughput applications...PerformanceLong term contractFull timeContract workRemote work3 days per week$103k - $135k per year
...business-oriented Machine Learning / AI Engineer with a passion for building and scaling... ...hands-on engineer with deep experience in AI/ML engineering and AI/ML engineering... ...monitoring. Implement model monitoring, performance tuning, drift detection, and retraining strategies...PerformanceFull timeInternship$13k - $15.6k per year
...Most Innovative Companies, and Forbes World’s Best Bank. Visit our institutional page About the role Senior System Engineer - Systems Performance Team The Systems Performance team is part of the Computing Squad (Foundation / Runtime Platforms). You will be part of a team...PerformanceLong term contractRemote workWork from homeRelocation packageFlexible hours- ...looking for a Machine Learning Data Engineer to enable data and ML capabilities that directly power the... ...experimental approaches into reliable, high-performing systems Developing and deploying... ..., reliability, governance, and performance at scale while enabling hyper-...PerformanceFull timeLocal areaFlexible hours
$88k - $132k per year
...’re empowered to do your best work. We are looking for an ML Engineer who can design, build, and integrate AI-powered systems by combining... ...systems are production-ready—scalable, reliable, secure, and performant Continuously improve system performance through testing,...PerformanceFull timeWork at office- ...Job Description: Role: AWS ML Engineer Location: Toronto Office Hybrid: 2 days a week in office Primary Skills: (AWS ML... ...with applications using APIs and cloud services Optimize model performance, scalability, and cost on AWS infrastructure Collaborate with...PerformanceContract workWork at office2 days per week
$146k - $201.3k per year
...opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. The Engineering Opportunity Reporting to the Director of Quality & Performance, this role as Engineering Manager of Performance & Resilience will drive performance and...PerformanceLocal areaWorldwide- ...AI/ML Engineer (GenAI, LLM, Java) Location: Toronto, ON Experience Required: 5+ Role Overview We are seeking a highly skilled... ...requirements Optimize AI workflows for: Scalability Performance Cost-efficiency Leverage Oracle databases for data...PerformanceContract work
$80k - $130k per year
Performance Quality Engineering Consultant Position Description This role is hybrid and requires you to be at our client's and/or downtown... ...detailed performance test reports, dashboards, and performance benchmarking metrics. • Support root cause analysis and...Performance2 days per week- ...Senior QA Performance Tester Location: Toronto, ON Work Model: Hybrid (3 Days WFO) Duration: 6–12 Months Mandatory... ...validation. Ensure applications meet defined performance benchmarks, scalability targets, and reliability standards before...PerformanceContract workShift work
- ...QA Engineer – Oracle, ETL & Performance Testing Role Description QA Engineer role with 8–10 years of experience in Oracle, SQL, and performance testing tools. Strong expertise in SQL (complex queries) and Oracle databases with performance optimization skills....PerformanceContract work
- ...ROLE: SOCIAL PERFORMANCE ANALYST TEAM: THE KITCHEN NORTH AMERICA LOCATION: TORONTO (HYBRID) COMPANY OVERVIEW: The Kitchen brings... ...-based) ~ Partner with strategy teams to define KPIs, benchmarks, and measurement frameworks that reflect real success INNOVATION...PerformanceFull timeLive InShift work
- ...15 new stations and related works being performed by three key P3 Contractors and multiple... ...Consultant (PCSC). Position Summary The Performance Manager, reporting to the Production... ...systems to measure performance of engineering and construction projects. Experience...PerformanceLong term contractFull timeContract workFor contractorsWork at officeLocal areaRemote workRelocation
$76k - $90k per year
...powered StackAdapt Marketing Platform seamlessly connects brand and performance marketing to drive measurable results across the entire... ...productization. Define and track performance OKRs (e.g., data benchmarks, measurement accuracy, lower-funnel KPIs) and communicate strategic...PerformanceLocal areaRemote workWork from homeHome office- ...research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo ,... ...and asynchronously to meet deadlines and improve AI model performance . Qualifications Must-Have ~2+ years of experience...PerformanceRemote jobHourly payWeekly payContract workFor contractorsSummer work
- ...just encouraged it's expected. The Role As one of our Principal ML Engineer’s, you'll be a key technical leader and thought leader, shaping our ML strategy and building intelligent, high-performance multi-agent systems that perceive, learn, and act in real time. What...PerformanceShift work
- ..., and interact with the physical world? We're looking for Robotics ML Experts in Canada's vibrant AI research hub to design, build, and refine MuJoCo simulation environments that train AI systems to perform real-world tasks — from locomotion and dexterous manipulation to complex...Hourly payOngoing contractContract workFreelanceRemote workFlexible hours
$92k - $142k per year
...of automation tools, data pipelines, and ML models, empowering users to effectively and... ...that will help improve operations and performance across our business. Partner with executive... ...with a high-performing group of Engineers, Data Scientists, and Analysts acting as...PerformanceFull timeLocal area$155k - $213k per year
...learn more visit: As a Senior Software Engineer embedded within our Autonomy & Algorithms... .... You will... Work across the ML development lifecycle, including dataset... ...provide a holistic understanding of model performance and enable the discovery of interesting scenarios...PerformanceRemote jobFull timeWork at officeWork from homeFlexible hours$72.12k per year
...through outstanding undergraduate and graduate education programs, cutting-edge research and the delivery of sport, recreation and high performance athletic opportunities for students, staff, faculty and community members across the three campuses. In achieving this vision, the...PerformanceFull timeCasual workDay shiftAfternoon shift$90 per hour
...in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel ,... ...Dorsey . Position: Machine Learning Engineer Expert Type: Contract Compensation... ...strategies, and evaluation metrics. Perform exploratory data analysis, feature...PerformanceRemote jobContract workSummer workusd60k - usd80k per year
We're looking for a sharp, strategic Performance Marketing Specialist- Google Ads to manage paid... ...marketers, Google Ads specialists, and engineers Conduct regular keyword research to... ...Analytics An understanding of search engine optimization (SEO) and search engine...PerformanceFull timeInternshipLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Performance Benchmarking Engineer. Be the first to apply!
- machine learning engineer Toronto, ON
- junior machine learning developer Toronto, ON
- acting performance Toronto, ON
- performance engineer Toronto, ON
- performance testing Toronto, ON
- building performance specialist Toronto, ON
- intern quantum machine learning for quantum computing Toronto, ON
- machine learning researcher Toronto, ON
- machine learning Toronto, ON
- software engineer - ai machine learning Toronto, ON
