Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Performance Benchmarking Engineer

Cerebras Systems

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.  

Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups.  OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. 

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

About The Role

The  Inference Core Platform group is at the heart of Cerebras' mission to deliver the world’s fastest AI inference. Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE). We are responsible for the full stack—from model compilation and scheduling down to custom hardware kernels and driver development.

The ML Performance Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most advanced computing systems ever built. We drive the bring-up of core inference capabilities and deliver performance improvements at every stage of development – from early prototyping to production deployment.

We're looking for passionate engineers to join us in redefining the limits of AI inference. If you thrive on building systems that measure, analyze, and optimize performance at scale, this is your opportunity to make a transformative impact on the future of AI.

Scope of the team includes:

  • Core Inference Observability – Design and implement end-to-end telemetry systems across the software stack, providing deep visibility into inference performance and enabling rapid iteration before and after deployment.
  • Benchmarking Infrastructure – Architect, build, and scale the automation that generates, analyzes, and visualizes performance data used to inform business decisions across engineering and leadership.
  • Performance Analysis – Dive deep into system behavior, dissect performance bottlenecks, and deliver actionable insights that directly influence which features ship and how they evolve.
  • Feature Integration – Partner closely with Core Platform teams to define rigorous testing methodologies that validate inference features for peak performance.

Skills & Qualifications

  • Bachelor’s or Master’s degree in Computer Engineering, Systems Engineering, or a related field.
  • Proficiency in Python and/or C++ programming.
  • Proven experience in building and scaling automated infrastructure.
  • Strong background in throughput and performance optimization techniques, especially in complex, large-scale systems.
  • Excellent problem-solving skills and a strong analytical mindset.
  • Demonstrated ability to dive deep into new domains.
  • Ability to work in a fast-paced, ambiguous, and collaborative environment.

Preferred Skills & Qualifications

  • Familiarity with problem-solving at the intersection of hardware and software.
  • Hands-on experience with AI workloads and architectures is a plus.

Location

  • On-site or hybrid at our Toronto office

 

#LI-WA1

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection  point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  1. Build a breakthrough AI platform beyond the constraints of the GPU.
  2. Publish and open source their cutting-edge AI research.
  3. Work on one of the fastest AI supercomputers in the world.
  4. Enjoy job stability with startup vitality.
  5. Our simple, non-corporate work culture that respects individual beliefs.

Read our blog:  Five Reasons to Join Cerebras in 2026.

Apply today and become part of the forefront of groundbreaking advancements in AI!

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer.  We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Vacancy posted 10 hours ago
Similar jobs that could be interesting for youBased on the ML Performance Benchmarking Engineer in Toronto, ON vacancy
  •  ...learning users to effortlessly run large-scale ML applications, without the hassle of...  ...About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our...  ...and powerful x86 machines, has set new benchmarks in high-performance ML training and inference... 
    Performance
    Local area

    Cerebras Systems

    Toronto, ON
    10 hours ago
  • $110k - $150k per year

     ...you the space to grow.   About the Role We are seeking ML/AI Engineers to contribute to major projects.  The ML / AI Engineer design...  ...automated deployment and rollback Monitor AI systems for performance, drift, bias, and reliability in production Optimize compute... 
    Performance
    Permanent employment
    Full time
    Remote work
    Flexible hours

    Levio

    Toronto, ON
    10 hours ago
  •  ...learning users to effortlessly run large-scale ML applications, without the hassle of...  ...will prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers...  ...Key Responsibilities Prototype and benchmark cutting-edge ideas: new attentions, MoE,... 
    Performance

    Cerebras Systems

    Toronto, ON
    10 hours ago
  • $101k - $169k per year

     ...clients’ Generative AI offerings, including testing, evaluating and benchmarking different LLMs and toolchains; Significant experience and...  ...when it comes to the salaries of our people. We regularly benchmark across a variety of positions, industries, sectors, targets, and... 
    Suggested
    Permanent employment
    Flexible hours

    Deloitte

    Toronto, ON
    2 hours ago
  • $188.2k - $268.9k per year

     ...We are seeking a Director of Machine Learning Engineering and Infrastructure to lead a hybrid team bridging advanced ML engineering with world-class infrastructure design...  ...deliver both foundational ML systems and high-performance distributed services. This is a hybrid role... 
    Performance
    Long term contract
    Remplacement
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    Tubi - Canada

    Toronto, ON
    10 hours ago
  • ML Engineer (BFSI) Job Title: Machine Learning Engineer Position Overview: The ML Engineer will develop, deploy, and optimize machine...  .... Deploy production-grade ML solutions. Optimize model performance and scalability. Collaborate with data scientists and... 
    Performance

    NavitasPartners

    Toronto, ON
    8 days ago
  • $103.2k - $192k per year

     ...Master's or Ph.D. in Computer Science, Engineering, Mathematics, or a related quantitative field...  ...-paced environment. Exemplifies high performance, integrity, and partnership....  ...Designs and develops machine learning (ML) and deep learning systems. Runs machine... 
    Performance
    Full time
    Contract work
    Part time
    Shift work
    Toronto, ON
    9 days ago
  • $171k - $225k per year

     ...people learn from the world's best. This is not a side initiative. It is the direction of the company. We are looking for a Staff ML Engineer to join our AI engineering team and help define and deliver these products. This role is ultimately about delivery. You will be... 
    Local area
    Remote work
    Flexible hours

    MasterClass

    Toronto, ON
    10 hours ago
  • $55.55 - $61.93 per hour

    Our client, a leading organization in the financial technology space, is seeking a highly experienced Load and Performance Lead Engineer to join their team. In this pivotal role, you will drive advanced performance testing capabilities for complex, high-throughput applications... 
    Performance
    Long term contract
    Full time
    Contract work
    Remote work
    3 days per week

    Randstad

    Toronto, ON
    11 days ago
  • $103k - $135k per year

     ...business-oriented Machine Learning / AI Engineer with a passion for building and scaling...  ...hands-on engineer with deep experience in AI/ML engineering and AI/ML engineering...  ...monitoring. Implement model monitoring, performance tuning, drift detection, and retraining strategies... 
    Performance
    Full time
    Internship
    Toronto, ON
    3 days ago
  • $13k - $15.6k per year

     ...Most Innovative Companies, and Forbes World’s Best Bank. Visit our institutional page  About the role Senior System Engineer - Systems Performance Team The Systems Performance team is part of the Computing Squad (Foundation / Runtime Platforms). You will be part of a team... 
    Performance
    Long term contract
    Remote work
    Work from home
    Relocation package
    Flexible hours

    Nubank

    Toronto, ON
    10 hours ago
  •  ...looking for a Machine Learning Data Engineer to enable data and ML capabilities that directly power the...  ...experimental approaches into reliable, high-performing systems Developing and deploying...  ..., reliability, governance, and performance at scale while enabling hyper-... 
    Performance
    Full time
    Local area
    Flexible hours

    Royal Bank of Canada

    Toronto, ON
    a month ago
  • $88k - $132k per year

     ...’re empowered to do your best work. We are looking for an ML Engineer who can design, build, and integrate AI-powered systems by combining...  ...systems are production-ready—scalable, reliable, secure, and performant Continuously improve system performance through testing,... 
    Performance
    Full time
    Work at office

    Equinix

    Toronto, ON
    a month ago
  •  ...Job Description: Role: AWS ML Engineer Location: Toronto Office Hybrid: 2 days a week in office  Primary Skills: (AWS ML...  ...with applications using APIs and cloud services Optimize model performance, scalability, and cost on AWS infrastructure Collaborate with... 
    Performance
    Contract work
    Work at office
    2 days per week

    Astra North Infoteck Inc.

    Toronto, ON
    a month ago
  • $146k - $201.3k per year

     ...opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. The Engineering Opportunity Reporting to the Director of Quality & Performance, this role as Engineering Manager of Performance & Resilience will drive performance and... 
    Performance
    Local area
    Worldwide

    Okta

    Toronto, ON
    10 hours ago
  •  ...AI/ML Engineer (GenAI, LLM, Java) Location: Toronto, ON Experience Required: 5+ Role Overview We are seeking a highly skilled...  ...requirements Optimize AI workflows for: Scalability Performance Cost-efficiency Leverage Oracle databases for data... 
    Performance
    Contract work

    Astra North Infoteck Inc.

    Toronto, ON
    17 days ago
  • $80k - $130k per year

    Performance Quality Engineering Consultant Position Description This role is hybrid and requires you to be at our client's and/or downtown...  ...detailed performance test reports, dashboards, and performance benchmarking metrics. • Support root cause analysis and... 
    Performance
    2 days per week
    Toronto, ON
    a month ago
  •  ...Senior QA Performance Tester Location: Toronto, ON Work Model: Hybrid (3 Days WFO) Duration: 6–12 Months Mandatory...  ...validation. Ensure applications meet defined performance benchmarks, scalability targets, and reliability standards before... 
    Performance
    Contract work
    Shift work

    Astra North Infoteck Inc.

    Toronto, ON
    14 days ago
  •  ...QA Engineer – Oracle, ETL & Performance Testing Role Description QA Engineer role with 8–10 years of experience in Oracle, SQL, and performance testing tools. Strong expertise in SQL (complex queries) and Oracle databases with performance optimization skills.... 
    Performance
    Contract work

    Astra North Infoteck Inc.

    Toronto, ON
    16 days ago
  •  ...ROLE: SOCIAL PERFORMANCE ANALYST TEAM: THE KITCHEN NORTH AMERICA LOCATION: TORONTO (HYBRID) COMPANY OVERVIEW:  The Kitchen brings...  ...-based)   ~ Partner with strategy teams to define  KPIs, benchmarks, and measurement frameworks that reflect real success   INNOVATION... 
    Performance
    Full time
    Live In
    Shift work

    SALT XC

    Toronto, ON
    10 hours ago
  •  ...15 new stations and related works being performed by three key P3 Contractors and multiple...  ...Consultant (PCSC). Position Summary The Performance Manager, reporting to the Production...  ...systems to measure performance of engineering and construction projects. Experience... 
    Performance
    Long term contract
    Full time
    Contract work
    For contractors
    Work at office
    Local area
    Remote work
    Relocation

    Bechtel

    Toronto, ON
    9 days ago
  • $76k - $90k per year

     ...powered StackAdapt Marketing Platform seamlessly connects brand and performance marketing to drive measurable results across the entire...  ...productization.  Define and track performance OKRs (e.g., data benchmarks, measurement accuracy, lower-funnel KPIs) and communicate strategic... 
    Performance
    Local area
    Remote work
    Work from home
    Home office

    StackAdapt

    Toronto, ON
    10 hours ago
  •  ...research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo ,...  ...and asynchronously to meet deadlines and improve AI model performance . Qualifications Must-Have ~2+ years of experience... 
    Performance
    Remote job
    Hourly pay
    Weekly pay
    Contract work
    For contractors
    Summer work

    Mercor

    Toronto, ON
    16 days ago
  •  ...just encouraged it's expected. The Role As one of our Principal ML Engineer’s, you'll be a key technical leader and thought leader, shaping our ML strategy and building intelligent, high-performance multi-agent systems that perceive, learn, and act in real time. What... 
    Performance
    Shift work

    C-Serv

    Toronto, ON
    14 days ago
  •  ..., and interact with the physical world? We're looking for Robotics ML Experts in Canada's vibrant AI research hub to design, build, and refine MuJoCo simulation environments that train AI systems to perform real-world tasks — from locomotion and dexterous manipulation to complex... 
    Hourly pay
    Ongoing contract
    Contract work
    Freelance
    Remote work
    Flexible hours

    Alignerr

    Toronto, ON
    24 days ago
  • $92k - $142k per year

     ...of automation tools, data pipelines, and ML models, empowering users to effectively and...  ...that will help improve operations and performance across our business. Partner with executive...  ...with a high-performing group of Engineers, Data Scientists, and Analysts acting as... 
    Performance
    Full time
    Local area

    Zynga

    Toronto, ON
    10 hours ago
  • $155k - $213k per year

     ...learn more visit: As a Senior Software Engineer embedded within our Autonomy & Algorithms...  .... You will... Work across the ML development lifecycle, including dataset...  ...provide a holistic understanding of model performance and enable the discovery of interesting scenarios... 
    Performance
    Remote job
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    Toronto, ON
    more than 2 months ago
  • $72.12k per year

     ...through outstanding undergraduate and graduate education programs, cutting-edge research and the delivery of sport, recreation and high performance athletic opportunities for students, staff, faculty and community members across the three campuses. In achieving this vision, the... 
    Performance
    Full time
    Casual work
    Day shift
    Afternoon shift

    University of Toronto

    Toronto, ON
    21 hours ago
  • $90 per hour

     ...in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel ,...  ...Dorsey . Position: Machine Learning Engineer Expert Type: Contract Compensation...  ...strategies, and evaluation metrics. Perform exploratory data analysis, feature... 
    Performance
    Remote job
    Contract work
    Summer work

    Mercor

    Toronto, ON
    2 days ago
  • usd60k - usd80k per year

    We're looking for a sharp, strategic Performance Marketing Specialist- Google Ads to manage paid...  ...marketers, Google Ads specialists, and engineers Conduct regular keyword research to...  ...Analytics An understanding of search engine optimization (SEO) and search engine... 
    Performance
    Full time
    Internship
    Local area

    Silk & Snow

    Toronto, ON
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Performance Benchmarking Engineer. Be the first to apply!