Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Research Engineer, Model Evaluation

Cohere

Who are we?

Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

Why this role?

Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models' performance on frontier capabilities. In this role, you are responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.

As a Senior Research Engineer, Model Evaluation, you will:

  • Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities

  • Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets

  • Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO

  • Learn from and work with the best researchers and engineers in the field

You may be a good fit if:

  • You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)

  • You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.

  • You have deep experience building with and around LLMs, and you have built tools for analyzing and understanding their performance

  • You have strong software engineering skills

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates that want the same thing, Cohere is the place for you.

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form , and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

An open and inclusive culture and work environment 

‍ Work closely with a team on the cutting edge of AI research 

Weekly lunch stipend, in-office lunches & snacks

Full health and dental benefits, including a separate budget to take care of your mental health 

100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK

Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend

✈️ 6 weeks of vacation

Note: This post is co-authored by both Cohere humans and Cohere technology.

Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Senior Research Engineer, Model Evaluation in Toronto, ON vacancy
  • $84k - $126k per year

     ...Job Type:  Permanent   Work Model:  Hybrid   Reference code: 133164 Primary Location:  Toronto, ON All Available Locations...  ...various complex financial analyses including independent derivative evaluation, customer behavior modeling, and new innovations such as Machine... 
    Senior
    Permanent employment
    Flexible hours

    Deloitte

    Toronto, ON
    17 hours ago
  • $56k - $84k per year

     ...Job Type:  Permanent   Work Model:  Hybrid   Reference code: 133442 Primary Location:  Toronto...  ...look like? As an Analyst, Consultant, or Senior Consultant focusing on the Insurance practice in our Financial Engineering & Modeling team, you will: Conduct... 
    Senior
    Permanent employment
    Flexible hours

    Deloitte

    Toronto, ON
    17 hours ago
  • $155k - $269k per year

     ..., scalable, controllable, and efficient simulation. As a Research Scientist in World Models, you will develop algorithms and productionize the next generation...  ...data of driving scenes. Collaborate with simulation engineers to integrate models into large-scale, distributed... 
    Suggested
    Remote job
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    Toronto, ON
    more than 2 months ago
  • $100k - $145k per year

     ...for technology at Thomson Reuters Labs. We are seeking a Senior Research Engineer who will bring expertise in AI and ML and is interested in...  ...Typescript, etc.)   #LI-SM2 What’s in it For You? Hybrid Work Model: We’ve adopted a flexible hybrid working environment (2-3... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week
    3 days per week

    Thomson Reuters

    Toronto, ON
    1 day ago
  • $101k - $169k per year

     ...Job Type:  Permanent   Work Model:  Hybrid   Reference code: 133422 Primary Location...  ...our exponentially expanding Financial Engineering and Modeling group? Are you up for the challenge...  ...,000 (Manager) and $126,000 - $234,000 (Senior Manager), and individuals may be eligible... 
    Senior
    Permanent employment
    Flexible hours

    Deloitte

    Toronto, ON
    17 hours ago
  • $119k per year

     ...and enduring. The Opportunity The Senior Specialist, Data Modeler (Microsoft) is responsible for designing...  ...governance and architecture Data Engineering, ETL & Platforms Identify data...  ..., privacy, and security standards Evaluate vendor deliverables and support production... 
    Senior
    Permanent employment
    Full time
    Work at office

    Vale Base Metals

    Toronto, ON
    11 days ago
  • $20 per hour

     ...technical talent with leading AI research labs. Headquartered in San...  ...Generate high-quality human evaluation data by identifying response...  ...completeness of responses. Ensure model responses align with expected...  ..., analytics, linguistics, engineering) Preferred Prior... 
    Remote job
    Contract work
    Part time
    Summer work

    Mercor

    Toronto, ON
    1 day ago
  • $101k - $169k per year

     ...Job Type:  Permanent   Work Model:  Hybrid   Reference code: 133157 Primary Location:  Toronto, ON All Available Locations:  Toronto, ON   Our Purpose   At Deloitte, our Purpose is to make an impact that matters. We exist to inspire and help our people,... 
    Permanent employment
    Manual labor
    Flexible hours

    Deloitte

    Toronto, ON
    17 hours ago
  • $118k - $162k per year

     ...s talk. Position Description: As a Senior Research Operations Program Manager at Okta, you...  ...collaboration across research, design, product and engineering teams, you will programmatically manage...  ...or managing beta and/or early release evaluation programs ~ Experience managing... 
    Senior
    Local area
    Worldwide

    Okta

    Toronto, ON
    11 hours ago
  • $103.2k - $192k per year

     ...obtain insights. Designs and constructs new processes for modeling data. Develops predictive models and leverages big data technology to design solutions...  ...enterprise-wide level and serves as a specialist resource to senior leaders and stakeholders. Applies expertise and... 
    Senior
    Contract work
    Temporary work
    Part time
    Shift work
    Toronto, ON
    11 days ago
  • $96.6k - $180.6k per year

     ...Family Group: Customer Solutions Senior Quantitative Researcher - Alpha Research Team Location :...  ...research activities through alpha modeling, risk modeling, and optimization techniques...  ...integrity; Collaborate with data engineering teams to improve data pipelines and infrastructure... 
    Senior
    Full time
    Contract work
    Part time
    Toronto, ON
    11 days ago
  •  ...talent in the space. We are looking for a Senior Data Scientist with a good blend of data...  ..., practical experience in Operation research strategies and Pricing Analytics within supply...  ...Optimization. Develop and implement predictive models and optimization algorithms to improve... 
    Senior
    Full time
    Local area
    Remote work

    Tiger Analytics Inc.

    Toronto, ON
    8 days ago
  • $140k - $175k per year

     ...Thomson Reuters Labs. We are seeking a Lead Research Engineer who will bring expertise in AI and ML...  ...with research scientists to evaluate, prototype and productionize research concepts...  ...technology Familiarity with probabilistic models and have an understanding of the mathematical... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week
    3 days per week

    Thomson Reuters

    Toronto, ON
    1 day ago
  • $100 per hour

     ...technical talent with leading AI research labs. Headquartered in San...  ...Role Responsibilities Evaluate AI-generated responses to enhance reasoning and rigor in model outputs . Provide structured...  ...application of structured evaluation guidelines. Ability to work... 
    Senior
    Remote job
    Contract work
    Summer work

    Mercor

    Toronto, ON
    23 days ago
  • $50 per hour

     ...FI client is looking for a UX Researcher to join their team. In this...  ...designers, data analysts, and engineering partners to ensure every decision...  ..., concept testing, usability evaluation, and post-launch measurement....  .... Gillian Singerman - Senior Solutions Delivery Recruiter... 
    Senior
    Ongoing contract

    Creative Circle

    Toronto, ON
    6 days ago
  • $155k - $213k per year

     ...datasets essential for training and evaluating our online mapping system....  ...vehicles.  Champion engineering excellence, ensuring high-quality...  ...Contribute to the broader research community by publishing findings...  ...machine learning features/models into production. Previous... 
    Remote job
    Full time
    Work at office
    Work from home
    Flexible hours
    Shift work

    Waabi

    Toronto, ON
    more than 2 months ago
  • $65k - $85k per year

     ...Canada is part of the SYSTRA group, an international consulting and engineering group, a world leader in the design of transport...  ...Manager / Structural Lead Engineer Job Purpose The BIM Modeller – Structural Engineering is responsible for developing, managing... 
    Internship
    Toronto, ON
    26 days ago
  • $92k - $121k per year

     ...replace cars.   Could you be the Senior Requirements Engineer we’re looking for? Join us in a full...  ...with suppliers and customer. Evaluate risks & validation (including internal...  ...necessary verification means, including modelling and simulation means Participating... 
    Senior
    Full time
    Contract work
    Work at office
    Worldwide

    Alstom

    Toronto, ON
    17 hours ago
  •  ...Sago is a full-service market research company providing comprehensive research solutions to a diverse range of clients. We specialize in delivering actionable insights through innovative methodologies and a commitment to excellence. Our quantitative research division is a... 
    Senior
    Work at office

    Sago

    Toronto, ON
    27 days ago
  • $80 - $150 per hour

     ...creative and technical talent with leading AI research labs. Headquartered in San Francisco,...  ...Position: MS Excel / Google Sheets Evaluator Type: Contract Compensation...  ...structured feedback to enhance document evaluation processes. Utilize MS Excel and... 
    Remote job
    Contract work
    Summer work

    Mercor

    Toronto, ON
    3 days ago
  • $122k - $192k per year

     ...is seeking an experienced DSM Evaluation Manager - EM&V to join our...  ...impact evaluations and providing engineering and analysis support....  ...collection, energy analyses, project research, etc. Oversee primary and...  ...recommendations. Support senior staff with development of presentations... 
    Senior
    Full time
    Work at office
    Local area

    Resource Innovations

    Toronto, ON
    22 days ago
  • $105 per hour

     ...technical talent with leading AI research labs. Headquartered in San...  ...across sectors to support AI model training . Build and interpret...  ...AI-generated outputs . Evaluate AI-generated outputs for...  ...quality. Collaborate with senior experts to refine model... 
    Senior
    Remote job
    Contract work
    Summer work

    Mercor

    Toronto, ON
    17 days ago
  •  ...contribution.  Job Description The Senior Associate Director, IT, Data Modelling and Reporting will report to IT...  ..., logical, and physical data models for investment accounting, focusing...  ...Support: Work closely with Data Engineering team to Map data from varied systems... 
    Senior
    Full time
    Internship
    Work at office

    MUFG Investor Services

    Toronto, ON
    20 hours ago
  • $100k - $130k per year

     ...manufacturers and provide their process engineers and scientists with a next...  ...on an expert in Process Modelling who will be able to make...  ...contributions to our process simulation engine. The Process Modeller will be...  ...modeling solutions Research, develop and validate... 
    Full time
    Work from home
    Flexible hours

    Basetwo

    Toronto, ON
    22 days ago
  •  ...We’re training and deploying frontier models for developers and enterprises who are...  ...our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate...  ...of Technical Staff in Data Analysis and Evaluation, you will play a pivotal role in ensuring... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    Toronto, ON
    15 days ago
  • $147k - $245k per year

     ...apply now. We are currently seeking a Senior AI Engineer - Remote to join our team in Toronto,...  ...a strong background in implementing AI models, analyzing and improving existing AI architectures...  ...improving existing AI architectures Researching and implementing new AI technologies... 
    Senior
    Work at office
    Remote work
    Flexible hours

    NTT DATA Services

    Toronto, ON
    17 hours ago
  • $118k - $157.5k per year

     ...Learning, Data Science, Software Engineering, Operations, and Big Data...  ...We are looking for Applied Researchers to join us and help improve,...  ...opportunity to work on cutting-edge research in NLP and deep learning,...  ...development of machine learning models, data analysis, preferably... 
    Immediate start

    eBay

    Toronto, ON
    15 days ago
  •  ...alongside iconic brand names. About the Role As a luxury brand evaluator, you will step into the world of luxury to discreetly assess...  ...in the luxury market. Compensation • Non-Purchase Evaluations: Earn a fee based on mission complexity. • Purchase-Based Evaluations... 
    Contract work
    Worldwide
    Flexible hours

    CXG

    Toronto, ON
    15 days ago
  •  ...technical talent with leading AI research labs. Headquartered in San...  ...Role Responsibilities Evaluate hedge fund strategies related...  ...long/short equity to enhance AI model training . Develop and...  ...summaries, and memos for AI model evaluation . Predict and analyze... 
    Remote job
    Hourly pay
    Weekly pay
    Contract work
    For contractors
    Summer work

    Mercor

    Toronto, ON
    1 day ago
  •  ...At RBC Borealis, you’ll be joining a team of leading researchers and software engineering specializing in machine learning. You will have access to...  ...We’re looking for an enthusiastic Machine Learning Research Engineer who’s excited by the opportunity of being at the... 
    Full time
    Local area
    Flexible hours

    Royal Bank of Canada

    Toronto, ON
    a month ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Research Engineer, Model Evaluation. Be the first to apply!