Senior Research Engineer, Model Evaluation

Cohere

Who are we?

Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

Why this role?

Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models' performance on frontier capabilities. In this role, you are responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.

As a Senior Research Engineer, Model Evaluation, you will:

  • Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities

  • Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets

  • Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO

  • Learn from and work with the best researchers and engineers in the field

You may be a good fit if:

  • You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)

  • You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.

  • You have deep experience building with and around LLMs, and you have built tools for analyzing and understanding their performance

  • You have strong software engineering skills

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates who want the same thing, Cohere is the place for you.

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

  • An open and inclusive culture and work environment

  • Work closely with a team on the cutting edge of AI research

  • Weekly lunch stipend, in-office lunches & snacks

  • Full health and dental benefits, including a separate budget to take care of your mental health

  • 100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK

  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

  • Remote-flexible, with offices in Toronto, New York, San Francisco, and London, plus a co-working stipend

  • 6 weeks of vacation

Note: This post is co-authored by both Cohere humans and Cohere technology.

Vacancy posted more than 2 months ago