Senior Research Engineer, Model Evaluation
Cohere
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Join us on our mission and shape the future!
Why this role?
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models' performance on frontier capabilities. In this role, you are responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.
As a Senior Research Engineer, Model Evaluation, you will:
Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities
Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets
Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO
Learn from and work with the best researchers and engineers in the field
You may be a good fit if:
You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)
You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.
You have deep experience building with and around LLMs, and you have built tools for analyzing and understanding their performance
You have strong software engineering skills
If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates that want the same thing, Cohere is the place for you.
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form , and we will work together to meet your needs.
Full-Time Employees at Cohere enjoy these Perks:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend
✈️ 6 weeks of vacation
Note: This post is co-authored by both Cohere humans and Cohere technology.
$155k - $269k per year
..., scalable, controllable, and efficient simulation. As a Research Scientist in World Models, you will develop algorithms and productionize the next generation... ...data of driving scenes. Collaborate with simulation engineers to integrate models into large-scale, distributed...SuggestedRemote jobFull timeWork at officeWork from homeFlexible hours$155k - $213k per year
...datasets essential for training and evaluating our online mapping system.... ...vehicles. Champion engineering excellence, ensuring high-quality... ...Contribute to the broader research community by publishing findings... ...machine learning features/models into production. Previous...SuggestedRemote jobFull timeWork at officeWork from homeFlexible hoursShift work$185k - $225k per year
...Gauntlet leads the field in quantitative research and optimization of DeFi economics. We manage... ...-edge research that informs our risk models, alerts, and analysis, and is among the most... ...and code reviews, maintaining high engineering standards. Leverage AI-assisted development...SeniorFull timeContract workWork at officeRemote workWork from home- ...by Greylock, Y Combinator, and other top investors. As a Senior Backend Engineer on the App Systems team, you'll be a high-ownership... ...relational and NoSQL databases, SQL, and thoughtful database modeling ~ You're familiar with building and observing applications...SeniorFull timeRemote workWork from homeFlexible hours
- ...around the world. THE ROLE As a Senior Software Engineer II on the Khan Academy Kids team, you... ...design, query optimization, and data modeling for scalable applications. Demonstrated... ...our users. We leverage user insights, research, and experience to build content,...SeniorLong term contractFull timeInternshipRemote workWorldwide
$248.53k - $288.1k per year
...place to chat, explore and build with a wide variety of AI language models (bots), including GPT-5.4, Claude-Opus-4.6, Gemini-3.1-Pro, Nano... ...on our Quora product. About the Team and Role: Our small engineering team works on challenging problems every day. We have a culture...SeniorFull timeRemote workMonday to fridayFlexible hours- ...intelligence, and over 15 years of cutting-edge research at top universities, Keebo reduces... .... About the Opportunity As a Senior Machine Learning Engineer on our Algorithms team, you will... ...ML engineers and data engineers to evaluate algorithmic and business problems and...SeniorRemote jobFull timeInternshipLocal areaWorldwideHome office
$201k - $251k per year
...you will: Partner with data science & engineering teams to design and deploy ML & Gen AI... ...~ SQL, dbt, Python ~ OLAP / OLTP data modelling and architecture ~ Key-value stores: Redis... ...qualify it as an AEDT. As part of the evaluation process we provide Covey with job...SeniorRemote job$167k - $208k per year
...embodies the elegance of simplicity in engineering, transforming the demanding task of controlling... ...is growing and we’re looking to hire senior backend engineers . We’re a team of... ...may qualify it as an AEDT. As part of the evaluation process we provide Covey with job...SeniorRemote job- ...intelligence, and over 15 years of cutting-edge research at top universities, Keebo reduces... ...customers worldwide. As a Senior Software Engineer on our AI team, you will bring your expertise... ...data quality, and implementing data modeling for use by ML/analytical models...SeniorRemote jobFull timeInternshipLocal areaWorldwideHome office
$239k - $299k per year
...systems to make it happen. Your job as an engineering manager at Mercury is to make our... ...role, you will: Lead a team of 4 to 8 senior engineers to deliver high-availability banking... ...qualify it as an AEDT. As part of the evaluation process we provide Covey with job...SeniorRemote job$150k - $180k per year
...most complete and connected ecosystem in senior living. Founded by Michael Wang, a former... ...augments and empowers human care. As a Software Engineer on our Intelligence & Integrations team,... ...leadership to evolve our core data models and APIs, transitioning from our current point...SeniorRemote jobFlexible hours$166k - $195k per year
..., CNBC Disruptor 50 , and TIME Magazine's 100 Most Influential Companies . About the Role We are seeking a Systems Designer-Engineer motivated by the opportunity to learn from an exceptional team and define how designers and engineers work together to deliver exceptional...SeniorRemote jobLong term contractFull timeWork from homeHome officeRelocation packageFlexible hours$155k - $213k per year
...positive way. To learn more visit: As a Senior Software Engineer embedded within our Autonomy &... ...develop data pipelines needed to train and evaluate Waabi’s autonomous platform, enabling our... ...to provide a holistic understanding of model performance and enable the discovery of...SeniorRemote jobFull timeWork at officeWork from homeFlexible hours$120k - $130k per year
...The Role: Code at the Speed of Thought We are looking for a Senior Engineer who is 100% all-in on AI-native workflows. If you believe in... ...AI Sandbox: We encourage you to experiment with the newest models and tools. If a new agentic tool comes out that saves time, we...SeniorFull timeLive InRemote work$201k - $251k per year
...Roman engineers built aqueducts that quietly carried water across cities, sustaining empires... ...business today and for years to come. As a Senior Engineer on this team, you won’t just... ...qualify it as an AEDT. As part of the evaluation process we provide Covey with job requirements...SeniorRemote jobLong term contractBank staffImmediate start$210k - $365k per year
...Foundation (SDF) is expanding its scientific and technological research efforts and looking for someone to help guide that growth with purpose... ...(e.g., tenured or tenure-track faculty) or industry (e.g., senior research scientist or director-level role). ~ Strong track record...SeniorRemote jobLong term contractTemporary workInternshipWork at officeLocal areaWorldwideFlexible hours- ...amazing work! The Invert team is comprised of creative and talented engineers, data scientists, biologists, and more, and we are supported by... ...: GitHub, Linear, Slack, Notion The role Mission As a Senior Software Engineer, you will ensure that your team efficiently...SeniorRemote job
$160k - $190k per year
...payment fraud, account takeovers, and social engineering scams. We have raised $75M from world-... ...We are seeking a highly skilled Senior Software Engineer to lead the development... ...Learning Integration : Apply machine learning models where appropriate to enhance device recognition...SeniorRemote jobWorldwideHome officeFlexible hours$170k - $230k per year
...meetings, and improve efficiency within their inbound pipeline motion. Overview We're looking for a deeply experienced Senior Software Engineer to join our dynamic Product Team. This is a foundational role where you will be at the forefront of integrating AI technologies...SeniorRemote jobInternship- ...developing applications powered by our cutting-edge AI research. As a Data Infrastructure Engineer, you will lead the development of fundamental data... ...applications Storage and computation efficiency AI model evaluation and productionization infrastructure Collaborate...Remote job
- ...efficiency. Its team of skilled Data Engineers, Data Scientists, Machine Learning (... ...and Responsibilities: The Senior Machine Learning Consultant at OneSix... ...most pressing problems. ~ Conduct research and development on AI/ML models to solve complex technical challenges...SeniorRemote jobLive InLocal areaHome officeFlexible hours
$100k per year
...to unify innovations in software models, compilers, platforms, networking... ...looking for contributors of all seniorities. As our TT-Distributed Software Engineer, you will develop and optimize distributed... .... Collaborate closely with AI researchers and hardware engineers to...Permanent employment- ...the top. This person would help add more structure to our current engineering team. Why join us? In addition to a family type atmosphere... ...an existing suspension bridge ~ Develop a 3D finite element model for an existing long-span suspension bridge based on the design...Long term contractFull timeFor contractors
- ...Software Engineer – AI-Powered Debt Management Platform (Remote) Join a team building the infrastructure... ...testing Develop and evolve data models in a document database environment (... ...cycles Logistics Level: Mid-level to Senior Location: Fully remote (U.S. time zones...SeniorFull timeLocal areaRemote work
$158k - $269k per year
...positive way. To learn more visit: You will… Collaborating with research scientists during algorithm design to ensure code is efficiently... ...from inception Identify and communicate best practices for model development Reformulate performance bottlenecks Develop a framework...Remote jobFull timePnpWork at officeWork from homeFlexible hours$170k - $230k per year
...Overview We are looking for an experienced Engineering Manager to lead our AI Core & Product... ...features. You'll partner closely with senior technical leaders and help drive execution... ...design and implementation of robust data models to support RAG models' training and inference...SeniorRemote jobInternship$160k - $190k per year
...within their inbound pipeline motion. As a Senior Product Manager, you’ll lead efforts to... ...customers and analyzing metrics, you’ll evaluate new opportunities and define strategic... ...collaborate closely with product designers and engineers to plan and execute projects. And you’ll...SeniorRemote jobLong term contract$150k - $300k per year
...in project-based units of work. We’re looking for a Software Engineer with senior level experience to take ownership over the technical direction... ...customers, and commuting ~ Paid parental leave ~ Hybrid work model with catered lunches everyday (M-F), snacks, and beverages in...SeniorRemote job- ...As a Senior Software Engineer, you'll help shape the technical direction of our platform and bring to life features that are as scalable as they are intuitive. We’re tackling complex backend challenges, building APIs that serve both performance and elegance, and crafting systems...SeniorLong term contractFull timeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Research Engineer, Model Evaluation. Be the first to apply!
- deep learning research engineer United States
- research engineer United States
- model agency United States
- senior engineering manager United States
- senior mechanical designer United States
- public senior accountant United States
- senior web developer United States
- senior accountant remote United States
- senior engineer United States
- senior sales manager United States
