Senior Research Engineer, Model Evaluation
Cohere
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Join us on our mission and shape the future!
Why this role?
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models' performance on frontier capabilities. In this role, you are responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.
As a Senior Research Engineer, Model Evaluation, you will:
Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities
Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets
Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO
Learn from and work with the best researchers and engineers in the field
You may be a good fit if:
You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)
You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.
You have deep experience building with and around LLMs, and you have built tools for analyzing and understanding their performance
You have strong software engineering skills
If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates that want the same thing, Cohere is the place for you.
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form , and we will work together to meet your needs.
Full-Time Employees at Cohere enjoy these Perks:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend
✈️ 6 weeks of vacation
Note: This post is co-authored by both Cohere humans and Cohere technology.
$84k - $126k per year
...Job Type: Permanent Work Model: Hybrid Reference code: 133164 Primary Location: Toronto, ON All Available Locations... ...various complex financial analyses including independent derivative evaluation, customer behavior modeling, and new innovations such as Machine...SeniorPermanent employmentFlexible hours$56k - $84k per year
...Job Type: Permanent Work Model: Hybrid Reference code: 133442 Primary Location: Toronto... ...look like? As an Analyst, Consultant, or Senior Consultant focusing on the Insurance practice in our Financial Engineering & Modeling team, you will: Conduct...SeniorPermanent employmentFlexible hours$155k - $269k per year
..., scalable, controllable, and efficient simulation. As a Research Scientist in World Models, you will develop algorithms and productionize the next generation... ...data of driving scenes. Collaborate with simulation engineers to integrate models into large-scale, distributed...SuggestedRemote jobFull timeWork at officeWork from homeFlexible hours$100k - $145k per year
...for technology at Thomson Reuters Labs. We are seeking a Senior Research Engineer who will bring expertise in AI and ML and is interested in... ...Typescript, etc.) #LI-SM2 What’s in it For You? Hybrid Work Model: We’ve adopted a flexible hybrid working environment (2-3...SeniorFull timeWork at officeLocal areaRemote workFlexible hours2 days per week3 days per week$101k - $169k per year
...Job Type: Permanent Work Model: Hybrid Reference code: 133422 Primary Location... ...our exponentially expanding Financial Engineering and Modeling group? Are you up for the challenge... ...,000 (Manager) and $126,000 - $234,000 (Senior Manager), and individuals may be eligible...SeniorPermanent employmentFlexible hours$119k per year
...and enduring. The Opportunity The Senior Specialist, Data Modeler (Microsoft) is responsible for designing... ...governance and architecture Data Engineering, ETL & Platforms Identify data... ..., privacy, and security standards Evaluate vendor deliverables and support production...SeniorPermanent employmentFull timeWork at office$20 per hour
...technical talent with leading AI research labs. Headquartered in San... ...Generate high-quality human evaluation data by identifying response... ...completeness of responses. Ensure model responses align with expected... ..., analytics, linguistics, engineering) Preferred Prior...Remote jobContract workPart timeSummer work$101k - $169k per year
...Job Type: Permanent Work Model: Hybrid Reference code: 133157 Primary Location: Toronto, ON All Available Locations: Toronto, ON Our Purpose At Deloitte, our Purpose is to make an impact that matters. We exist to inspire and help our people,...Permanent employmentManual laborFlexible hours$118k - $162k per year
...s talk. Position Description: As a Senior Research Operations Program Manager at Okta, you... ...collaboration across research, design, product and engineering teams, you will programmatically manage... ...or managing beta and/or early release evaluation programs ~ Experience managing...SeniorLocal areaWorldwide$103.2k - $192k per year
...obtain insights. Designs and constructs new processes for modeling data. Develops predictive models and leverages big data technology to design solutions... ...enterprise-wide level and serves as a specialist resource to senior leaders and stakeholders. Applies expertise and...SeniorContract workTemporary workPart timeShift work$96.6k - $180.6k per year
...Family Group: Customer Solutions Senior Quantitative Researcher - Alpha Research Team Location :... ...research activities through alpha modeling, risk modeling, and optimization techniques... ...integrity; Collaborate with data engineering teams to improve data pipelines and infrastructure...SeniorFull timeContract workPart time- ...talent in the space. We are looking for a Senior Data Scientist with a good blend of data... ..., practical experience in Operation research strategies and Pricing Analytics within supply... ...Optimization. Develop and implement predictive models and optimization algorithms to improve...SeniorFull timeLocal areaRemote work
$140k - $175k per year
...Thomson Reuters Labs. We are seeking a Lead Research Engineer who will bring expertise in AI and ML... ...with research scientists to evaluate, prototype and productionize research concepts... ...technology Familiarity with probabilistic models and have an understanding of the mathematical...Full timeWork at officeLocal areaRemote workFlexible hours2 days per week3 days per week$100 per hour
...technical talent with leading AI research labs. Headquartered in San... ...Role Responsibilities Evaluate AI-generated responses to enhance reasoning and rigor in model outputs . Provide structured... ...application of structured evaluation guidelines. Ability to work...SeniorRemote jobContract workSummer work$50 per hour
...FI client is looking for a UX Researcher to join their team. In this... ...designers, data analysts, and engineering partners to ensure every decision... ..., concept testing, usability evaluation, and post-launch measurement.... .... Gillian Singerman - Senior Solutions Delivery Recruiter...SeniorOngoing contract$155k - $213k per year
...datasets essential for training and evaluating our online mapping system.... ...vehicles. Champion engineering excellence, ensuring high-quality... ...Contribute to the broader research community by publishing findings... ...machine learning features/models into production. Previous...Remote jobFull timeWork at officeWork from homeFlexible hoursShift work$65k - $85k per year
...Canada is part of the SYSTRA group, an international consulting and engineering group, a world leader in the design of transport... ...Manager / Structural Lead Engineer Job Purpose The BIM Modeller – Structural Engineering is responsible for developing, managing...Internship$92k - $121k per year
...replace cars. Could you be the Senior Requirements Engineer we’re looking for? Join us in a full... ...with suppliers and customer. Evaluate risks & validation (including internal... ...necessary verification means, including modelling and simulation means Participating...SeniorFull timeContract workWork at officeWorldwide- ...Sago is a full-service market research company providing comprehensive research solutions to a diverse range of clients. We specialize in delivering actionable insights through innovative methodologies and a commitment to excellence. Our quantitative research division is a...SeniorWork at office
$80 - $150 per hour
...creative and technical talent with leading AI research labs. Headquartered in San Francisco,... ...Position: MS Excel / Google Sheets Evaluator Type: Contract Compensation... ...structured feedback to enhance document evaluation processes. Utilize MS Excel and...Remote jobContract workSummer work$122k - $192k per year
...is seeking an experienced DSM Evaluation Manager - EM&V to join our... ...impact evaluations and providing engineering and analysis support.... ...collection, energy analyses, project research, etc. Oversee primary and... ...recommendations. Support senior staff with development of presentations...SeniorFull timeWork at officeLocal area$105 per hour
...technical talent with leading AI research labs. Headquartered in San... ...across sectors to support AI model training . Build and interpret... ...AI-generated outputs . Evaluate AI-generated outputs for... ...quality. Collaborate with senior experts to refine model...SeniorRemote jobContract workSummer work- ...contribution. Job Description The Senior Associate Director, IT, Data Modelling and Reporting will report to IT... ..., logical, and physical data models for investment accounting, focusing... ...Support: Work closely with Data Engineering team to Map data from varied systems...SeniorFull timeInternshipWork at office
$100k - $130k per year
...manufacturers and provide their process engineers and scientists with a next... ...on an expert in Process Modelling who will be able to make... ...contributions to our process simulation engine. The Process Modeller will be... ...modeling solutions Research, develop and validate...Full timeWork from homeFlexible hours- ...We’re training and deploying frontier models for developers and enterprises who are... ...our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate... ...of Technical Staff in Data Analysis and Evaluation, you will play a pivotal role in ensuring...Full timeWork at officeRemote workFlexible hours
$147k - $245k per year
...apply now. We are currently seeking a Senior AI Engineer - Remote to join our team in Toronto,... ...a strong background in implementing AI models, analyzing and improving existing AI architectures... ...improving existing AI architectures Researching and implementing new AI technologies...SeniorWork at officeRemote workFlexible hours$118k - $157.5k per year
...Learning, Data Science, Software Engineering, Operations, and Big Data... ...We are looking for Applied Researchers to join us and help improve,... ...opportunity to work on cutting-edge research in NLP and deep learning,... ...development of machine learning models, data analysis, preferably...Immediate start- ...alongside iconic brand names. About the Role As a luxury brand evaluator, you will step into the world of luxury to discreetly assess... ...in the luxury market. Compensation • Non-Purchase Evaluations: Earn a fee based on mission complexity. • Purchase-Based Evaluations...Contract workWorldwideFlexible hours
- ...technical talent with leading AI research labs. Headquartered in San... ...Role Responsibilities Evaluate hedge fund strategies related... ...long/short equity to enhance AI model training . Develop and... ...summaries, and memos for AI model evaluation . Predict and analyze...Remote jobHourly payWeekly payContract workFor contractorsSummer work
- ...At RBC Borealis, you’ll be joining a team of leading researchers and software engineering specializing in machine learning. You will have access to... ...We’re looking for an enthusiastic Machine Learning Research Engineer who’s excited by the opportunity of being at the...Full timeLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Research Engineer, Model Evaluation. Be the first to apply!
- deep learning research engineer Toronto, ON
- research engineer Toronto, ON
- mechanical research engineer Toronto, ON
- ingénieur de recherche Toronto, ON
- hair model Toronto, ON
- director quantitative analyst model validation Toronto, ON
- energy modelling Toronto, ON
- hat model Toronto, ON
- clothes model Toronto, ON
- fashion model Toronto, ON
