Member of Technical Staff, Model Efficiency
Cohere
Who are we?
Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems.
We’re training and deploying frontier models for enterprises that are building AI systems. We believe that our work is instrumental to the widespread adoption of AI, and we are looking for folks who want to be part of that.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft.
We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!
Why this role?
Our team is a fast-growing group of researchers and engineers focused on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques that improve how models execute in production, driving lower latency, higher throughput, and consistent quality across diverse workloads.
As an engineer on this team, you’ll work across the inference stack to improve core performance metrics by diving deep into model execution, identifying bottlenecks, and developing innovative optimizations. You’ll collaborate closely with modeling and systems teams to experiment, measure, and ship improvements that meaningfully accelerate inference. As the team evolves, you’ll have opportunities to build expertise in advanced performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution strategies for MoE and large-scale architectures.
Please Note: We have offices in Toronto, Montreal, San Francisco, New York, Paris, Seoul and London. We embrace a remote-friendly environment, and as part of this approach, we strategically distribute teams based on interests, expertise, and time zones to promote collaboration and flexibility. You'll find the Model Efficiency team concentrated in the EST and PST time zones; these are our preferred locations.
You may be a good fit for the Model Efficiency team if you have:
- 5+ years of experience writing high-performance, production-quality code
- Strong programming skills in C++ or Python (Rust/Go also welcome)
- Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.)
- Ability to diagnose and resolve performance bottlenecks across the model execution stack
- A strong bias for action — you ship fast, measure impact, and iterate
It’s a big plus if you have experience with:
- GPU programming, CUDA, or low-level systems optimization
- Language modeling with transformers (MoE, speculative decoding, KV-cache optimizations)
- Scaling performance-critical distributed systems (e.g., computation, search, storage)
Full-Time Employees At Cohere Enjoy These Perks
- A weekly lunch stipend of $75/£75 or equivalent in your local currency.
- Full health and dental benefits, including a separate budget for mental health.
- RRSP matching, 401K, Pension Scheme.
- 100% Parental Leave top-up for up to 6 months, for either parent.
- Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.
- Education & learning stipend for conferences, courses, and coaching.
- 6 weeks of paid vacation (30 working days!)
- Budget for traveling to other offices if you are remote, plus an annual company offsite.
How And Where We Work
- Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
- For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.
- For those not near an office: a co-working benefit so you can work alongside others in your city.
- Everyone receives a $500 home office stipend to set up their workspace properly.
If any of the above doesn’t line up exactly with your experience, we still encourage you to apply.
We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider.
