Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Platform Engineer, ML

Full-time, Temporary

mistplay

Mistplay est l'application de fidélité n°1 pour les joueurs mobiles. Notre communauté de millions de joueurs mobiles engagés utilise Mistplay pour découvrir de nouveaux jeux et gagner des récompenses. Les joueurs sont récompensés pour le temps et l'argent qu'ils consacrent aux jeux et peuvent échanger ces récompenses contre des cartes cadeaux. Mistplay a pour mission d'être le meilleur moyen de jouer à des jeux mobiles pour tous, partout dans le monde ! Téléchargez Mistplay sur le Google Play Store  ici et suivez-nous sur  Instagram ,  Twitter et  Facebook .

Veuillez noter : Au Canada , Mistplay suit un modèle hybride de 2 jours/semaine en bureau à Toronto (400 University Ave) & Montréal (1001 Blvd. Robert-Bourassa)

Mistplay is the #1 loyalty app for mobile gamers. Our community of millions of engaged mobile gamers come to Mistplay to discover new games to play and earn rewards. Gamers are rewarded for their time and money spent within the games and can redeem those rewards for gift cards. Mistplay is on a mission to be the best way to play mobile games for everyone everywhere! Download Mistplay on the Google Play Store  here and follow us on  Instagram ,  Twitter and  Facebook .

Please Note: In Canada , Mistplay follows a 2 days/week in-office hybrid model in Toronto (400 University Ave) & Montreal (1001 Blvd. Robert-Bourassa)

English Description is Below ⬇️

 

Rattaché au vice-président de la plateforme de données et d'apprentissage automatique (Data and Machine Learning Platform), l'ingénieur Staff en plateforme ML au sein de l'équipe de données de Mistplay jouera un rôle clé dans la recherche et le développement de solutions d'apprentissage automatique pour résoudre des problèmes commerciaux complexes. L'ingénieur Staff en plateforme ML travaillera en étroite collaboration avec une équipe interfonctionnelle pour identifier les domaines à améliorer, concevoir et mettre en œuvre des solutions évolutives. L'expérience pertinente peut aller de l'infrastructure de travail et des logiciels pour prendre en charge les applications d'apprentissage automatique sur une grande variété de systèmes de recommandation en ligne, de systèmes d'apprentissage par renforcement ou d'autres applications d'apprentissage automatique en ligne.

 

 

Ce que vous ferez

Être le principal moteur et expert pour la conception, la construction et l'exploitation de :

• Solutions d'infrastructure machine et de données pour l'entraînement des modèles.

• Systèmes d'inférence en temps réel pour exploiter et servir des modèles dans un environnement de production en temps réel.

• Capacités de plateforme de fonctionnalités de haute convivialité et précision pour générer, remplir rétrospectivement et stocker des fonctionnalités au niveau de l'utilisateur.

• Couche de service de fonctionnalités à haute précision et faible latence, et solutions de pré-traitement pour prendre en charge le service en ligne des modèles.

• Construire des abstractions de plateforme et des chemins dorés (golden paths) : modèles Airflow DAG, CLI/SDK, dépôts cookie-cutter et pipelines CI/CD qui font passer les modèles des notebooks à la production de manière prévisible.

• Mettre en œuvre l'observabilité de bout en bout : vérifications de la fraîcheur des données/fonctionnalités, portes de dérive/qualité, SLO de performance/latence des modèles, tableaux de bord de santé de l'infrastructure, traçage et alertes, plus réponse aux incidents et analyses post-mortem.

• Collaborer avec la sécurité, SRE et l'ingénierie des données sur les réseaux privés, la politique en tant que code, la gestion des informations personnelles identifiables (PII), la gestion des accès et des identités (IAM) du moindre privilège et les architectures rentables dans tous les environnements.

• Évaluer, intégrer et rationaliser les outils de plateforme (par exemple, registre MLflow, magasins de fonctionnalités, passerelles de service); mener des migrations avec une gestion claire des changements et un temps d'arrêt minimal.

 

Ce que vous apporterez

• 10 ans et plus d'expérience dans la construction et l'exploitation de plateformes ML/de données de qualité production, en mettant l'accent sur le service, la fiabilité et l'expérience développeur.

• Solides compétences en génie logiciel en Python, Go ou Java; expérience dans la création de services résilients, d'API et d'outils d'automatisation avec une couverture de tests élevée.

• Expérience approfondie avec les solutions d'inférence : configuration de point de terminaison, conteneurisation, packaging de modèles, mise à l'échelle automatique (autoscaling), compromis entre sans serveur (serverless) et temps réel, MME, déploiements A/B et canary.

• Expertise des paradigmes de magasin de fonctionnalités en ligne (online feature store) et des solutions de stockage sous-jacentes dans les contextes de service ML.

• Expérience avérée avec Terraform pour la gestion de l'infrastructure ML et de données de bout en bout : modules, espaces de travail, détection de dérive, révisions de changements et restaurations sécurisées (safe rollbacks); familiarité avec les modèles GitOps.

• Orchestration Airflow à grande échelle : modélisation de dépendances, capteurs, nouvelles tentatives, ANS (SLAs), remplissages rétrospectifs (backfills), usines de DAG et intégrations avec les registres, les magasins d'artefacts et les pipelines Terraform.

• Familiarité avec les frameworks ML (scikit-learn, XGBoost, PyTorch, TensorFlow) du point de vue de l'intégration de la plateforme pour prendre en charge divers environnements d'exécution (runtimes) et conteneurs.

• Observabilité pour les flux de travail ML : métriques/journaux/traces, profilage des performances, planification de la capacité, surveillance des coûts et procédures d'exécution (runbooks).

• Excellente communication et collaboration interfonctionnelle avec la Science des Données, l'Ingénierie des Données, le DevOps et le Backend.

 

 

English Description:

 

Reporting to the VP of Data and Machine Learning Platform, the Staff ML Platform Engineer within Mistplay’s Data Team will play a key role in researching and developing machine learning solutions to solve complex business problems. The Staff ML Platform Engineer will work closely with a cross-functional team to identify areas for improvement and design and implement scalable solutions. Relevant experience can range from working infrastructure and software to support machine learning applications on a wide variety of online recommendation systems, reinforcement learning systems or other online machine learning applications. 

 

What you’ll do:

Be the main driver and expert for designing, building, and operating:

• Machine and data infrastructure solutions for training models 

• Real-time inference systems to operate and serve models in a real time production environment. 

• High usability and accuracy feature platform capabilities for generating, backfilling and storing user level features.

• High accuracy low latency feature serving layer and preprocessing solutions to support online serving of the models

• Build platform abstractions and golden paths: Airflow DAG templates, CLI/SDKs, cookie-cutter repos, and CI/CD pipelines that take models from notebooks to production predictably.

• Implement end-to-end observability: data/feature freshness checks, drift/quality gates, model performance/latency SLOs, infra health dashboards, tracing, and alerting—plus incident response and postmortems.

• Partner with Security, SRE, and Data Engineering on private networking, policy-as-code, PII handling, least-privilege IAM, and cost-efficient architectures across environments.

• Evaluate, integrate, and rationalize platform tooling (e.g., MLflow registry, feature stores, serving gateways); lead migrations with clear change management and minimal downtime.

 

What you’ll bring: 

• 10+ years building and operating production-grade ML/data platforms with a focus on serving, reliability, and developer experience.

• Strong software engineering in Python, Go, or Java; experience building resilient services, APIs, and automation tooling with high test coverage.

• Deep experience with inference solutions: endpoint configuration, containerization, model packaging, autoscaling, serverless vs. real-time trade-offs, MME, A/B and canary releases.

• Expertise with online feature store paradigms and underlying storage solutions in ML serving contexts.

• Proven Terraform experience managing ML and data infra end-to-end: modules, workspaces, drift detection, change reviews, and safe rollbacks; familiarity with GitOps patterns.

• Airflow orchestration at scale: dependency modeling, sensors, retries, SLAs, backfills, DAG factories, and integrations with registries, artifact stores, and Terraform pipelines.

• Familiarity with ML frameworks (scikit-learn, XGBoost, PyTorch, TensorFlow) from a platform-integration perspective to support diverse runtimes and containers.

• Observability for ML Workflows: metrics/logs/traces, performance profiling, capacity planning, cost monitoring, and runbooks.

Excellent communication and cross-functional collaboration with Data Science, Data Engineering, DevOps and Backend.

 

Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Principal Platform Engineer, ML in Toronto, ON vacancy
  •  ...RAVL is a boutique technology advisory and engineering firm focused on the financial services industry. Everything we do is centered on helping...  .... We’re growing our engineering team and hiring AI Platform Engineers to build next-generation AI systems across clients. This... 
    Suggested
    Permanent employment
    Full time

    ravl_io

    Toronto, ON
    7 days ago
  •  ...RAVL is a boutique technology advisory and engineering firm focused on the financial services industry. Everything we do is centered on...  ...investments. We’re growing our engineering team and hiring ML Platform Engineers to design and build scalable machine learning platforms... 
    Suggested
    Permanent employment
    Full time
    Immediate start

    ravl_io

    Toronto, ON
    7 days ago
  •  ...Principal Software Engineer – CoCounsel     CoCounsel is our most advanced AI offering to date – combining generative...  ...organization, partnering closely with AI/ML engineers, researchers, product, and our identity and platform teams to deliver world-class legal content... 
    Suggested
    Long term contract
    Remplacement
    Full time
    Work at office
    Local area
    Flexible hours
    2 days per week
    3 days per week

    Thomson Reuters

    Toronto, ON
    13 hours ago
  • $110k - $150k per year

     ...give you the space to grow.   About the Role We are seeking ML/AI Engineers to contribute to major projects.  The ML / AI Engineer...  ...operating ML models in production ~ Experience with cloud ML platforms, especially AWS SageMaker ~ Strong understanding of data pipelines... 
    Suggested
    Permanent employment
    Full time
    Remote work
    Flexible hours

    Levio

    Toronto, ON
    13 hours ago
  • $120k - $155k per year

     ...Fitch Group is currently seeking a Lead/Principal Data Engineer based out of our Toronto office. As...  ...pipelines and solutions on modern cloud-based platforms, including Snowflake, Databricks, and...  ...data infrastructure to support AI/ML workloads, including feature stores, vector... 
    Suggested
    Long term contract
    Temporary work
    Work at office
    Immediate start
    Worldwide
    Shift work
    2 days per week

    Fitch Group

    Toronto, ON
    3 days ago
  • $115k - $125k per year

     ...Job Summary Kinross Gold Corporation is seeking a Power Platform Engineer to serve as the technical backbone of our Power Platform development...  ...-end enterprise systems using Managed Identities, service principals, and certificate-based authentication. Architecture... 
    Long term contract
    Temporary work
    Casual work
    Pnp
    Work at office
    Immediate start
    Remote work
    3 days per week

    Kinross Gold Corporation

    Toronto, ON
    1 day ago
  •  ...And we’d love for you to join us!   About the job - Principal Software Engineer ContactMonkey's platform already runs AI in production - AI-powered template...  ...where we fine-tune smaller ones, and where classical ML beats an LLM. Retrieval and grounding infrastructure... 
    Work at office
    Remote work
    Worldwide
    1 day per week

    ContactMonkey

    Toronto, ON
    13 hours ago
  • $188.2k - $268.9k per year

     ...We are seeking a Director of Machine Learning Engineering and Infrastructure to lead a hybrid team bridging advanced ML engineering with world-class infrastructure design...  ..., including low-latency services, streaming platforms, and large-scale serving. ~ Hands-on experience... 
    Long term contract
    Remplacement
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    Tubi - Canada

    Toronto, ON
    13 hours ago
  •  ...you work fast, flexibly, and collaboratively — without compromising standards — we want to hear from you. We’re looking for an AI/ML Engineer who will develop, optimize, and scale machine learning models that power our next generation of user experiences. Working... 
    Full time
    Worldwide

    USMobile

    Toronto, ON
    7 days ago
  •  ...on a distributed, multi-chiplet hardware platform featuring heterogeneous compute elements such...  ...as in-memory tensor processors, vector engines, and hierarchical memory. Your compiler...  ...cross-functionally with systems architects, ML framework teams, runtime developers,... 
    3 days per week

    d-Matrix

    Toronto, ON
    more than 2 months ago
  • $135k - $150k per year

     ...Kinross, we are modernizing and transforming our global technology platforms to enable secure, scalable, and resilient operations across...  ...We are seeking a highly technical and hands-on Senior Platform Engineer to drive the evolution of our hybrid infrastructure ecosystem spanning... 
    Long term contract
    Temporary work
    Casual work
    Local area
    Immediate start
    Remote work

    Kinross Gold Corporation

    Toronto, ON
    1 day ago
  • $135k - $145k per year

     ...The Opportunity We’re looking to fill an opening for a Principal Network Engineer to join our Technology Ops & Support Partners team. Reporting...  ...with cloud, security, and application teams on scalable platform architecture that aligns with enterprise platform and... 
    Long term contract
    Full time
    Internship

    Aviso Wealth

    Toronto, ON
    1 day ago
  • $180k - $220k per year

     ...Xello is looking for a Principal Engineer This role is a remote role, looking for candidates within Canada only, working in Eastern Time...  ...transformative technical journey. Your expertise won’t just elevate our platform, it will profoundly impact the futures of students. You will... 
    Long term contract
    Full time
    Remote work
    Flexible hours

    Xello

    Toronto, ON
    4 days ago
  • $171k - $225k per year

     ...Who we are: MasterClass is the streaming platform where the world’s best come together so anyone, anywhere, can access and be inspired...  ...is the direction of the company. We are looking for a Staff ML Engineer to join our AI engineering team and help define and deliver... 
    Local area
    Remote work
    Flexible hours

    MasterClass

    Toronto, ON
    16 days ago
  •  ...them, and to create a place where our people are proud to Build. Better. As a Principal Data Engineering leader at RAVL, you will design, define, and elevate enterprise-grade data and AI platforms that are secure, scalable, and high-performance. Operating as a Community of... 
    Permanent employment
    Full time

    ravl_io

    Toronto, ON
    7 days ago
  •  ...Principal Software Engineer The global capital markets are among the largest markets in the world valued...  ...collaborate with a talented team of AI/ML PhDs, legal SMEs, and market...  ...influence the technical direction of our platform and decide our velocity of making impacts... 
    Long term contract
    Full time
    Work at office
    Local area
    Flexible hours
    2 days per week
    3 days per week

    Thomson Reuters

    Toronto, ON
    13 hours ago
  •  ...any endpoint, anywhere in the world. We engineer the end-to-end device experience—from our...  ...at the intersection of deep client-side platform engineering and massive-scale distributed...  ...the Okta Engineering Blog .   The Principal Software Engineer Opportunity We seek... 
    Long term contract
    Local area
    Remote work
    Worldwide

    Okta

    Toronto, ON
    14 days ago
  •  ...Identity Governance (OIG) organization is looking for a Principal Engineer to join our team — OIG is Okta’s Identity...  ...access management, or governance (IAM/IGA) products or platforms Familiarity with workflow engines or approval/routing systems (e.g., finite state machines... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    Okta

    Toronto, ON
    14 days ago
  •  ...Azure AI/ML Cloud Engineer – Azure AI, Azure ML, Python, Cosmos DB Location: Toronto, ON (Hybrid – 2 days onsite) Experience: 6...  ...Key Vault, RBAC). Support production issues and continuous platform improvements. Work with Cosmos DB , REST APIs, and microservices... 

    Astra North Infoteck Inc.

    Toronto, ON
    4 days ago
  • $140k - $160k per year

     ...Overview: We are building a modern internal platform that enables application, data, and AI...  ...are looking for an experienced Platform Engineering Lead to drive the development of our...  ...supporting microservices, data workloads, and AI/ML systems.   Implement GitOps-based... 
    Full time
    Work at office
    Worldwide

    Guidepoint

    Toronto, ON
    1 day ago
  •  ...here to leave our clients better than we found them, and to create a place where our people are proud to Build. Better. As a Platform Engineer at RAVL, you’ll design and deliver the platforms, infrastructure, and automation that power modern engineering teams. You’ll bring... 
    Permanent employment
    Full time

    ravl_io

    Toronto, ON
    7 days ago
  •  ...RAVL is a boutique technology advisory and engineering firm focused on the financial services industry. Everything we do is centered on...  ...ROI from their technology investments. We’re hiring Senior ML Platform Engineers (Contract) to contribute to building and scaling machine... 
    Contract work

    ravl_io

    Toronto, ON
    7 days ago
  • $180k - $275k per year

     ...impact on its customers, employees, and communities. The Role As Principal Software Engineer for a new product within Veeva, you will be a founding member of a team building our next major AI-driven platform — one that will transform how Life Sciences companies manage... 
    Internship
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours

    Veeva Systems

    Toronto, ON
    1 day ago
  • $145k - $170k per year

     ...Kinross Gold is seeking a Principal Security Architect to lead the...  ...posture. Establish detection engineering standards and validate AI- and...  .../XDR, and threat intelligence platforms; able to architect, integrate,...  ...Experience securing or governing AI/ML systems and AI-augmented... 
    Long term contract
    Temporary work
    Casual work
    Immediate start

    Kinross Gold Corporation

    Toronto, ON
    2 hours ago
  • $130k - $160k per year

     ...together the best of Europe and North America to shape the future of automotive retail.   --- About the role Join our Platform Engineering team at AutoScout24 Group / Trader Corporation. We build the internal platform that 600+ developers rely on to ship software quickly... 

    AutoScout24

    Toronto, ON
    13 hours ago
  • ________This is an example!________ PLEASE READ: these jobs are testing jobs of Lever's testing environment - please do not apply for this job. Lever was founded ten years ago to tackle the most strategic challenge that companies face: how to recruit and hire top talent...

    leverdemo-8

    Toronto, ON
    7 days ago
  • $156.8k - $196k per year

     ...As Marqeta's Manager, Software Engineering within our Risk, Fraud and Disputes pod, you will...  ...development, and execution of our Identity Platform . You will be responsible for building...  ...Hands-on AI-first mindset: you embrace AI/ML to improve team velocity, drive automation... 
    Internship
    Work at office
    Remote work
    Flexible hours

    Marqeta

    Toronto, ON
    5 days ago
  •  ...product, and continuous personal development. We love what we do, and we support the team around us.   About the Team   The Platform Engineering team is a highly collaborative group of software engineers committed to building, scaling, and maintaining the foundations on... 
    Full time

    air-tek

    Toronto, ON
    7 days ago
  • $137.2k - $196k per year

     ...About the Role: Tubi's content platform is the engine behind one of the largest free streaming services in the world. Every play, every deal, every...  ...services as first-class clients. Integrate LLMs and ML pipelines into content workflows (metadata enrichment, image/video... 
    Long term contract
    Remplacement
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours
    2 days per week

    Tubi - Canada

    Toronto, ON
    4 days ago
  • $216k - $297k per year

     ...talk. This is a hybrid role. It requires going to the local office 3 times a week. The Auth0Lab Team  We are a small team of engineers exploring new Auth0 products and features ideas. We take things from 0 to 1 and we look to shape the future of identity. Our team... 
    Work at office
    Local area
    Worldwide

    Okta

    Toronto, ON
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Platform Engineer, ML. Be the first to apply!