Software Engineer - LLM Inference

$128.8k - $193.2k per year
Full-time

Nutanix

Hungry, Humble, Honest, with Heart.

The Opportunity

When people talk about generative AI and other ML-powered solutions today, they often mean generative pre-trained transformers like ChatGPT, which use deep learning to respond to queries. A GPT-in-a-box solution removes the burden of building or implementing these AI solutions yourself. It also makes it easier to overcome the complexity, inefficiency, and security challenges of generative AI and AI/ML applications. Nutanix simplifies your learning curve on AI-ready infrastructure with Nutanix Cloud Platform for AI (GPT-in-a-Box). This high-performance, full-stack machine learning cloud platform helps you optimise IT costs with a software-defined cloud operating model. Harness AI-ready capabilities right out of the box to build, fine-tune, and run models, including GPTs and LLMs, while you continue to use existing teams and skills.

Join the Nutanix AI team, responsible for the magic behind the scenes.

About The Team

The Nutanix Enterprise AI team is responsible for strategic product areas including LLM Inference and the AI Gateway. We are at the forefront of Nutanix's mission to simplify AI deployment, recently showcasing our Agentic AI platform at NVIDIA GTC and NEXT 2026. This team is fast-paced, globally distributed, and focused on building the foundational layers of the AI stack.

You will report to a seasoned Technical Manager who will provide mentorship and guidance as you navigate through your responsibilities. The work setup at Nutanix AI is a hybrid model, offering a blend of in-office collaboration and remote work flexibility. As a new hire, you will be expected to be in the office for 3 days a week, ensuring that you have the opportunity to engage with your team and foster strong working relationships.

Your Role

  • Architect, design, and develop horizontally scalable, containerized, fault-tolerant services on Kubernetes.
  • Improve system performance to serve low-latency and high-throughput use cases.
  • Optimize any part of the stack, including low-level systems.
  • Leverage and contribute to relevant open-source cloud native projects.
  • Develop scalable, efficient, and fault-tolerant observability architectures for collecting, analyzing, and reporting metrics for various platform services.
  • Collaborate closely with globally located product management and backend development teams to deliver high-quality products in a fast-paced environment.
  • Contribute to all stages of the product development cycle: technical design, development, test, experimentation, analysis, and launch.
  • Be a team player by reviewing code and design docs, giving feedback on product specs and mocks, and contributing to documentation.
  • Participate in ongoing process definition and technology selection to ensure our technology stack is current with relevant trends.
  • Continuously learn and improve your technical and non-technical abilities.

What You Will Bring

  • 2-5 years of experience developing maintainable, modular, resilient, fail-safe, and long-lasting code at a product development company.
  • Strong fundamentals in programming, data structures, and algorithms.
  • Strong experience in Docker, Kubernetes, and Cloud native technologies
  • Experience building applications with Go and Python
  • Experience building and managing CI/CD pipelines
  • Strong understanding of datacenter design, including computing, storage, and networking.
  • Familiarity with on-prem, cloud, and hybrid software deployment architectures
  • Good experience in designing and tuning high-performance system software
  • Strong understanding of distributed computing and storage architectures
  • Strong knowledge of OS internals, virtualization, application performance monitoring, compute storage, and networking management
  • Familiarity with machine learning concepts and popular frameworks (like TensorFlow, PyTorch, etc) is a strong plus
  • Experience with hardware accelerators, such as GPUs, is a strong plus.
  • Experience working with large codebases or contributing to open source is a strong plus.
  • Experience in building multi-tenant services on a virtualized infrastructure is a solid plus.
  • Detail-oriented with a strong focus on quality, design, and user experience.
  • Inquisitive and highly motivated self-starter and problem solver with a drive to integrate, communicate, and work well with large projects and teams.
  • Track record of being reliable, responsible, and thorough.
  • Bachelor's/Master's in Computer Science or equivalent work experience

Highlighted Benefits (Vancouver, Canada)

Retirement: RRSP with dollar-for-dollar matching up to 7% of base salary

Mental Health: Dedicated mental health coverage plus top-tier paramedical benefits

Family: Fully paid maternity and parental leave and generous bereavement leave, including time for the loss of a pet

Equity: RSUs and Employee Stock Purchase Plan at a 15% discount

Time Off: Company holidays, sick days, company wellness days, and vacation starting at 10 days

Work Arrangement: Hybrid. This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. In locations where our workplace policy applies (i.e. San Jose, Durham, Mexico City, Vancouver, Bangalore, Pune, Hoofddorp, Belgrade, Barcelona, Singapore, Sydney and Tokyo), employees are expected to work onsite a minimum of 3 days per week to foster collaboration, team alignment, and access to in-office resources. Workplace type may vary based on location and team requirements. Please speak with your recruiter for details. Additional team-specific guidance and norms will be provided by your manager.

Pay Transparency - Role Location: The pay range for this position at commencement of employment is expected to be between CAD $128,800 and CAD $193,200 per year.

However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements, including a sign-on bonus, restricted stock units, and discretionary awards in addition to a full range of medical, financial and/or other benefits (including 401(k) eligibility and various paid time off benefits, such as vacation, sick time, and parental leave), dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

If hired, the employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors. Our application deadline is 40 days from the date of posting. The posting may, in good faith, be removed before this date if the position is filled, or it may be extended.

Vacancy posted 8 days ago