Software Engineer - LLM Inference
$128.8k - $193.2k per yearNutanix
Hungry, Humble, Honest, with Heart.
The Opportunity
When people talk about generative AI and other ML-powered solutions in today's conversation, they often refer to generative pre-trained transformers like ChatGPT that can respond to queries from a position of deep learning. A GPT-in-a-box solution removes the burden of building or implementing these AI solutions yourself. It also makes overcoming the complexity, inefficiency, and security challenges of generative AI and AI/ML applications easy. Nutanix simplifies your learning curve on AI-ready infrastructure with Nutanix Cloud Platform for AI (GPT-in-a-Box). This high-performant Machine Learning full-stack cloud platform helps you optimise IT costs with a software-defined cloud operating model. Harness AI-ready capabilities right out of the box, simplified to build, fine-tune, and run models, including GPTs and LLMs, while you continue to use existing teams and skills.
Join the Nutanix AI team, responsible for the magic behind the scenes.
About The Team
The Nutanix Enterprise AI team is responsible for strategic product areas including LLM Inference and the AI Gateway. We are at the forefront of Nutanix's mission to simplify AI deployment, recently showcasing our Agentic AI platform at NVIDIA GTC and NEXT 2026. This team is fast-paced, globally distributed, and focused on building the foundational layers of the AI stack.
You will report to a seasoned Technical Manager who will provide mentorship and guidance as you navigate through your responsibilities. The work setup at Nutanix AI is a hybrid model, offering a blend of in-office collaboration and remote work flexibility. As a new hire, you will be expected to be in the office for 3 days a week, ensuring that you have the opportunity to engage with your team and foster strong working relationships.
Your Role
- Architect, design, and develop horizontally scalable, containerized, fault-tolerant services on Kubernetes.
- Improve the performance of systems to deliver for low-latency and high-throughput use cases.
- Optimize any part of the stack, including low-level systems.
- Leverage and contribute to relevant open-source cloud native projects.
- Develop scalable, efficient, and fault-tolerant observability architectures for collecting, analyzing, and reporting metrics for various platform services.
- Collaborate closely with globally located product management and backend development teams to deliver high-quality products in a fast-paced environment.
- Contribute to all stages of the product development cycle: technical design, development, test, experimentation, analysis, and launch.
- Be a team player by reviewing code and design docs, giving feedback on product specs and mocks, and documentation.
- Participate in an ongoing process definition and technology selection to ensure our technology stack is current with relevant trends.
- Continuously learn and improve your technical and non-technical abilities.
- What You Will Bring
- 2-5 years of experience developing maintainable, modular, resilient, fail-safe, and long-lasting code from a Product Development company.
- Have strong programming fundamentals, data structure, and algorithms.
- Strong experience in Docker, Kubernetes, and Cloud native technologies
- Experience building applications with Go and Python
- Experience building and managing CI/CD pipelines
- Strong understanding of datacenter design, including computing, storage, and networking.
- Familiarity with on-prem, cloud, and hybrid software deployment architectures
- Good experience in designing and tuning high-performance system software
- Strong understanding of distributed computing and storage architectures
- Strong knowledge of OS internals, virtualization, application performance monitoring, compute storage, and networking management
- Familiarity with machine learning concepts and popular frameworks (like TensorFlow, PyTorch, etc) is a strong plus
- Experience with hardware accelerators, such as GPUs, is a strong plus.
- Experience working with large codebases or contributing to open source is a strong plus.
- Experience in building multi-tenant services on a virtualized infrastructure is a solid plus.
- Detail-oriented with a strong focus on quality, design, and user experience.
- Inquisitive and highly motivated self-starter and problem solver with a drive to integrate, communicate, and work well with large projects and teams.
- Track record of being reliable, responsible, and thorough.
- Bachelor's/Master's in Computer Science or equivalent work experience
Learn More About the Technology:
Highlighted Benefits (Vancouver, Canada)
Retirement: RRSP with dollar-for-dollar matching up to 7% of base salary
Mental Health: Dedicated mental health coverage plus top-tier paramedical benefits
Family: Fully paid maternity and parental leave and generous bereavement leave, including time for the loss of a pet
Equity: RSUs and Employee Stock Purchase Plan at a 15% discount
Time Off: Company holidays, sick days, company wellness days, and vacation starting at 10 days
Work Arrangement Hybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. In locations where our workplace policy applies (i.e. San Jose, Durham, Mexico City, Vancouver, Bangalore, Pune, Hoofddorp, Belgrade, Barcelona, Singapore, Sydney and Tokyo), employees are expected to work onsite a minimum of 3 days per week to foster collaboration, team alignment, and access to in-office resources. Workplace type may vary based on location and team requirements. Please speak with your recruiter for details. Additional team-specific guidance and norms will be provided by your manager.
Pay Transparency - Role Location The pay range for this position at commencement of employment is expected to be between CAD $128,800 and CAD $193,200 per annual.
However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements, including a sign-on bonus, restricted stock units, and discretionary awards in addition to a full range of medical, financial and/or other benefits (including 401(k) eligibility and various paid time off benefits, such as vacation, sick time, and parental leave), dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors. Our application deadline is 40 days from the date of posting. In good faith, the posting may be removed prior to this date if the position is filled or extended in good faith.
$23 per hour
...with them by embedding expert data science, engineering, and AI talent directly into projects... ...Excellence, and Ownership Job Title: LLM Research Intern Job Summary Reporting... ...reranking pipelines ~ Contribute to inference optimization and deployment pipeline...SuggestedSummer workInternshipWork at office- ...in the world. One of our ventures is building machine learning inference systems for audio at scale. The work spans music, podcasts, and... ...About the role This role sits at the intersection of ML engineering, platform / infrastructure work, and inference systems. You will...SuggestedManual laborShift work
$176.26k - $220.32k per year
...Make An Impact As a Senior Principal Software Engineer, you will be a technical leader driving... ...Augmented Generation), prompt engineering, and LLM integration strategies. Cloud &... ...fine tuning, model distillation, building inference infrastructure/framework, model...SuggestedLong term contractRemote work$86.32k - $107.9k per year
...Position Overview: As a Software Engineer II at Diligent, you’ll take on a hands-on technical role in building secure, scalable, and high... ...context length, embeddings, hallucinations), understands high-level LLM behavior, and recognizes safe vs. unsafe use cases (privacy,...SuggestedWork at officeLocal areaFlexible hours$147k - $174k per year
...role We are seeking an experienced and ambitious Full Stack Software Engineer passionate about building high-quality, large-scale web and desktop... ...modern AI-powered developer tools and workflows (e.g., LLM-assisted development, code generation, debugging, and productivity...SuggestedLong term contractWork at officeShift workRotating shift- ...made without data. Most of it has never been touched by modern software engineering, let alone AI. EviSmart is the dental industry's Autopilot.... ...~ Hands-on access to the full AI toolchain: Claude, Cursor, LLM-powered workflows in production, not a sandbox. ~ Direct line...Long term contractFull timeTemporary workInternshipWork at officeImmediate start
$150k - $200k per year
...a rapidly growing company that helps B2B software companies sell through cloud marketplaces... ...• Act as a technical multiplier, setting engineering standards, mentoring engineers, and raising... ...AI-enabled systems (e.g., AI agents, LLM-backed services, ML-powered workflows, or...Long term contractDirect hire$70.51k - $88.13k per year
...businesses think about cybersecurity, digital experiences, and identity and access management. Join the PingAccess Software team as a Software Engineer, where you’ll develop features and updates for our mission-critical platform that secures billions of identities for world...Local areaWorldwideFlexible hours$150.5k - $175.25k per year
...tooling and frameworks for AI/LLM-driven systems, including prompt... ...Applied AI teams on prompt engineering, model selection, and evaluation... ...~5+ years of experience in software engineering or SDET roles with... ...Cloud Platform, Cloud Run / App Engine, Kubernetes, Datastore, Redis,...Work at officeShift work$92.65k - $119.9k per year
...Summary NetApp is pioneering the development of StorageGRID object storage – AWS cloud compatible software powering the exponential growth in AI data lakes. As a Software Engineer, this is your chance to work alongside a group of talented developers, impart your vision, and...Summer workWork at officeLocal areaWork from homeHome officeFlexible hoursDay shift$155.9k - $219.7k per year
...and closed EHR systems into a single, modern platform that powers software, APIs, payments, and patient experiences across the ecosystem.... ...industry. What You’ll Do ~ You'll one of NexHealth's first engineers based in Canada — setting the technical and cultural tone for what...Live InRemote workFlexible hours- ...Elasticsearch, Redis, ScyllaDB, Redshift, TiDB, MariaDB Build software that utilize messaging queues such as Kafka, SQS, and Kinesis... ...candidates that have: ~2+ years of experience as a Backend Software Engineer. ~ Very strong problem solving skills in data structures,...Local areaRemote workWork from homeHome office
$146k - $162k per year
...believe AI represents the future of work, and APIs are at the heart of how AI connects with the tools where work happens. As a Software Engineer on our API & Developer Platform team in Vancouver, you will help make our developer platform come to life for AI, designing intuitive...Long term contractWork at officeLocal areaWork from homeWorldwide- ...high-growth, well-funded SaaS company that helps answer questions software development teams have about their applications. This allows... ...built. About You You’re an experienced software development engineer with a track record of building and shipping products that customers...Long term contractFull timeWork at officeFlexible hours
$132.6k - $174k per year
...our guests post-purchase experiences after placing their orders on the Lululemon website. Core Responsibilities As a Senior Software Engineer, you will lead the design and implementation of complex software systems and features spanning multiple services and components,...Long term contractPermanent employmentFull timePart time- ...fun sessions and celebrations that are often open to the entire organization. Position Summary We are looking for a Junior Software Engineer who is passionate about building scalable cloud platforms, distributed systems, and modern data infrastructure. Reporting to the...Permanent employmentFull timeWork at officeLocal areaWorldwideFlexible hours3 days per week
$40 - $75 per hour
...situations - Believes rewards should follow meaningful results, with engineers who make a real impact on the product and the company sharing in... ...team and purpose above self-gain Curious about life as a Software Engineer at Vizzion? Click below to see what it’s all about....Hourly payLong term contractFull timeTemporary workFlexible hours$128.15k - $151.93k per year
...Optimistic, Persistent, and Empathetic . Your role At Dialpad, we are building the future of AI-driven communication. As a Software Development Engineer in Test (SDET), you will lead the charge in building scalable, world-class mobile test automation frameworks that empower...Work at officeShift work- ...Ready to make an impact as a Full Stack Software Engineer for an innovative SaaS startup? At Remarcable , we’re not just building software—we’re transforming how contractors and distributors work together in the construction industry. We’re looking for a resourceful...Full timeFor contractorsRemote work
- ...loving culture, and a drive to do what it takes to make great games. And this is where you come in… The key function of the Software Engineer Co-op (Gameplay ) is to gain knowledge and experience in building gameplay features and mechanics, and creating tools that...Contract workInternship
- ...running in real industrial pilots - and we’re growing the team to take it even further. About the Role As a Senior Embedded Software Engineer at Humanoid, you will play a pivotal role in designing, developing, and optimizing embedded systems for cutting-edge robotic...Full timeWork at officeWorldwide
$146k - $162k per year
...We are looking for a Software Engineer to join our Product team in Vancouver, where we build features end-to-end, from designing our data models to implementing the subtle interaction behaviors that differentiate good software from great software. In this role, you won’t just...Long term contractWork at officeLocal areaWork from homeWorldwide- ...Sonus Microsystems is reimagining ultrasound for continuous, operator-independent patient monitoring, and we're looking for a Software Engineer to join our highly collaborative team of scientists, engineers, and innovators working at the intersection of deep tech and real-...Full timeInternship
$192k - $240k per year
...banking with intuitive spend management, bill pay, and travel software, Brex enables founders and finance teams to accelerate operations... ...tools, resources, and support you need to grow your career. Engineering at Brex Engineering at Brex is about building systems that...Long term contractWork at officeRemote workWork from home- ...culture, and a drive to do what it takes to make great games. And this is where you come in... We are looking for a Senior Software Engineer to join us on a contract for for an unannounced project with a large IP partner. The key function of the Senior Software Engineer...Contract workFor contractors
$122.3k - $170.7k per year
...and developer workflows Contribute to incident response and continuous system improvements What You Bring ~7+ years as a Software Engineer, with increasing levels of responsibility. ~ Experience working on large-scale build/release systems or developer platforms (e...Full timeLocal area$110k - $160k per year
...technology. The Role You will join TrustFlight as an AI Software Engineer and serve as a core member of the AI Team. In this role, you... ...and internal tools. Build, orchestrate, and integrate LLM-based systems into production applications. Apply AI patterns...Permanent employmentFull timeWork at office- ...focused on precision, speed, and real outcomes. Our AI-powered engine learns, adapts, and improves continuously. Employers don’t just... ...leveraging AI tools. Who You Are ~10+ years of experience in software engineering, with a demonstrated track record of leading and...Hourly payLong term contractRemote workFlexible hours
$279.12k - $332.44k per year
...been the foundation of the way users build in Roblox. Our physics engine enables dynamic environments with emergent behavior making games... ...knowledge of modern computer architecture and its impact on software performance. Physics and Numerical Methods Experience: Practical...Full timeWork at officeLocal areaVisa sponsorshipMonday to friday- ...Principal Software Engineer Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency...Internship
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer - LLM Inference. Be the first to apply!
- software engineer - ai machine learning Vancouver, BC
- linux software engineer Vancouver, BC
- software development engineer Vancouver, BC
- software developer co-op Vancouver, BC
- software developer entry level Vancouver, BC
- remote entry level software developer Vancouver, BC
- junior software developer internship Vancouver, BC
- développeur logiciel Vancouver, BC
- software engineer Vancouver, BC
- junior software engineer Vancouver, BC
