About the role
Step into the engine room of Agentic Commerce! Imagine owning the bleeding edge of machine learning at Shopify, where your acceleration, optimization, and scaling of ML inference will shape the experience of millions of merchants, and influence how commerce AI is done worldwide. We’re seeking a Senior Staff Engineer to architect, optimize, and own the high-performance runtime that transforms innovative models into production breakthroughs. Your work will be the engine behind our real-time AI systems, driving game-changing cost and latency reductions, and enabling rapid launches of intelligent features that keep Shopify (and our merchants) years ahead. Join a remote-first team of world-class experts, experiment fearlessly, and see your code move the needle for some of the largest-scale ML workloads in commerce.
Responsibilities
Architect, optimize, and own Shopify’s production ML inference. Designing for high throughput, ultra-low latency, and global reliability.
Leverage and extend technologies like CUDA, TensorRT, Triton, TVM, and custom GPU kernels to deliver state-of-the-art performance and efficiency at scale.
Partner with ML, infrastructure, and product teams to seamlessly deploy, benchmark, and scale cutting-edge models powering our platform.
Drive cost optimization and system efficiency, reducing cloud spend and carbon footprint by orders of magnitude without sacrificing model quality.
Lead deep performance investigations, apply advanced techniques (pruning, quantization, distillation, batching), and implement robust solutions for serving models in production.
Set technical strategy and culture for ML inference across Shopify, mentoring others and collaborating with global AI pioneers.
Qualifications
Proven, hands-on expertise in building and optimizing large-scale ML inference systems, with measurable performance and cost wins.
Deep experience in production model serving, runtime optimization, and acceleration. Especially leveraging GPUs (CUDA, TensorRT) and high-performance deep learning infrastructure.
Strong software engineering skills (Python, C++, and/or other relevant languages) with a robust systems and distributed computing mindset.
Demonstrated leadership in architecting or scaling reliable, real-time inference at scale, handling millions of queries per day.
Track record of cross-functional impact: working closely with ML research/engineering, infra, and product teams to deliver production results.
Advanced understanding of model compression, quantization, efficient deployment, and tradeoffs between speed, cost, and accuracy.
Nice to Haves
Open source contributions to inference frameworks (TensorRT, TVM, Triton, DeepSpeed, ONNX, etc.) or technical talks/publications at leading AI conferences.
Experience optimizing inference across a variety of hardware (NVIDIA, AMD, ARM, cloud TPUs).
Familiarity with building or integrating robust monitoring, observability, and auto-scaling for inference platforms.
Experience with modern MLOps pipelines and methodologies.
Prior experience in e-commerce, large-scale product infra, or globally distributed inference workloads.
At Shopify, we pride ourselves on moving quickly—not just in shipping, but in our hiring process as well. If you're ready to apply, please be prepared to interview with us within the week. Our goal is to complete the entire interview loop within 30 days. You will be expected to complete a live pair programming session, come prepared with your own IDE.
About Shopify
Opportunity is not evenly distributed. Shopify puts independence within reach for anyone with a dream to start a business. We propel entrepreneurs and enterprises to scale the heights of their potential. Since 2006, we’ve grown to over 8,300 employees and generated over $1 trillion in sales for millions of merchants in 175 countries.
This is life-defining work that directly impacts people’s lives as much as it transforms your own. This is putting the power of the few in the hands of the many, is a future with more voices rather than fewer, and is creating more choices instead of an elite option.
About you
Moving at our pace brings a lot of change, complexity, and ambiguity—and a little bit of chaos. Shopifolk thrive on that and are comfortable being uncomfortable. That means Shopify is not the right place for everyone.
- Care deeply about what you do and about making commerce better for everyone
- Excel by seeking professional and personal hypergrowth
- Keep up with an unrelenting pace (the week, not the quarter)
- Be resilient and resourceful in face of ambiguity and thrive on (rather than endure) change
- Bring critical thought and opinion
- Put AI agents and tools to work on the tasks they're built for, and focus on the work only humans can do
- Embrace differences and disagreement to get shit done and move forward
- Work digital-first for your daily work
First things first
It looks like you might not be a great fit for Shopify at this time — while we encourge everyone to apply, we are looking for someone who can meet all these criteria.