Role Overview
As an ML Efficiency Engineer, you’ll be the engine that accelerates how quickly we can push the boundaries of LLM capabilities. Your work will make cutting-edge research move faster, dramatically increasing the pace at which we can iterate, test hypotheses, and discover breakthroughs. You’ll empower both our Applied Scientists and our customers to explore more ideas in the same amount of time, unlocking progress that simply isn’t possible otherwise, such as super long-horizon agents.
In this role, you’ll build and optimize the tooling, pipelines, and systems that make large-scale experimentation seamless. You’ll improve training and inference performance, streamline data and model workflows, and identify bottlenecks that slow iteration. Every efficiency gain - whether in cluster utilization, memory usage, parallelization, or algorithmic throughput - translates directly into more experiments, richer exploration, and faster discovery.
Ideal Profile