
London or Bristol, 3 days in the office, 2 days WFH
At Fractile, we’re building what we believe will be the world’s fastest AI inference chip from the ground up. We’re balanced across hardware and software engineering, and HW/SW co-design is real here. We move fast, and we help each other move fast. We care about each other, the software we ship and the people who rely on it.
This role sits at the boundary between host and silicon. The kernel driver is key to keeping pace with our ultra-fast devices on cutting-edge server platforms. It’s a high-leverage layer where each win shows up as real throughput and latency gains.
You’ll be there for the pre-silicon simulations, first bring-up, first end-to-end runs, and the moments where performance jumps because of something you shipped.
If you want to build software where every driver win unlocks huge system performance, come build it with us.
Fractile is a London-based AI chip startup developing in-memory computing processors designed to run large language model inference up to 100x faster and 10x cheaper than current GPU systems. Founded by Oxford Robotics Institute PhD graduate Walter Goodwin, the company's novel chip architecture fuses computation with memory to eliminate the data-shuttling bottleneck that limits conventional hardware. Fractile emerged from stealth in July 2024 and has since announced a £100M commitment to expand UK operations, including a new hardware engineering facility in Bristol. The team includes senior hires from NVIDIA, ARM, and Imagination Technologies.