
London or Bristol; hybrid, with 3 days in the office and 2 days working from home
At Fractile, we’re building what we believe will be the world’s fastest AI inference chip from the ground up. We’re balanced across hardware and software engineering, and HW/SW co-design is real here. We move fast, and we help each other move fast. We care about each other, the software we ship, and the people who rely on it.
On the device, close to the metal, we write the runtime software that orchestrates work across the chip and runs performance-critical ML kernels. This is where performance gets real and the wins compound. Your work directly influences trade-offs for the silicon, system deployment, and the compiler.
You'll drive the first accelerator compute runs, evaluating performance on silicon, running early benchmarks, and feeding results back into the hardware and software roadmap.
If you want to build the software that turns cutting-edge hardware capability into real throughput and low latency, come build it with us.
Fractile is a London-based AI chip startup developing in-memory computing processors designed to run large language model inference up to 100x faster and 10x cheaper than current GPU systems. Founded by Oxford Robotics Institute PhD graduate Walter Goodwin, the company's novel chip architecture fuses computation with memory to eliminate the data-shuttling bottleneck that limits conventional hardware. Fractile emerged from stealth in July 2024 and has since announced a £100M commitment to expand UK operations, including a new hardware engineering facility in Bristol. The team includes senior hires from NVIDIA, ARM, and Imagination Technologies.