AI Inference Engineer QVAC (100% remote Worldwide)
Tether Operations Limited
·
Full Time
·
3 months ago
Tether Operations Limited
The engineer will own the inference backbone for QVAC's local AI stack, focusing on the C++ systems layer to ensure models run fast, reliably, and predictably on user hardware. Responsibilities include porting and enhancing inference engines like llama.cpp and ONNX to run efficiently on edge devices, focusing on runtime stability and performance.