Inferact

Member of Technical Staff, CI/CD Infrastructure

Inferact · Full Time · 13 days ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Software Development

Maintain and scale the compute infrastructure powering CI, releases, and performance benchmarks for the vLLM project across various accelerators. Focus on reducing CI time-to-signal and building tooling to support thousands of open-source contributors.

Member of Technical Staff, AMD GPU Performance Engineering

Inferact · Full Time · a month ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Others

Build and optimize AMD GPU backends, kernels, and runtime paths to make vLLM a first-class inference engine. Improve performance-critical paths including attention, GEMM, and communication-heavy operations using ROCm and related tooling.

Member of Technical Staff, Inference

Inferact · Full Time · a month ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Others

Optimize the vLLM inference engine to improve the speed and cost of running LLMs and diffusion models. Develop innovations for diverse hardware and architectures, including mixture-of-experts and multimodal models.

Member of Technical Staff, TPU & AMD GPU Performance Engineering

Inferact · Full Time · a month ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Others

Build and optimize AMD GPU and TPU backends, kernels, and compiler integrations to make vLLM a first-class inference engine on non-NVIDIA hardware. Improve critical paths such as attention, GEMM, and KV-cache while developing robust benchmarking infrastructure.

Member of Technical Staff, Developer Relations

Inferact · Full Time · a month ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Software Development

The role involves creating high-quality technical content, tutorials, and demos to help developers adopt and scale vLLM. You will act as an educator-builder, explaining complex inference systems concepts and hosting workshops for the AI infrastructure community.

Member of Technical Staff, Kernel Engineering

Inferact · Full Time · 5 months ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Software Development

The role involves writing kernels and low-level optimizations to enhance the performance of vLLM as an inference engine. The engineer will collaborate with hardware vendors to maximize performance across various accelerator types.

python

Member of Technical Staff, Cloud Orchestration

Inferact · Full Time · 5 months ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Software Development

The cloud orchestration engineer will build the operational backbone for vLLM, focusing on cluster management, deployment automation, and production monitoring. The role involves ensuring that vLLM deployments are observable, debuggable, and recoverable.

cloud kubernetes

Member of Technical Staff, Performance and Scale

Inferact · Full Time · 5 months ago

🌎 United States 💵 $200K - $400K per year ⭐ 5-10 yrs exp 💼 Others

The role involves building the distributed systems that power inference at global scale. You will design and implement the foundational layers that enable vLLM to serve models across thousands of accelerators with minimal latency and maximum reliability.

training serverless

Member of Technical Staff, Exceptional Generalist (Remote)

Inferact · Full Time · 5 months ago

🌎 United States ⭐ 2-5 yrs exp 💼 Others

You will work across the entire vLLM stack, optimizing CUDA kernels, designing distributed orchestration systems, and implementing new model architectures. Your work will directly impact how the world runs AI inference.

communication

9 Remote Job Openings at Inferact
