PICTOR LABS INC

Sr. ML Inference Engineer

Posted 5 months ago

United States

$200K - $260K per year

⭐ 5-10 years experience

python cloud medical docker kubernetes software-engineering documentation compliance

Apply Now

Please mention DailyRemote when applying

AI Summary

The engineer will design, develop, and optimize production ML inference systems for virtual staining models, architecting high-performance pipelines capable of processing gigapixel pathology images with sub-2-minute latency requirements. This involves optimizing model architectures for cloud APIs and edge devices, and implementing state-of-the-art inference frameworks.

Job DetailsJob Location: Texas - Remote - Austin, TX 78730Salary Range: $200,000.00 - $260,000.00 Salary About Pictor Labs Pictor Labs is the leading virtual staining company revolutionizing digital pathology adoption worldwide through cutting-edge AI-powered technology. Our solutions deliver diagnostic-quality results in minutes while preserving tissue samples for comprehensive analysis. Our breakthrough DeepStain™ and ReStain™ technologies enable unlimited virtual staining from a single tissue sample, eliminating the bottlenecks and limitations of traditional chemical staining processes. This innovation supports the critical evolution from research applications to clinical deployment, empowering laboratories to advance their digital pathology capabilities while reducing chemical waste, improving operational efficiency, and expanding diagnostic possibilities. About the Role We are seeking an experienced Senior ML Inference Engineer to join our team, focusing on optimizing and deploying our production virtual staining models at scale. The ideal candidate will have deep expertise in ML inference optimization, GPU programming, and building production-grade inference systems. You will work on critical challenges such as reducing inference latency for whole slide imaging (WSI) from tens of minutes to under 2 minutes, deploying models on edge devices with NVIDIA hardware, and ensuring our inference infrastructure meets FDA and SOC2 compliance requirements. This role offers the opportunity to work at the intersection of cutting-edge AI and life-saving healthcare technology, making a tangible impact on patient outcomes. Location: Remote US Company: Pictor Labs Employment Type: Full-time Responsibilities Design, development, and optimization of production ML inference systems for virtual staining models (Deepstain, Restain, ClearStain) serving clinical and pharmaceutical customers Architect and implement high-performance inference pipelines capable of processing gigapixel pathology images with sub-2-minute latency requirements Work with ML Research and Engineering teams to optimize model architectures and deployment strategies for both cloud-based APIs and edge devices (NVIDIA DGX Sparc, Grace Blackwell superchips) Evaluate, implement, and maintain state-of-the-art inference frameworks (TensorRT, Triton Inference Server, ONNX Runtime) to maximize GPU utilization and throughput Profile and optimize deep neural networks on NVIDIA GPUs using tools such as NVIDIA Nsight, PyTorch Profiler, and custom instrumentation Design and implement efficient model serving architectures that support both synchronous REST APIs and asynchronous batch processing workflows Collaborate with Platform and Edge Device teams to containerize inference systems (Docker, Kubernetes) for deployment across cloud and on-premise environments Partner with cloud providers (AWS, GCP, Azure) to optimize hosted inference solutions and leverage latest hardware accelerators Ensure inference systems meet regulatory requirements (FDA 510(k), SOC2) with comprehensive monitoring, logging, and audit capabilities Prototype and productionize new inference optimization techniques, including quantization, pruning, distillation, and dynamic batching strategies Build robust telemetry and monitoring systems to track model performance, latency, throughput, and resource utilization in production Qualifications Qualifications Required: 7+ years of experience building and optimizing production ML inference systems at scale Expert-level proficiency in Python and experience writing high-performance inference services 5+ years of hands-on experience with PyTorch and at least one production inference tools (TensorRT, Triton Inference Server, ONNX Runtime, TorchServe) Deep understanding of computer vision model architectures, particularly generative models (GANs, diffusion models) and vision transformers Extensive experience profiling and optimizing deep neural networks on NVIDIA GPUs, including memory optimization, kernel fusion, and mixed-precision inference Strong background in image processing pipelines and libraries (OpenCV, Pillow, scikit-image) for handling large-scale medical imaging data Proven experience deploying ML systems on Kubernetes and major cloud providers (AWS, GCP, Azure) Experience with Docker containerization and orchestration for ML workloads Strong software engineering practices including version control (Git), CI/CD, unit testing, and production debugging Excellent communication, collaboration, and technical documentation skills Preferred: Experience with medical imaging, digital pathology, or whole slide imaging (WSI) processing Knowledge of edge device deployment and embedded systems for AI inference Experience with MLOps tools (MLflow, Kubeflow, Apache Airflow) and model versioning Understanding of FDA regulatory requirements for AI/ML in medical devices Background in distributed inference systems and model parallelism techniques Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK Stack) What We Offer The opportunity to work on technology that directly improves patient outcomes and transforms clinical diagnostics, alongside a talented team of engineers and researchers pushing the boundaries of AI in healthcare. PictorLabs is an equal opportunity employer and does not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

PICTOR LABS INC

Sr. ML Inference Engineer

AI Summary

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Quality Engineer III

Cloud Network Engineer III

Software Engineer

Senior Test Engineer

Sales Director, US Video & Cloud

Senior DevOps Engineer - Remote USA

PICTOR LABS INC

Sr. ML Inference Engineer

AI Summary

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Quality Engineer III

Cloud Network Engineer III

Software Engineer

Senior Test Engineer

Sales Director, US Video & Cloud

Senior DevOps Engineer - Remote USA

Personalize your Remote Job Search in 3 Easy Steps!