Build the technical foundation and reusable infrastructure for a new business vertical while leading early customer engagements from scoping to delivery. Partner with product and engineering teams to translate customer needs into core platform capabilities.
Protege
15 Remote Job Openings at Protege
Own end-to-end healthcare customer engagements from initial scoping and technical implementation to post-launch support. Bridge the gap between customer needs and product capabilities by developing custom solutions and surfacing repeatable patterns for the platform.
Design and operate large-scale ingestion systems to transform raw multimodal data into clean, AI-ready datasets. Own the end-to-end data processing layer, ensuring high throughput, reliability, and security for sensitive data.
Lead the research and development of evaluation frameworks and metrics to optimize the quality of large-scale speech datasets. Translate research insights into scalable filtering rules and tools to improve downstream ML model performance.
Design and build high-value datasets, tasks, and environments to benchmark agentic systems and multi-step model behavior. Develop frameworks to evaluate data quality and connect model failures to specific dataset or environment gaps.
Lead the launch and growth of a new business vertical from the ground up, including defining the strategy and resourcing plan. Forge strategic partnerships between model builders and data holders to validate product-market fit and close initial deals.
Lead the DataLab research function by defining the research agenda and building systems for AI data experimentation and evaluation. Partner with Product, Engineering, and GTM teams to translate research findings into product direction and customer strategy.
Lead the design and validation of trusted benchmarks and evaluations for frontier AI models across various domains. Develop the statistical framework for evaluation science and translate research findings into deployable datasets for customers.
Drive revenue by helping organizations access and scale high-quality video datasets for AI model training and evaluation. This involves owning the full sales cycle from pipeline generation and discovery to closing deals and developing vertical-specific use cases.
Lead the evaluation and optimization of large-scale datasets used to train generative video models. Develop systems, metrics, and benchmarks to ensure video data is diverse, high-impact, and production-ready.
The Head of Security will own the end-to-end security strategy, architecture, and operations while building the program from the ground up. This role involves maturing compliance programs, securing data pipelines, and serving as the primary security face to customers and partners.
You will operationalize and scale the healthcare data partner ecosystem by managing onboarding, technical integration, and commercial alignment. This role involves ensuring high-quality data delivery while collaborating across product, engineering, and legal teams to meet AI training requirements.
Own the privacy and data trust product value stream from discovery through execution. Define the product roadmap, manage vendor relationships, and translate research into scalable product capabilities.
The Solutions Engineer will connect Protege's media catalog with customer AI data needs, focusing on data quality and curation of media datasets. They will work with evolving partner datasets to normalize and operationalize them for AI use cases.
The Business Development Representative will be responsible for building a high-quality healthcare pipeline through disciplined, insight-led outbound prospecting focused on organizations providing real-world data for model development. Key tasks include running targeted sequences, rigorously qualifying opportunities, and ensuring clean handoffs to accelerate next steps.