Objective/Scope
The Manager, AI/ML (Computer Vision & LLMs/Agents) leads a team of machine learning engineers and applied scientists responsible for designing, deploying, and operating production AI systems with a primary emphasis on computer vision and large language model / agentic applications. This leader owns the end-to-end ML lifecycle, from problem framing and data strategy through model development, deployment, and continuous monitoring, and is accountable for the scalability, availability, and accuracy of the AI systems the team ships.
The Manager sets and enforces the standards that make AI systems production-grade: rigorous model evaluation and accuracy targets, MLOps tooling and automation, observability and drift detection, and clear service-level objectives for latency and uptime. They translate AI strategy into an executable roadmap, ensuring solutions meet performance, reliability, and responsible-AI standards appropriate to a clinical/healthcare context including patient-safety-relevant accuracy and applicable ethical and regulatory requirements.
As a people leader, the Manager grows and mentors the team, sets goals, manages performance, and removes blockers, while partnering closely with cross-functional teams (product, data engineering, DevOps/infrastructure, and clinical stakeholders) to align priorities and deliver impact. They remain current with the field, evaluating emerging computer vision and LLM/agent techniques and bringing the most valuable advances into the team's practice.
Job Functions
Technical Leadership & Architecture
- Lead the architecture and end-to-end execution of the ML lifecycle - data strategy, model development, deployment, and continuous operation - primarily for computer vision and LLM/agentic systems.
- Own the MLOps foundation: training and deployment pipelines, model serving, CI/CD for models, and reproducible experimentation.
- Set and enforce standards for model accuracy and quality (evaluation frameworks, offline and online metrics, regression testing of models, and A/B testing) and hold the team to defined targets.
- Ensure production AI systems are scalable and highly available: define service-level objectives for latency, throughput, and uptime, and establish monitoring, drift detection, alerting, and rollback practices.
- Lead requirements definition and feasibility assessment for new ML features and functionality, balancing technical trade-offs against business and clinical impact.
- Guide design and code quality across the team; contribute hands-on in Python and the ML stack as needed to unblock the team and set the technical bar.
Delivery & Project Management
- Plan and manage team workload: delegate tasks, set daily, weekly, and monthly goals, and track progress against them.
- Provide project estimates and timelines; manage scope of tasks, features, and epics before, during, and after delivery.
- Identify risks early and form contingency plans; surface and remove blockers that impede the team's progress.
- Manage software/feature releases and communicate release status and outcomes to stakeholders.
- Report project status, write progress updates, and deliver presentations to relevant stakeholders, including management and customers.
People Leadership
- Lead, mentor, and develop the ML engineering team; conduct regular 1-1s, set goals, and manage performance.
- Motivate the team and foster a transparent, psychologically safe environment where people can raise questions, concerns, challenges, failures, and successes openly.
- Evaluate team performance, provide guidance to sustain productivity, and partner with senior management and HR on performance management and compensation.
- Participate in hiring assessing candidates and providing interview feedback to grow the team.
Cross-Functional Collaboration & Strategy
- Partner with product, data engineering, DevOps/infrastructure, and clinical stakeholders to align priorities and drive projects forward.
- Translate AI strategy into an executable roadmap, ensuring solutions comply with performance, reliability, and responsible-AI standards appropriate to a clinical/healthcare context.
- Stay current with advances in computer vision, LLMs, and agentic systems; evaluate emerging techniques and bring the most valuable into the team's practice.
- Analyze existing operations and processes, and drive improvements through training, tooling, and best practices.
Competencies
AI/ML Domain (core)
- Computer Vision: image/video understanding, detection, segmentation, real-time inference
- Large Language Models & Agentic Systems: LLM application development, prompting/fine-tuning, RAG, agent orchestration
- Deep Learning: modern neural architectures (CNNs, transformers) and training practices
- Model Evaluation & Accuracy: evaluation frameworks, metrics, A/B testing, regression testing of models
- Data Science, Data Modeling, and Predictive Modeling
Engineering & Operations
- MLOps & ML Lifecycle: training/deployment pipelines, model serving, CI/CD for models
- Production Reliability: scalability, high availability, latency/throughput SLOs, monitoring, drift detection
- DevOps & Cloud Infrastructure (incl. GPU/accelerated compute)
- Software Engineering Best Practices: design patterns, test-driven development, debugging, code quality
Leadership & Delivery
- People Leadership: mentoring, performance management, team development
- Technical Project Management: estimation, scope, and delivery
- Strategic Planning & Strategy Management
- Cross-Functional Collaboration & Stakeholder Communication
- Responsible AI: ethics, safety, and (clinical/regulatory) compliance
Supervisory Requirements
- Manage machine learning team members including tasks, performance, and related mentoring.
Education and Experience
Required
- Bachelor's degree in a related field (Computer Science, Computer Information Systems, or equivalent). Advanced degree in Computer Science, Machine Learning, or a related discipline is preferred.
- 7+ years of related machine learning or software engineering experience, including 3+ years managing technical teams.
- Demonstrated experience leading teams that delivered ML/AI systems to production at scale, with accountability for accuracy, reliability, and availability.
- Hands-on expertise in computer vision and/or LLM/agentic systems, with the ability to set the technical direction for both.
- Expert-level proficiency in Python and strong command of the modern ML stack.
- Proficiency in at least two ML frameworks (for example, PyTorch, TensorFlow, or Hugging Face Transformers).
Working knowledge of MLOps practices: model deployment, serving, monitoring, drift detection
- Working knowledge of MLOps practices: model deployment, serving, monitoring, drift detection, and CI/CD for models.
- Solid software engineering fundamentals, including design patterns, test-driven development, unit testing, and effective troubleshooting and debugging practices.
Leadership and Delivery
- Able to lead, support, motivate, and mentor technical team members.
- Able to define and review software requirements, and to build and manage task hierarchies for new features and fixes.
- Able to manage scope of tasks, features, and epics before, during, and after delivery.
- Able to provide realistic project estimates and timelines, and to identify and remove bottlenecks that impede progress.
- Communicates effectively with peers, internal customers, and stakeholders across all levels of the organization.
- Takes initiative on resolving issues and continuously improving the codebase and team practices.
Preferred
- Experience deploying AI systems in a clinical, healthcare, or other regulated environment.
- Familiarity with cloud and GPU/accelerated compute infrastructure for training and inference.
- Experience with responsible-AI practices, including model safety, bias evaluation, and applicable compliance requirements.
Physical Demands and Working Conditions:
- Must be able to sit for much of the workday with periodic walking and/or standing.
- Must be able to work in an office environment.
- Minimal travel is required for this role.
Other Duties:
Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without
Compensation & Benefits
- Base Salary Range: $180,000 - $200,000 per year
- Bonus Eligibility: Yes, bonus eligible
- Benefits Offered:
- AvaSure sponsored Medical, Dental & Vision
- Safe Harbor 401K with Employer Matching up to 4%
- HSA Employer Contributions, Employer Paid Life, Short-term and Long-term Disability, and AD&D Insurance Plans
- Flexible Time Off Plan & Paid Holidays
- Parental Leave
- Generous Tuition & Continuing Education Reimbursement available
- Employee Referral Bonus
FLSA Classification: Exempt
Position Type: Full-time
Location: United States | Remote
Why AvaSure?
As the pioneer and expert in inpatient telehealth, AvaSure has provided safer environments with over 2 million patients monitored and 200 million monitored hours. By continuing to reduce adverse events, and by optimizing workforce efficiencies for the nation's top health systems, AvaSure has consistently lowered the cost of care while providing safer, more efficient healthcare for everyone.β
Diversity creates a healthier work environment: AvaSure is an Equal Employment Opportunity/Affirmative Action employer, and all qualified applicants