Design, implement, and maintain scalable and secure core infrastructure across on-premises and cloud environments. Provide advanced technical support for production incidents and drive operational excellence through root cause analysis and system optimization.
About the Job
The Senior Infrastructure Engineer is a highly technical role responsible for the design, implementation, and ongoing support of MedAdvisor Solutions’ core infrastructure, with a strong emphasis on Linux-based systems, enterprise servers, and data center operations. This role partners closely with the Director of IT to shape infrastructure strategy, improve system reliability, and drive operational excellence across production environments. The position balances hands-on engineering with architectural input and advanced support responsibilities, with minimal focus on end-user desktop support.
Role Responsibilities
Infrastructure Engineering & Architecture
- Design, implement, and maintain scalable, secure, and highly available infrastructure solutions across on-premises and cloud environments.
- Lead architecture discussions and provide recommendations for system improvements, modernization, and optimization.
- Partner directly with the Senior Director of IT to define infrastructure roadmap, standards, and best practices.
- Evaluate and implement new technologies to enhance system performance, resilience, and cost efficiency.
Cloud & Server Administration
- Administer and support Google Cloud Platform (GCP) environments, including patching, performance tuning, and security hardening.
- Administer and support enterprise Linux environments (e.g., RHEL, Ubuntu, CentOS), including patching, performance tuning, and security hardening.
- Administer and support enterprise Windows environments, including patching, performance tuning, and security hardening.
- Manage physical and virtual server environments, including VMware and related platforms.
- Oversee system provisioning, configuration management, and automation initiatives.
- Perform advanced troubleshooting across OS, storage, compute, and networking layers.
Operational Support & Reliability
- Own escalation-level support for production incidents, acting as a senior technical resource for complex issues.
- Monitor system health, capacity, and performance; proactively identify and remediate risks.
- Drive root cause analysis (RCA) and implement preventative measures to reduce recurring incidents.
- Improve and mature incident management, change management, and operational processes.
Network & Platform Management
- Manage network connectivity and operations including firewall, enterprise load balancers and all network connectivity (e.g. Palo Alto, F5, GCP Networking)
- Ensure high availability and disaster recovery readiness across critical systems.
- Support backup, replication, and failover strategies.
Collaboration & Leadership
- Act as a key liaison between Infrastructure, Development, and Security teams to support application performance and deployment needs.
- Mentor junior engineers and contribute to skill development across the team.
- Collaborate with vendors and service providers to resolve issues and optimize services.
Qualifications & Experience
- Bachelor’s degree in Computer Science, Information Systems, or equivalent experience.
- 7+ years of experience in systems engineering, infrastructure, or related roles.
- Strong expertise in Linux and Windows system administration in enterprise environments.
- Deep experience with server infrastructure, virtualization (VMware preferred), and cloud operations.
- Proven experience in infrastructure design and architecture.
- Strong understanding of networking fundamentals (routing, switching, firewalls, load balancing).
- Experience with monitoring and observability tools (e.g., Splunk, Prometheus, or similar).
- Experience with automation and scripting (e.g., Bash, Python, Ansible, Terraform) strongly preferred.
- Experience with cloud platforms (AWS, Azure, or GCP).