Design, develop, and maintain monitoring and telemetry systems to ensure the reliability of a large-scale GPU cloud network. Create automated collectors, dashboards, and alerting pipelines to proactively detect and resolve network anomalies.
CoreWeave Europe
3 Remote Job Openings at CoreWeave Europe
Design, deploy, and maintain SOAR capabilities to automate security responses and reduce human intervention. Develop integrations and leverage AI tooling to enrich security event context and decision-making.
Provide high-touch technical support for Kubernetes-powered HPC cloud infrastructure and AI workloads. Mentor team members, lead technical escalations, and develop knowledge-sharing resources to improve team efficiency.