Act as a technical advisor to grow the manufacturing developer ecosystem across EMEA by accelerating the adoption of NVIDIA's AI and computing platforms. Design technical assets and collaborate cross-functionally to align developer feedback with product roadmaps.
168 Remote Job Openings at NVIDIA
Collaborate with internal and external teams to define system requirements for fault-tolerant quantum computing. Develop novel approaches for real-time quantum error correction and calibration across various qubit modalities.
Lead the planning, deployment, and validation of AI data center infrastructure, focusing on power, cooling, and networking. Ensure all physical infrastructure meets NVIDIA reference architectures and industry standards through rigorous quality assurance and continuous improvement.
Lead the planning, deployment, and validation of AI data center infrastructure, focusing on power, cooling, and networking systems. Ensure all physical infrastructure meets NVIDIA reference architectures and industry standards through rigorous auditing and quality assurance.
The role involves acting as a technical consultant for ISV developers to foster the adoption of NVIDIA's AI and computing platforms. You will collaborate cross-functionally to identify growth opportunities, guide partner onboarding, and influence product roadmaps based on field feedback.
Provide technical mentorship and guidance to customers on deploying Agentic AI workflows using NVIDIA's GenAI stack. Build POCs, develop technical collateral, and collaborate with engineering teams to integrate AI into hospitality and travel industry stacks.
Develop and optimize server management software and firmware for GPU and Grace solutions in large data center clusters. Collaborate with architects and cross-functional teams to implement robust manageability components and ensure system reliability.
Principal Simulation Engineer, Industrial Physics and Robotics
NVIDIA · Full Time · 2 days ago
Design and develop high-fidelity physically based simulation systems for robotics and industrial digital twins. Collaborate across teams to integrate advanced simulation methods into scalable GPU-accelerated computing environments.
Act as a technical specialist supporting OEM partners with switch ASIC integration and system-level composition. Serve as the primary link between OEM engineering teams and internal NVIDIA ASIC and SDK groups to ensure smooth deployment.
Develop new Deep Learning models for speech recognition, synthesis, and natural language processing. Responsibilities include designing large-scale training algorithms, mentoring interns, and publishing research in top conferences.
Research and develop techniques to accelerate high-performance computing applications in scientific computing, AI, and data analytics. Collaborate with internal teams and external experts to optimize parallel algorithms and influence next-generation hardware and software architectures.
Lead the Cyber Defense & Response team by managing the full incident response lifecycle and ensuring operational readiness. Drive the transformation of manual SOC tasks into automated AI pipelines while partnering with engineering teams to safeguard corporate and cloud environments.
Lead the K-12 AI Education Program to align educational goals with workforce readiness and industry innovation. Coordinate with a national network of nonprofit, civic, and public sector partners to scale AI literacy and program impact.
Manage electronic discovery processes and conduct forensic collections of digital evidence for litigation and regulatory matters. Collaborate with legal teams, IT, and external vendors to deploy eDiscovery tools and implement evidence management procedures.
Senior Solutions Architect, Cloud Infrastructure and DevOps - NVIS
NVIDIA · Full Time · 6 days ago
Maintain large-scale HPC/AI clusters and develop automation tooling for deployment, monitoring, and resource consumption. Collaborate with customers and internal teams to analyze and implement large-scale networking projects.
Senior Solutions Architect, Generative AI - AI Models and Systems at NVAITC
NVIDIA · Full Time · 6 days ago
Collaborate with university research labs to identify and execute high-impact Generative AI projects. Act as a strategic bridge between academic partners and NVIDIA's engineering teams to drive the adoption of NVIDIA software platforms.
Senior Solutions Architect, Physical AI and Robotics at NVAITC
NVIDIA · Full Time · 7 days ago
Collaborate with university PIs on high-impact Physical AI and Robotics research projects while championing the adoption of NVIDIA software platforms. Act as a strategic bridge between academic partners and NVIDIA's internal engineering teams to drive world-class research and institutional agreements.
Design and deploy Agentic AI applications to automate telecommunications network operations using generative models and RAG pipelines. Provide technical guidance to strategic partners and translate integration challenges into reference architectures for the NVIDIA accelerated computing stack.
Lead a team of software and production engineers to build and operate scalable GPU infrastructure for DGX Cloud. Drive execution across Kubernetes operability, automation, and incident response while coaching technical leaders.
Profile and optimize end-to-end neural reconstruction and Gaussian Splatting workflows to improve speed, scalability, and reliability. Translate Python and PyTorch bottlenecks into efficient CUDA/C++ implementations while ensuring reconstruction quality is preserved.
Build and deploy AI-powered tools and LLM-based systems to automate root cause analysis and predict service trends for GeForce NOW. Establish robust data management practices and pipelines to transform production data into actionable intelligence.
Lead the development of connectivity reference designs and physical builds for large-scale AI clusters and factories. Collaborate with power, cooling, and software teams to optimize rack layouts and ensure successful global deployment.
Lead the Infrastructure Engineering Program Management team to drive program design, execution, and delivery of dependent infrastructure systems. Establish a program management charter focused on accountability and outcomes while hiring and growing a high-performance team.
Lead cross-functional programs across engineering, product, and go-to-market teams to drive cloud and software platform roadmaps. Coordinate technical workstreams to ensure release readiness, dependency closure, and the transition of pilots into repeatable product offerings.
Design and implement scalable, high-performance software libraries and services for AI-driven developer and robotics workflows. Provide technical leadership by shaping architecture, defining project scope, and guiding execution from design to delivery.
The role focuses on hardening products, services, and the software development lifecycle by identifying risks early and driving pragmatic fixes. Responsibilities include running security reviews across code and cloud infrastructure while building automation tools to ensure secure-by-default practices.
Build and scale tooling, services, and workflows for GeForce NOW and the GDN pixel streaming AI product. Collaborate with cross-functional teams to standardize AI workflows and bring new products to life.
Act as a technical advisor to medical device customers to integrate AI and accelerated computing into next-generation healthcare products. Develop proof-of-concept demonstrations and optimize workloads using NVIDIA's hardware and software platforms.
Senior Solutions Architect, Infiniband and Networking Ethernet - NVIS
NVIDIA · Full Time · 9 days ago
Build and support large-scale AI/HPC infrastructure for customers, focusing on performance, reliability, and real-time monitoring. Collaborate with internal teams to refine services and implement large-scale networking projects.
Lead research in AI for quantum algorithm discovery and drive technical collaborations with supercomputing centers and QPU builders. Develop innovative quantum-classical applications and publish impactful research to drive NVIDIA's quantum product adoption.
Design, implement, and maintain large-scale HPC/AI clusters including monitoring, logging, and workload orchestration. Develop automation tooling for deployment and provide technical insights for system design and performance tuning.
The role involves acting as a trusted advisor to government clients to translate mission goals into AI-enabled plans and solutions. Responsibilities include developing proofs of concept, mentoring partners, and collaborating with technical teams for solution deployment.
The role involves acting as a trusted advisor to public sector clients to translate mission goals into AI-enabled plans and solutions. Responsibilities include developing AI demonstrations, mentoring partners, and collaborating with technical teams on field trials and deployments.
Develop and architect critical firmware for the GPU Out-of-Band Hub (OOBHUB) to manage peripherals and secure communications. Collaborate with hardware architects to define firmware-hardware interfaces and implement high-reliability update mechanisms.
Drive growth of the NVIDIA AI Enterprise platform by managing strategic alignment and enablement with Global Strategic OEM partners. Develop joint executable plans and programmatic approaches to scale AI solutions through GSIs, SDPs, and ISVs.
Quantum Error Correction Research Scientist Intern - Fall 2026
NVIDIA · Full Time · 13 days ago
Develop automated high-performance quantum error correction research pipelines for code discovery and decoder design. Use AI and machine learning to identify novel codes and improve decoding accuracy for realistic quantum systems.
Manage NVIDIA Interconnect products by ensuring flawless engineering implementation, maintenance, and yield management. Coordinate with manufacturing partners and cross-functional teams to resolve product failures and scale capabilities.
Lead the implementation of performance practices for large-scale GPU infrastructure and AI workloads. Collaborate with cross-functional teams to architect, debug, and optimize high-performance compute platforms.
Own the production systems that enable large, scalable GPU clusters for AI workloads, including asset provisioning and lifecycle management. Implement monitoring, health management, and incident response to ensure high reliability and performance.
Architect and optimize AI/ML pipelines for large biological foundation models using NVIDIA's accelerated computing platform. Act as a technical advisor to biopharma customers to integrate GPU-accelerated software and improve scientific discovery workflows.
Senior Solutions Architect, Simulations - Clinical Sciences and Autonomous Lab
NVIDIA · Full Time · 19 days ago
Drive innovation in healthcare and life sciences by designing and optimizing GPU-accelerated AI software for clinical sciences and autonomous labs. Partner with pharmaceutical companies to implement patient modeling, robotic systems, and biomedical agentic AI.
Drive enterprise business growth and revenue performance across the OEM ecosystem in Spain, Italy, and Portugal. Lead funnel development and implement strategic go-to-market programs in collaboration with internal NVIDIA teams and external partners.
Lead and grow an engineering team focused on developing secure, scalable runtime infrastructure for AI agents. Define technical strategy and transform research from academia and industry into production-ready OpenShell capabilities.
Lead the research, design, and implementation of security architectures for next-generation NVIDIA Networking products. Collaborate with cross-functional teams and external partners to develop hardware security primitives and trusted platforms.
Senior Technical Program Manager, Pre-Silicon Software Enablement and Workload Studies
NVIDIA · Full Time · 20 days ago
Drive NVIDIA's software left-shift program to ensure software teams have the necessary infrastructure to begin development early in the silicon lifecycle. Lead cross-functional alignment between architecture, modeling, and software teams to resolve dependencies and improve pre-silicon results.
Responsible for strategy creation, forecasting, and account management for Akamai and other select telecom accounts. The role involves building strategic partnerships and evangelizing NVIDIA's AI platform and technologies to drive revenue and market share growth.
Design and develop a massively distributed, scalable platform to identify and remediate non-performant GPU assets within DGX Cloud. Collaborate across NVIDIA teams to ensure production AI clusters maintain maximum performance and reliability.
Lead and mentor a distributed team of firmware engineers responsible for CPU bootloader firmware (SBIOS) for ARM-based data center CPUs. Partner with architecture teams to shape next-generation silicon and ensure high-quality production releases.
Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters. Triage and root-cause performance issues while building tools to visualize and analyze performance data.
Drive the adoption of NVIDIA Metropolis and Vision AI technologies by building strategic relationships with ISVs and enterprise leaders. Provide technical leadership to integrate Vision-Language Models and AI agents into real-world operational intelligence applications.
Act as the primary point of contact for Capital Markets organizations to drive the adoption of NVIDIA's AI-first computing platforms. Collaborate with internal experts and external partners to transform key accounts into strategic partners and grow revenue.
Lead the design, deployment, and optimization of large-scale networking infrastructure for US Federal Government customers. Act as a trusted technical advisor to align NVIDIA technology with customer needs and provide feedback to engineering teams.
Design and deploy large-scale AI infrastructure and networking platforms for Cloud Partners in Canada. Act as a technical advisor to optimize AI training and inference pipelines while collaborating with internal engineering teams.
Lead the design and deployment of large-scale GPU infrastructure for US Federal Government customers. Act as a trusted technical advisor to align NVIDIA technology with customer roadmaps and resolve complex cluster performance issues.
Design and implement automated test scripts in Python to verify software functionality for high-speed networking and security services. Manage end-to-end feature validation, from test planning to defect analysis and closure in Linux-based environments.
Senior Software Engineer, Distributed Systems Engineer - DGX Cloud
NVIDIA · Full Time · 24 days ago
Design and develop a massively distributed, scalable platform to identify and remediate non-performant GPU assets for DGX Cloud. Collaborate across NVIDIA teams to ensure production AI clusters maintain maximum performance and reliability.
Lead contract manufacturers in the Mexico region to ensure the highest quality of complex AI hardware systems. Drive continuous improvement through supplier assessments, audits, and the implementation of quality metrics and corrective actions.
The role involves collaborating with Cloud Partners to implement NVIDIA's hardware and software solutions for large-scale AI and HPC infrastructure. You will drive end-to-end technology integration and provide technical support throughout the customer lifecycle.
The role involves acting as a trusted advisor to government clients to translate mission goals into AI-enabled plans and solutions. Responsibilities include developing technical collateral, guiding field trials, and advocating for customer needs to shape product requirements.
Act as a technical advisor to biopharma companies to accelerate drug discovery using NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and guiding customers in implementing production-grade inference and training algorithms.
Senior Developer Relations Manager - Digital Biology Partnerships
NVIDIA · Full Time · a month ago
Serve as a technical advisor to the developer ecosystem in computer-aided drug discovery to drive the adoption of NVIDIA's AI and computing platforms. Collaborate with software partners to optimize applications, co-develop roadmaps, and integrate the NVIDIA stack into developer pipelines.
Define and guide the global security architecture for cloud and datacenter infrastructure to protect AI workloads. Lead the implementation of Zero Trust principles, network segmentation, and secure container environments while mentoring a team of architects.
The role involves managing strategy, forecasting, and relationships for a select group of insurance enterprise customers to drive revenue and market share. You will collaborate with internal architects and external partners to evangelize NVIDIA's accelerated computing platform.
Drive the adoption and expansion of NVIDIA AI Enterprise software by collaborating with ecosystem partners like ISVs and OEMs. Translate customer business challenges into AI workloads and scalable go-to-market plays to accelerate revenue growth.
Lead the architecture for cloud-networking, orchestration, and security solutions for DPUs and NICs. Design end-to-end system architectures from the application level down to the hardware.
Senior HPC and AI Networking Performance Research and Analysis Engineer
NVIDIA · Full Time · a month ago
Profile and analyze AI workloads on large-scale GPU and CPU clusters to optimize communication patterns and system performance. Develop performance analysis tools and collaborate across hardware and software teams to identify and resolve bottlenecks.
Design and implement security solutions across all layers, from high-level applications and OS to device firmware and networking products. Collaborate with cross-functional teams and external partners to identify threats and develop architectural security features.
Research and implement model architecture changes to improve high-fidelity video generation with a focus on human-centric quality and motion coherence. Translate research results into production-grade checkpoints and robust implementations for world foundation models.
Lead the design and implementation of large-scale Kubernetes clusters focusing on reliability, performance, and real-time monitoring. Manage the full service lifecycle from inception through deployment and maintain system health through automation and sustainable incident response.
Manage end-to-end production infrastructure supply chain from NPI to mass production delivery. Coordinate production capacity, risk assessments, and infrastructure maintenance across multiple production sites.
The Senior Product Engineer will manage board product lifecycles, including yield management, manufacturing process optimization, and failure resolution. They will collaborate with cross-functional teams to ensure high-quality product execution across global contract manufacturing sites.
The Senior Supplier Quality Engineer will lead factory quality onsite activities, including NPI and mass production, while ensuring compliance with NVIDIA quality standards. They will also facilitate root cause analysis, manage supplier performance metrics, and drive continuous improvement initiatives across the supply chain.
Act as the primary technical point of contact for enterprise customers, managing installations, maintenance, and complex technical issue resolution. Collaborate with internal engineering, marketing, and support teams to document standard methodologies and improve support processes.
The role involves collaborating with contracted electronic manufacturers to ensure quality standards are met through audits, process assessments, and quality control plans. You will lead root cause analysis and corrective action initiatives while working cross-functionally to resolve supplier-related quality issues.
The Solution Architect will drive the adoption of NVIDIA technology in telecommunications by developing high-value solutions and supporting technical trials. They will lead complex project deployments and communicate advancements through whitepapers, training sessions, and technical documentation.
You will develop, deploy, and validate AI factory environments by running and debugging complex AI/LLM workloads on GPU clusters. Additionally, you will build automation and observability tools to optimize performance, latency, and scalability for distributed training.
The engineer will support and maintain cloud network infrastructure by remediating critical alerts and triaging production-impacting incidents. They will also collaborate with cross-functional teams to drive operational improvements and manage large-scale IP network technologies.
You will lead technical engagement efforts with defense partners to integrate NVIDIA's accelerated computing stack into autonomous aerial platforms and uncrewed systems. This involves architecting perception and planning pipelines, providing reference designs, and guiding product roadmaps for edge AI technologies.
The architect will drive the deployment of end-to-end AI networking solutions and provide technical guidance to strategic customers. They will also capture customer requirements to influence product roadmaps and support on-site infrastructure bring-ups.
The Solutions Architect will serve as a subject matter expert in AI research and engineering, driving collaborations with academia and industry. They will also lead technical projects, mentor team members, and develop educational materials to foster AI ecosystem growth.
Lead and mentor technical teams while driving AI research and strategic collaborations with academia and industry. Develop and implement NVIDIA technology-related tutorials, workshops, and demos to foster accelerated AI adoption.
The Senior Competition Counsel will partner with management on competition matters and interface with European and global regulators. They will also collaborate with cross-functional teams to promote competition across NVIDIA's business operations.
Manager, Solutions Architecture - GPU and Networking Systems
NVIDIA · Full Time · a month ago
Lead a team of solutions architects and engineers to design, debug, and deploy large-scale GPU and AI networking solutions in customer data centers. Act as a senior technical advisor to strategic customers while collaborating with product and sales teams to influence the product roadmap.
The Solutions Architect will serve as a technical advisor to drive the design, integration, and deployment of large-scale AI and GPU infrastructure for strategic partners. They will collaborate with cross-functional teams to deliver technical content, conduct workshops, and ensure successful implementation of NVIDIA hardware and software solutions.
The role involves collaborating with customers to optimize AI workload performance and reduce infrastructure costs. You will lead proofs of concept for AI solutions and develop software for NVIDIA and open-source AI frameworks.
Senior Strategic Alliance Manager, AI for Chemistry and Material Science - HER
NVIDIA · Full Time · a month ago
You will identify and engage top researchers in AI for chemistry and material science to drive NVIDIA platform adoption. Additionally, you will define strategic partnerships and lead technical collaborations within the academic and research ecosystem.
You will act as the technical authority and advocate for the capital markets developer ecosystem, driving engagement and adoption of NVIDIA AI solutions. This involves collaborating with cross-functional teams to deliver technical enablement resources and influencing product roadmaps based on developer feedback.
Manage end-to-end production infrastructure supply chain from NPI to mass production delivery. Coordinate capacity management, risk assessment, and maintenance activities across multiple production sites.
You will serve as a technical advisor and champion for the developer ecosystem within the telecom industry to drive the adoption of NVIDIA technologies. This involves collaborating with internal teams and external partners to integrate NVIDIA's stack into developer products and pipelines while shaping technology roadmaps.
You will design and deploy sophisticated Agentic AI systems for top-tier retail and enterprise clients using NVIDIA's core technology stack. This role involves building reference architectures, optimizing inference performance, and enabling partner engineering teams through technical workshops and documentation.
You will design and implement full-stack AI factory infrastructure, including hardware architecture, workload orchestration, and performance tuning. You will also lead technical sales activities and collaborate directly with customers to optimize their AI inference and simulation workflows.
The Solutions Architect will engage with customers and partners to deliver high-value technical solutions leveraging NVIDIA's AI, HPC, and networking platforms. They will act as a trusted technical advisor, creating documentation and educational content while collaborating with internal teams to drive customer success.
You will design, architect, and implement complex Ethernet networking solutions for data centers while providing technical expertise to strategic customers. Additionally, you will develop network automation scripts and collaborate with internal teams to improve product strategy and deployment processes.
Analyze High Performance Computing (HPC) applications to identify performance characteristics and optimization opportunities. Provide technical guidance to compiler and application engineering teams to improve GPU acceleration and system performance.
Analyze High Performance Computing applications to identify performance characteristics and optimization opportunities. Assist customers with GPU acceleration and provide technical guidance to compiler and application engineering teams.
You will design and implement core JAX components to drive peak performance on NVIDIA products while collaborating with AI researchers. Additionally, you will build tools to improve the efficiency of teams developing AI-based systems and bridge the gap between research and real-world applications.
You will plan and establish processes, define test requirements, and optimize production lines to successfully launch new GPU boards for datacenter architectures. Additionally, you will collaborate with cross-functional teams and contract manufacturers to ensure cost and quality metrics are met while resolving yield and test problems.
The role involves building and scaling NVIDIA's HCLS startup ecosystem in the DACH region through strategic partnerships and developer engagement programs. You will act as the primary interface for regional startups and venture capital firms to accelerate the adoption of NVIDIA technologies.
Collaborate with customers to optimize AI workload performance and reduce infrastructure costs. Lead proofs of concept for AI solutions and develop software for NVIDIA and open-source AI frameworks.
The role involves guiding partners in adopting end-to-end Agentic AI solutions and collaborating with customers and partners to deploy AI solutions at scale. Solution Architects will also assist with demos, proofs of concept, and knowledge sharing.
The Project Technical Delivery Manager will oversee the complete project lifecycle from initiation to close, ensuring technical requirements are met within budget. They will facilitate technical architecture decisions, manage complex installations, and maintain effective relationships with stakeholders and customers.
The role involves leading safety engineering processes to ensure compliance with ISO standards for autonomous driving systems. You will act as the primary point of contact for safety concerns and collaborate with partners to resolve development issues.
You will serve as a forward-deployed technical liaison to deploy, manage, and validate large-scale AI Compute and HPC infrastructure for enterprise customers. This role involves collaborating with internal teams and partners to define project requirements, provide technical support, and perform knowledge transfers.
The role involves serving as a technical liaison to design, architect, and test large-scale Ethernet networking solutions for AI factories. You will collaborate with partners and internal teams to deploy, automate, and optimize network infrastructure for accelerated computing workloads.
The Senior Solutions Architect will serve as a technical liaison to support partners and customers in the planning, construction, and deployment of large-scale AI factories. This role involves reviewing infrastructure build plans, ensuring compliance with reference architectures, and providing technical mentorship to optimize performance and scalability.
You will serve as a technical SME to design developer tools, APIs, and workflows for chemistry and materials science. You will also collaborate across research and engineering teams to shape the NVIDIA ALCHEMI software stack and represent the company at scientific conferences.
The researcher will identify hardware vulnerabilities on SoC and GPU designs and develop advanced security tools and techniques. They will also guide the integration of security mitigations and conduct research into side-channel, fault, and physical attacks.
You will design and develop automated tests for networking switches and adapters while collaborating with hardware and software engineering teams. Additionally, you will support production lines, troubleshoot testing procedures, and drive projects from definition through to mass production.
The Senior Software Engineer will be responsible for adapting NVIDIA Drive software on development and production vehicles and triaging complex issues across safety-critical ADAS systems. They will collaborate with OEM and internal teams to root cause functional problems and support vehicle bring-up activities including validation and data logging.
You will deploy, lead, and maintain large-scale AI data center network stacks while providing technical recommendations to strategic customers. The role involves troubleshooting complex network issues and collaborating with internal teams to ensure successful implementation of AI infrastructure blueprints.
Design and implement innovations for managing GPU-based AI servers, focusing on OOB management and BMC firmware development. Collaborate with global teams, hardware engineers, and industry partners to deliver high-end enterprise server platforms.
The role involves designing novel benchmark tasks and evaluation methodologies to measure the effectiveness of agentic memory systems. You will also build synthetic dataset pipelines and partner with internal teams to integrate memory improvements into various applications.
Lead joint solution development with Global System Integrators to integrate NVIDIA AI technology into healthcare and life sciences offerings. Develop and execute strategic go-to-market plans to drive revenue growth and accelerate AI adoption across the sector.
Senior Software Engineer - Accelerated Kubernetes Runtime Team
NVIDIA · Full Time · 2 months ago
Design and implement automation systems to orchestrate the lifecycle of runtime components across thousands of Kubernetes clusters. Develop Kubernetes controllers, operators, and CRDs to manage the installation, upgrade, and validation of accelerated compute components.
Lead a distributed team in developing security-critical root-of-trust firmware for data center platforms. Drive modern engineering practices, including AI-assisted workflows, while ensuring high standards for security, reliability, and project execution.
Serve as a deep technical advisor to partners, enabling them to build enterprise Physical AI systems using NVIDIA's simulation and robotics stack. Define architectures, compute footprints, test plans, and rollout strategies for complex robotics and digital twin workflows.
Senior HPC Cluster Administrator - Deep Learning Frameworks Infrastructure
NVIDIA
·
Full Time
·
2 months ago
NVIDIA
You will own the full lifecycle of large-scale GPU compute clusters, including procurement, configuration, and reliability management. Additionally, you will lead infrastructure automation and collaborate with engineering teams to optimize performance for deep learning workloads.
The Client Director will drive business relationships with Global System Integrators in EMEA to promote AI-enabled services and GPU-accelerated computing. They will act as a bridge between partners and NVIDIA teams to execute joint go-to-market plans, manage PoCs, and support technical enablement.
You will be responsible for driving software revenue growth for NVIDIA's accelerated datacenter solutions across dedicated EMEA regions. This involves managing the full sales cycle, collaborating with internal teams and partners, and building deep relationships with key stakeholders.
Primary responsibilities include building and operating AI/HPC infrastructure for new and existing customers. The role involves supporting operational and reliability aspects of large-scale AI clusters.
Maintain large scale computational and AI infrastructure, focusing on monitoring, logging, and workload orchestration. Serve as a key technical resource, developing and documenting standard methodologies and operational guidelines.
Senior Systems Software Engineer, Data Center Infrastructure Management - EngOps
NVIDIA
·
Full Time
·
2 months ago
NVIDIA
The engineer will take ownership of daily cluster failures and issues, troubleshooting them promptly to maintain optimal cluster availability and performance. Responsibilities also include managing updates to site controller management nodes and overseeing the rollout and rollback of cluster software and firmware updates.
This role involves serving as a trusted technical advisor and champion for the EMEA AI Natives developer ecosystem, driving adoption of NVIDIA technologies by demonstrating groundbreaking solutions and accelerating critical workloads. The manager will also guide partners and startups through integration, track ecosystem growth, and collaborate cross-functionally to optimize adoption strategies.
The manager will serve as a trusted technical advisor, driving strategic engagements in high-impact AI domains like GenAI and scaling AI infrastructure, while guiding partners through NVIDIA integration to foster co-innovation. They will also map and expand the partner ecosystem and collaborate cross-functionally to optimize adoption strategies.
This role involves defining and driving the product vision, strategy, and roadmap for source control systems essential for large-scale chip and software development, focusing on performance, reliability, and scalability. The manager will also own the evolution of repository architecture, optimize developer workflows in partnership with engineering teams, and establish metrics to measure impact.
This role involves researching and developing techniques to optimize key Cloud and HPC CPU workloads specifically on NVIDIA's CPU, requiring in-depth analysis for current and future generations. Responsibilities also include engaging with the developer community, guiding framework developers, and contributing directly to their software stack or developing reference code.
This role involves serving as a trusted technical advisor and champion for the developer ecosystem within designated CSP partners, driving the adoption of NVIDIA technologies by accelerating critical workloads and demonstrating groundbreaking solutions. The manager will also advise on technical enablement resources, guide partners through integration, and represent partner technical needs internally to influence product roadmaps.
The manager will develop deep technical expertise across federal mission workloads, serving as a trusted advisor to accelerate adoption of NVIDIA software stacks like CUDA-X and NeMo into partner platforms. They will also analyze the federal developer ecosystem to shape product roadmaps and enable partners through technical workshops and reference architectures.
The role involves defining and implementing the chip pad ring and substrate interconnect scheme, leading the package layout design process, and collaborating with various engineering teams to define chip floor plans and ball outs for robust electrical packages. Responsibilities also include communicating effectively across different company teams.
The engineer will develop and implement CUDA Core Libraries in C++ and/or Python, focusing on parallel algorithms and idiomatic language bindings for core CUDA functionality. Responsibilities also include composing, optimizing, and evolving GPU algorithms and APIs, owning features end-to-end, and improving the overall developer experience.
The role involves guiding customers in adopting NVIDIA's technology stacks to deliver end-to-end GenAI and Agentic AI solutions, utilizing cloud native methodologies and accelerated compute to build modern AI factories. Responsibilities also include sharing knowledge via demos, proof-of-concepts, and writing technical content, while collaborating with engineering to solve complex problems.
The Senior Physical Design Engineer will own the full physical design flow including synthesis, floorplanning, place & route, and timing closure for the LPU chip design. This role involves cross-functional optimization with IP and design teams to drive PPA improvements and leading design closure for successful GDSII tapeout.
The Senior DFT Engineer will define and implement SCAN, MBIST, and JTAG debug structures, driving post-silicon testing plans and creating ATPG and MBIST test vectors. They will also build DFT timing constraints, partner with physical design teams, and work with the post-silicon team to bring up test patterns on actual silicon.
Primary responsibilities involve deploying, managing, and maintaining AI/HPC infrastructure in Linux-based environments for customers, acting as the domain expert during planning and implementation phases. This role also requires creating handover documentation, performing knowledge transfers, and providing feedback to internal teams regarding bugs and improvements.
This role is responsible for all aspects of demand creation, co-selling, forecasting, sales leadership, training, and education to end users and partners to grow revenue for NVIDIA platforms through joint solutions with Palantir across the US Federal Government. The strategist will serve as the key contact for the partnership within federal accounts, aligning go-to-market strategies and identifying new mission opportunities.
The engineer will ensure the functional correctness and completeness of next-generation chip designs by developing and implementing formal verification methodologies using advanced formal techniques to obtain bounded proofs. Key tasks include identifying verification behaviors, implementing testplans with assumptions and assertions, developing abstraction models, and debugging RTL failures.
This role is responsible for owning end-to-end inventory accuracy across consigned materials, SFG, FG, and bone-pile inventory at Contract Manufacturers, ensuring compliance with policies and leading B2B reconciliation between SAP and ERP systems. Key duties include managing EDI/B2B interfaces, governing bone-pile inventory, serving as an SAP key user, and driving digitalization using data and AI tools to improve accuracy and efficiency.
The role involves managing the business relationship with Global System Integrators (GSIs) in the EMEA region, focusing on evolving them into exemplary partners for providing AI-enabled services. Key duties include defining strategy with GSI leadership, devising enablement plans for their developers on GPU accelerated computing, and acting as a bridge for technical collaborations and joint go-to-market plans.
Senior Networking Solution Test Engineer - AI Cluster Debugging
NVIDIA
·
Full Time
·
3 months ago
NVIDIA
The role involves designing and reviewing test requirements for NVLink, Ethernet, and InfiniBand components within large-scale AI clusters, and building realistic customer-like testbeds incorporating heterogeneous hardware and complex network fabrics. Responsibilities also include owning end-to-end cluster troubleshooting, reproducing customer scenarios, triaging issues across the stack, and driving them to root cause resolution.
Primary duties involve deploying, managing, and maintaining the InfiniBand network within Linux-based environments for AI Factory infrastructure for both new and existing customers. This includes acting as the domain expert during planning and implementation, creating handover documentation, and performing knowledge transfers to support customers.
The role involves developing a highly optimized inference framework that runs on the world's largest supercomputers and data centers, focusing on performance and scalability in AI networking acceleration.
The Factory Planner is responsible for translating demand into executable factory plans by aligning materials, capacity, and manufacturing readiness across contract manufacturing partners and cross-functional teams. This involves driving best-likely build commits, monitoring execution against weekly plans, and collaborating with various teams to resolve constraints and maintain healthy inventory levels.
The role involves leading and cultivating relationships with strategic Venture Capital firms and their portfolio companies across the EMEA region to amplify NVIDIA's influence in the startup ecosystem. This includes enabling successful partnerships between portfolio companies and NVIDIA business units and planning executive engagement opportunities.
This role focuses on redefining AI hardware development methodology by inventing next-wave techniques, pioneering AI-driven automation for sophisticated ASIC conception, exploration, and closure. Responsibilities include taking a comprehensive view of the ASIC lifecycle to identify bottlenecks where automation and AI can improve convergence and turnaround time.
The engineer will take a comprehensive view of the ASIC development lifecycle to identify bottlenecks where automation and AI can improve predictability and turnaround time. They will also establish quantitative metrics to measure efficiency and serve as a technical catalyst by sharing best practices and mentoring engineers on emerging AI-enabled techniques.
The role involves collaborating with development teams to adopt new features and assisting customers with deploying, debugging, and improving the efficiency of AI workloads on extensive NVIDIA platforms. Responsibilities also include benchmarking features, analyzing performance, and solving cluster performance and stability issues directly with external customers.
This leader will guide a team focused on developing next-generation system designs and integrating new compute, networking, storage, and software systems for AI supercomputing at scale. Responsibilities include building platforms for software development, automation, performance engineering, and collaborating with partners on deployment and validation.
EMEA Sales Senior Account Manager, Smart Spaces and Local Government
NVIDIA
·
Full Time
·
4 months ago
NVIDIA
This role involves owning strategic relationships with leading cities and public-sector organizations to position NVIDIA's platforms and AI Factory strategy for modernizing public services and competitiveness across EMEA. Key activities include crafting revenue growth for Smart Cities solutions, building a robust pipeline focused on AI deployment, and developing long-term relationships with senior city leaders.
The Senior Power System Engineer will build and develop power delivery systems for data center and super compute platforms, focusing on high-current power systems. They will collaborate with cross-functional teams to ensure power integrity and optimize power converter performance.
As a Senior Formal Verification Engineer, you will verify ASICs using formal verification tools and define the verification scope to ensure correctness. You will collaborate with various teams to resolve design issues and improve verification methodologies.
As a key member of the ASIC Verification team, you will verify the design and implementation of the inference accelerator. You will collaborate with architects, designers, and verification teams to ensure the correctness of the design.
As a key member of the Design team, you will implement, document and deliver high performance, area and power efficient RTL. You will collaborate with various teams to analyze architectural trade-offs and deliver fully verified designs.
As a Senior ASIC Power Engineer, you will handle power-related activities including ASIC energy evaluation and power architecture. You will also support power analysis efforts and work on power verification for NVIDIA's products.
As a Senior Formal Verification Engineer, you will verify ASICs using formal verification tools and define the verification scope to ensure correctness. You will collaborate with various teams to improve methodologies and deliver high-quality results on schedule.
Analyze Deep Learning models and investigate TensorRT stability and performance issues. Work with an internationally distributed team for CUDA and TensorRT development.
The Solutions Architect will partner with internal teams to drive the adoption of NVIDIA technology within AdTech and Media ecosystems, acting as a trusted technical advisor for customers on hardware and software use.
Lead architecture for cloud-networking and security solutions while designing state-of-the-art system architecture for DPUs & NICs technologies. Collaborate with global teams to innovate and develop proof of concept prototypes into full-fledged products.
The engineer will join the Developer Tools team to work on software like Nsight Systems, collaborating across multiple teams and hardware platforms ranging from embedded systems to large multi-GPU servers. Responsibilities include participating in research, benchmarking, driving productization activities, and engaging in all phases of the software life cycle.
The role involves working on software like Nsight Systems, interacting with diverse hardware platforms from embedded systems to large servers, and acting as a communicator between the Nsight Systems team, chip design teams, and the metrics library team. Responsibilities include understanding user performance goals to influence future chip design and participating in all phases of the software life cycle.
Responsible for NVIDIA board team product management, including product yield management and ensuring product engineering implementation. Supervise NVIDIA board products' quality and yield goals, and take corrective actions if needed.
The Solutions Architect will work with customers on data center GPU server and networking infrastructure deployments, guiding discussions on network topologies and supporting server/network/cluster deployments. They will also identify new project opportunities and build custom product demonstrations for solutions addressing critical business needs.
Collaborate with internal and external teams to define system requirements for fault-tolerant quantum computing. Develop novel approaches to quantum error correction and calibration supported by rigorous systems analysis.
The Data Center Deployment Specialist will deploy and support NVIDIA products, guide contractors, ensure accurate cable installations, and conduct quality testing. They will also document processes and maintain event logs while participating in project calls.
Contribute to the development of CUDA Quantum by building core infrastructure for inter-device communication and efficient execution across multiple processors. Partner with architects and product managers to create an extensible toolchain that integrates components specific to each quantum architecture.
Develop automation for deploying Kubernetes clusters for streaming media use cases and monitor and manage these clusters. Collaborate with other NVIDIA R&D teams globally in a fast-paced environment.
The Senior System Validation Engineer will be responsible for testing, verification, and validation of both hardware and software, maintaining product quality, and executing various tests. Additionally, the role involves troubleshooting and optimizing testing procedures while supporting production matters.
Drive strategic technical teamwork with leading Agentic AI companies and advocate for NVIDIA's technologies. Collaborate with partners to identify product gaps and influence the future roadmap.
Senior Systems Software Security Engineer - Data Center Systems
NVIDIA
·
Full Time
·
7 months ago
NVIDIA
You will focus on securing NVIDIA's Data Center Systems by delivering necessary security features and engaging with teams to drive implementation. Your role will involve designing and developing optimized security solutions following industry standards.