Please mention DailyRemote when applying
Employment Type: Full-Time
Work Setting: Onsite/Remote
Work Location: New York/San Francisco/Remote
Work Hours: Office Hours
Find out more here: https://semianalysis.com
About SemiAnalysis
SemiAnalysis is an independent research and analysis firm specializing in the Semiconductor and AI industries. Our in-depth coverage spans the entire supply chain, from semiconductor fabrication processes to cutting-edge AI Models, software, and infrastructure. We are recognized as the leading authority on the semiconductor supply chain, with the highest concentration of industry experts within one team, and a deep-rooted passion for delving into the intricacies.
We’re a global team of over 50 analysts, each with extensive networks across the semiconductor supply chain and AI ecosystem, publishing industry shaping articles while participating in 40+ conferences annually.
Our newsletter reaches more than 200,000 subscribers worldwide, including senior management and c-suite leaders at the leading semiconductor and AI companies.
We also offer three core products:
Industry Models – we develop and publish industry models on accelerator shipments, datacentre demand and supply, GPU total cost of ownership, and more. We work with hyperscalers, neoclouds, many of the world’s largest hedge funds, and government agencies.
Core Research – our public equity markets product, geared towards financial investors, distils our deep technical research and knowledge into key insights on technology and product trends.
Consulting and Technical Due Diligence – We conduct custom research and project work to guide key strategic and investment decisions for the largest private equity funds, leading venture capital firms, companies across the AI ecosystem, and government agencies.
Position Overview
We are seeking a Technical Consultant to join our team working on ClusterMAX™, the industry standard GPU Cloud rating system. We are hiring at all experience levels with competitive compensation.
Responsibilities
Lead ClusterMAX™ consulting engagements, including technical due diligence projects related to neoclouds, AI accelerators, AI infrastructure, AI labs, and adjacent ecosystems.
Translate ClusterMAX™ benchmarking and testing insights into actionable recommendations and investment decisions for clients.
Contribute to the development of next-generation benchmarking methodologies, TCO analysis frameworks, and future ClusterMAX™ research initiatives.
Collaborate with executives, engineers, and technical teams across major neocloud providers, including Amazon Web Services, Microsoft Azure, Google Cloud, Oracle, CoreWeave, Nebius, Crusoe, Lambda, and Together.
Build and maintain relationships with AI accelerator manufacturers, OEMs, and ecosystem partners, including NVIDIA, AMD, Intel, Google, Amazon, Cerebras, Groq, Dell Technologies, Hewlett Packard Enterprise, Lenovo, and Cisco.
Strengthen relationships with AI labs, investors, startups, and technical communities to better understand industry requirements and operational challenges.
Author detailed technical research reports evaluating architecture design, benchmark performance, reliability, scalability, and operational usability of neocloud and AI infrastructure providers.
Stay informed on emerging trends and technologies through participation in major industry conferences such as NeurIPS, MLSys, NVIDIA GTC, OCP, SC, and Hot Chips.
Requirements
Strong understanding of ML frameworks such as PyTorch and JAX.
Familiarity with GPU and TPU cluster environments running orchestration platforms such as Kubernetes or Slurm.
Understanding of distributed storage technologies including Weka, VAST, Lustre, and S3-based storage systems.
Knowledge of high-performance networking technologies such as InfiniBand and RoCEv2.
Understanding of ML systems benchmarking and performance testing tools, including GEMMs, nccl-tests, vLLM, sglang, fio, TorchTitan, Megatron, and related frameworks.
Experience working at a hyperscaler, neocloud provider, server OEM, AI accelerator company, or large-scale AI infrastructure environment is preferred.
Ability to work proactively and independently within a globally distributed team environment.
Strong analytical, technical communication, and problem-solving capabilities.
Growth Areas
Develop deep expertise in AI infrastructure, neocloud ecosystems, accelerators, and large-scale ML system benchmarking.
Gain exposure to technical due diligence and investment decision-making processes across frontier AI and semiconductor markets.
Build relationships with leading hyperscalers, AI labs, infrastructure startups, accelerator vendors, and institutional investors.
Contribute to industry-recognized benchmarking methodologies and technical research publications.
Expand technical knowledge across distributed systems, networking, storage, AI infrastructure, and performance optimization.
Work directly with globally recognized companies shaping the future of AI compute and infrastructure.
Increase visibility within the AI and semiconductor ecosystem through conferences, technical collaborations, and published research.
Stop the endless job search. Our AI finds and applies to the best jobs for you.
Discover remote opportunities in Others
Answer easy questions
200,000+ jobs across 15+ categories
Get your best job matches
Only hand-screened, legit jobs
Find a remote job faster
No ads, scams, or junk
“ I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!