Systems Engineer, AI Infrastructure

About the role

We’re rekindling efforts to leverage acceleration and machine learning at Cloudflare in 2023, and the HW systems team is looking to hire senior technical engineers with a strong understanding of deep learning, ML end-to-end workloads and frameworks, and experience with hardware accelerator and processor architectures (GPUs, CPUs w/AI features, ASICs). This is an exciting opportunity with real world impact, where you’ll get to build and deploy hardware accelerators on our data center platform. 

Responsibilities 

  • Evaluate, design and deploy cutting edge acceleration solutions for our growing services
  • Lead the design of scalable AI infrastructure for Cloudflare’s own internal machine learning platform and tune for optimal performance
  • Collaborate with product/data science teams to identify customer use-cases and translate workloads to technical design and hardware requirements (architecture definition, hardware selection, performance tuning)
  • Engage with hardware vendors to identify hardware solutions that best fit needs of the platform
  • Set the strategy and long term roadmap to demonstrate the value proposition of AI/ML workloads for the edge 
  • Engage with AI leaders across the industry and influence design based on open industry standards 

Qualifications

  • Masters or equivalent experience in Computer Architecture, Computer Science, Electrical Engineering or related field with 12 years of relevant experience or equivalent
  • Has demonstrated technical leadership on critical company wide projects with experience in computer architecture (GPU, CPU w/AI. acceleration) and ML software ecosystem. Experience with programming models a plus
  • Strong technical foundation and deep understanding of cloud technologies, DL/ML workloads in the industry, frameworks (Tensorflow/Pytorch), and containers tools. 
  • Ability to work in a constantly changing ambiguous environment and bridget the software/hardware divide 
  • Industry-wide impact. Proactively creates formal networks involving coordination with internal and external technical leaders and has tangible proof points (patents, papers, conference contributions, open source SW or HW contributions, and/or sitting on a standards committee or board, etc.) demonstrating industry-wide influence as an influential spokesperson for the organization
  • Must be collaborative and has demonstrated ability to work effectively across cross functional teams, sound technical judgement and is capable of building positive working relationships
  • Seeks to mentor team members, offers technical advice  and seeks to continuously learn