SantaClaraRecruiter Since 2001
the smart solution for Santa Clara jobs

AI/ML Systems - Principal Software Engineer

Company: Entrada Ventures
Location: Santa Clara
Posted on: September 15, 2023

Job Description:

THE ROLE: SENIOR/STAFF/PRINCIPAL SW ENGINEER (SYSTEMS)About usIf you are following the evolution of the leading approach in deep learning powered AI, the renaissance in NLP as well as the next disruption in computer vision, you likely know it's all about Transformer based models.. They are powering neural nets with billions to trillions of parameters and existing silicon architectures (including the plethora of AI accelerators) are struggling to varying degrees to keep up with exploding model sizes and their performance requirements. More importantly, TCO considerations for running these models at scale are becoming a bottleneck to meet exploding demand. Hyperscalers are keen on how to gain COGS efficiencies with the trillions of AI inferences/day they are already serving, but certainly for addressing the steep demand ramp they are anticipating in the next couple of years. d-Matrix is addressing this problem head on by developing a fully digital in memory computing accelerator for AI inference that is highly optimized for the computational patterns in Transformers. The fully digital approach removes some of the difficulties of analog techniques that are most often touted in pretty much all other in-memory computing AI inference products. d-Matrix's AI inference accelerator has also been architected as a chiplet, thereby enabling both a scale-up and scale-out solution with flexible packaging options. The d-Matrix team has a stellar track record in developing and commercializing silicon at scale as senior execs at the likes of Inphi, Broadcom, and Intel. Notably, they recognized early the extremely important role of programmability and the software stack and are thoughtfully building up the team in this area even since before their Series A. The company has raised $44m in funding so far and has 70+ employees across Silicon Valley, Sydney and Bengaluru.Why d-MatrixWe want to build a company and a culture that sustains the tests of time. We offer the candidate a very unique opportunity to express themselves and become a future leader in an industry that will have a huge influence globally. We are striving to build a culture of transparency, inclusiveness and intellectual honesty while ensuring all our team members are always learning and having fun on the journey. We have built the industry's first highly programmable in-memory computing architecture that applies to a broad class of applications from cloud to edge. The candidate will get to work on a path breaking architecture with a highly experienced team that knows what it takes to build a successful business.The Role: Principal SW Engineer (Systems)The role requires you to be part of the team that helps productize the SW stack for our AI compute engine. As part of the Software team, you will be responsible for the development, enhancement, and maintenance of the next-generation AI Deployment software. -You have had past experience working across all aspects of the full stack tool chain and understand the nuances of what it takes to optimize and trade-off various aspects of hardware-software co-design. You are able to build and scale software deliverables in a tight development window. - You will work with a team of compiler experts to build out the compiler infrastructure working closely with other software (ML, Systems) and hardware (mixed signal, DSP, CPU) experts in the company.Qualifications - -Minimum: -

  • Computer Science, Engineering, Math, Physics or related degree -
  • Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals
  • Proficient in C/C++ and Python development in Linux environment and using standard development tools -
  • Experience with deep learning frameworks (such as PyTorch, Tensorflow) -
  • Experience with deep learning runtimes (such as ONNX Runtime, TensorRT,---) -
  • Experience with inference servers/model serving frameworks (such as Triton, TFServ, KubeFlow,---)
  • Experience with distributed systems collectives such as NCCL, OpenMPI,...
  • Experience deploying ML workloads on distributed systems, in a multitenancy environment
  • Experience with MLOps from definition to deployment including training, quantization, sparsity, model preprocessing, and deployment -
  • Experience training, tuning and deploying ML models for CV (ResNet,..), NLP (BERT, GPT), and/or Recommendation -Systems (DLRM) -
  • Self-motivated team player with a strong sense of ownership and leadership -Desired: -
    • MS or PhD in Computer Science, Electrical Engineering, or related fields
    • Prior startup, small team or incubation experience -
    • Work experience at a cloud provider or AI compute / sub-system company -
    • Experience implementing SIMD algorithms on vector processors -
    • Experience with open source ML compiler frameworks such as MLIR -Location -Silicon Valley preferred, but open to other locations within the US/Canada -

Keywords: Entrada Ventures, Santa Clara , AI/ML Systems - Principal Software Engineer, IT / Software / Systems , Santa Clara, California

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest California jobs by following @recnetCA on Twitter!

Santa Clara RSS job feeds