Job Qualification
You are graduating with a MS, or Ph.D. degree in Computer Science, Computer Engineering, Applied Math, or related field.
You can work independently, define project goals and scope, and lead your own development effort.
Strong C++/ C or Python programming and software design skills, including debugging, performance analysis, and test design.
Strong expertise on one of these directions:
deep learning quantization
NN compiler (TVM, MLIR, GLOW, LLVM) experience
NN model pruning
Cadence DSP programming
Ways to stand out from the crowd:
Numerical methods experience
Knowledge of computer architecture
Python experience
Experience with Deep Learning Frameworks such as TensorFlow, PyTorch, and MXNet
Job Description
Job description:
We are looking for an experienced AI/ML engineer passionate about deploying AI inference and end to end enablement, AI framework integration, model accuracy, and performance analysis and tuning. You might be an ideal candidate if you are seeking to develop high quality, innovative, and scalable software that enables state of the art AI inference models to run efficiently and accurately on the BST Intelligence processors.
Responsibilities:
Contributing to deep learning infrastructure, data pipelines, tools and workflows that lay the foundation for building AI at scale.
Writing software to deploy AI models and pipelines in real time applications (inference).
Apply low precision inference, quantization, and compression of DNNs.
Continuously improve inference latency, accuracy and power consumption of DNNs.
Stay up to date with the latest research and innovations in deep learning, implement and experiment with new ideas.
Job Location: 2290 N 1st. St. STE100 San Jose, CA 95131
























