Software for AI hardware
Embedded Machine Learning for Dummies from ARM
Podcast: OctoML: Automated Deep Learning Engineering with Jason Knight and Luis Ceze (Software Engineering Daily)
At test time the network outputs one score per class (typically 1000 classes for ImageNet). Top-5 accuracy checks whether the correct class label is among the 5 highest-scoring classes. It is a laxer measure than top-1 accuracy, which checks whether the single highest-scoring class is the correct label. I think the ImageNet competition pages explain this in more detail.
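As a minimal sketch of the top-1 versus top-5 distinction, the snippet below computes both on a toy example. The function names (`top_k_correct`, `top_k_accuracy`) and the 6-class scores are made up for illustration; they are not taken from ImageNet or from any particular library.

```python
from typing import Sequence


def top_k_correct(scores: Sequence[float], label: int, k: int = 5) -> bool:
    """Return True if `label` is among the k highest-scoring class indices."""
    # Rank class indices by score, highest first.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return label in ranked[:k]


def top_k_accuracy(
    batch_scores: Sequence[Sequence[float]], labels: Sequence[int], k: int = 5
) -> float:
    """Fraction of samples whose true label is in the top-k predictions."""
    hits = sum(top_k_correct(s, y, k) for s, y in zip(batch_scores, labels))
    return hits / len(labels)


# Toy data (hypothetical): 6 classes instead of ImageNet's 1000.
scores = [
    [0.05, 0.70, 0.05, 0.05, 0.10, 0.05],  # true label 1 has the highest score
    [0.10, 0.05, 0.40, 0.20, 0.15, 0.10],  # true label 3 has the 2nd highest score
]
labels = [1, 3]

top1 = top_k_accuracy(scores, labels, k=1)  # only the first sample counts
top5 = top_k_accuracy(scores, labels, k=5)  # both samples count
print(f"top-1: {top1:.2f}, top-5: {top5:.2f}")
```

The second sample is a miss under top-1 but a hit under top-5, which is why top-5 numbers are always at least as high as top-1 numbers for the same model.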
Google Research 2021
Learned Hardware/Software Co-Design of Neural Accelerators, Zhan Shi, Chirag Sakhuja
https://tvm.apache.org/
An End to End Machine Learning Compiler Framework for CPUs, GPUs and accelerators
Cloudflare
Cloudflare's Intelligent Design, Byrne Hobart
https://stratechery.com/2021/cloudflare-on-the-edge/
https://blog.cloudflare.com/cloudflare-workers-serverless-week/