Amazon debuts AWS Inf1, an AI inference instance
December 3, 2019
Amazon Web Services today debuted Inf1, an instance for AI inference that CEO Andy Jassy calls the lowest-cost inference offering available in the cloud.
“…it will have lower latency, it will have 3 times higher throughput, and up to 40% lower cost per instance compared to our G4 instance which is based on an Nvidia chip which previously was the lowest cost inference…