Amazon Elastic Inference

  • Allows attaching low-cost GPU-powered inference acceleration to EC2 instances, SageMaker instances, or ECS tasks.
  • Reduce machine learning inference costs by up to 75%.

Common use cases

  • Computer vision
  • Natural language processing
  • Speech recognition
IT Certification Category (English)728x90


  • Accelerator
    • A GPU-powered hardware device provisioned.
    • It is not a part of the hardware where your instance is hosted.
    • Uses AWS PrivateLink endpoint service to attach to the instance over the network.
  • Only a single endpoint service is required in every Availability Zone to connect Elastic Inference accelerator to instances.


  • Supports TensorFlow, Apache MXNet, PyTorch, and ONNX models.
  • Can provide 1 to 32 trillion floating-point operations per second (TFLOPS) per accelerator.
  • The accelerator attached to each instance in an auto-scaling group scales accordingly to your application’s compute demand.


  • You are charged for the accelerator hours you consume.


New Year Sale – Upgrade Your Skills and Get a Chance to Win FREE Courses

NEW Course – AWS Certified Data Analytics Specialty Practice Exams

AWS Certified Data Analytics Sepcialty

Pass your AWS and Azure Certifications with the Tutorials Dojo Portal

Tutorials Dojo portal

Our Bestselling AWS Certified Solutions Architect Associate Practice Exams

AWS Certified Solutions Architect Associate Practice Exams

Enroll Now – Our AWS Practice Exams with 95% Passing Rate

AWS Practice Exams Tutorials Dojo

Enroll Now – Our Azure Certification Exam Reviewers

azure reviewers tutorials dojo

Tutorials Dojo Study Guide and Cheat Sheets eBooks

Tutorials Dojo Study Guide and Cheat Sheets-2

FREE Intro to Cloud Computing for Beginners

FREE AWS Practice Test Samplers

Browse Other Courses

Generic Category (English)300x250

Recent Posts