AWS Inferentia

Creator

Creator

Seonglae Cho

Created

Created

2020 Dec 4 8:16

Editor

Editor

Seonglae Cho

Edited

Edited

2023 Dec 7 9:42

Refs

Refs

AWS가 사용자 맞춤 설계한, 고성능 기계 학습 추론 칩

AWS Inferentia의 워크로드 성능을 최적화하는 컴파일러, 런타임 및 프로파일링 도구로 구성된 AWS Neuron 소프트웨어 개발 키트(SDK)는 Tensorflow, PyTorch, MXNet 등의 유명 프레임워크에서 생성 및 훈련되고 AWS Inferentia 기반 Amazon EC2 Inf1을 사용하여 실행

Inferentia2

Deploy Llama 2 7B on AWS inferentia2 with Amazon SageMaker

In this blog post, you will learn how to compile and deploy Llama 2 7B on AWS Inferentia2 with Amazon SageMaker.

https://www.philschmid.de/inferentia2-llama-7b#1-convert-llama-2-to-aws-neuron-inferentia2-with-optimum-neuron

Deploy Llama 2 7B on AWS inferentia2 with Amazon SageMaker

How AWS Can Undercut Nvidia With Homegrown AI Compute Engines - The Next Platform

Amazon Web Services may not be the first of the hyperscalers and cloud builders to create its own custom compute engines, but it has been hot on the heels

How AWS Can Undercut Nvidia With Homegrown AI Compute Engines - The Next Platform

https://www.nextplatform.com/2023/12/04/how-aws-can-undercut-nvidia-with-homegrown-ai-compute-engines/

How AWS Can Undercut Nvidia With Homegrown AI Compute Engines - The Next Platform

AWS Inferentia - Amazon Web Services (AWS)

AWS가 사용자 맞춤 설계한, 고성능 기계 학습 추론 칩 AWS의 비전은 딥 러닝을 상용화하고 저렴한 종량 요금제 사용 모델로 최첨단 인프라를 사용할 수 있게 하는 것입니다. AWS Inferentia는 딥 러닝 워크로드를 가속화하도록 설계된 Amazon 최초의 맞춤형 칩으로, 이 비전을 실현하기 위한 장기적인 전략의 일환으로 개발되었습니다.

https://aws.amazon.com/ko/machine-learning/inferentia/

AWS Inferentia - Amazon Web Services (AWS)

Recommendations

///////