GithubHelp home page GithubHelp logo

shivamb25 / aws-neuron-samples Goto Github PK

View Code? Open in Web Editor NEW

This project forked from aws-neuron/aws-neuron-samples

0.0 0.0 0.0 8.84 MB

Example code for AWS Neuron SDK developers building inference and training applications

License: Other

Shell 0.55% C++ 0.10% Python 6.15% Jupyter Notebook 93.19% Dockerfile 0.01%

aws-neuron-samples's Introduction

AWS Neuron Samples

This repository contains samples for AWS Neuron, the software development kit (SDK) that enables machine learning (ML) inference and training workloads on the AWS ML accelerator chips Inferentia and Trainium.

The samples in this repository provide an indication of the types of deep learning models that can be used with Trainium and Inferentia, but do not represent an exhaustive list of supported models. If you have additional model samples that you would like to contribute to this repository, please submit a pull request following the repository's contribution guidelines.

Samples are organized by use case (training, inference) and deep learning framework (PyTorch, TensorFlow) below:

Training

Framework Description Instance Type
PyTorch NeuronX (torch-neuronx) Sample training scripts for training various PyTorch models on AWS Trainium Trn1, Trn1n & Inf2
Usage Description Instance Type
Nemo Megatron for Neuron A library that enables large-scale distributed training of language models such as Llama and is adapted from Nemo Megatron. Trn1, Trn1n
AWS Neuron samples for ParallelCluster How to use AWS ParallelCluster to build HPC compute cluster that uses trn1 compute nodes to run your distributed ML training job. Trn1, Trn1n
AWS Neuron samples for EKS The samples in this repository demonstrate the types of patterns that can be used to deliver inference and distributed training on EKS using Inferentia and Trainium. Trn1, Trn1n
AWS Neuron samples for SageMaker SageMaker Samples using ml.trn1 instances for machine learning (ML) training workloads on the AWS ML accelerator chips Trainium. Trn1, Trn1n

Inference

Framework Description Instance Type
PyTorch NeuronX (torch-neuronx) Sample Jupyter notebooks demonstrating model compilation and inference for various PyTorch models on AWS Inferentia2 and Trainium Inf2 & Trn1
PyTorch NeuronX (transformers-neuronx) Sample Jupyter Notebooks demonstrating tensor parallel inference for various PyTorch large language models (LLMs) on AWS Inferentia2 and Trainium Inf2 & Trn1
PyTorch Neuron (torch-neuron) Sample Jupyter notebooks demonstrating model compilation and inference for various PyTorch models on AWS Inferentia Inf1
TensorFlow Neuron (tensorflow-neuron) Sample Jupyter notebooks demonstrating model compilation and inference for various TensorFlow models on AWS Inferentia Inf1
Usage Description Instance Type
AWS Neuron samples for SageMaker SageMaker Samples using ml.inf2 and ml.trn1 instances for machine learning (ML) inference workloads on the AWS ML accelerator chips Inferentia2 and Trainium. Inf2 & Trn1

Getting Help

If you encounter issues with any of the samples in this repository, please open an issue via the GitHub Issues feature.

Contributing

Please refer to the CONTRIBUTING document for details on contributing additional samples to this repository.

Release Notes

Please refer to the Change Log.

Known Issues

Model Framework Training/Inference Instance Type Status
Fairseq PyTorch Inference Inf1 RuntimeError: No operations were successfully partitioned and compiled to neuron for this model - aborting trace!
Yolof PyTorch Inference Inf1 RuntimeError: No operations were successfully partitioned and compiled to neuron for this model - aborting trace!

aws-neuron-samples's People

Contributors

aws-maens avatar jeffhataws avatar 5cp avatar aws-mesharma avatar natemail-aws avatar amithrm avatar rgrandhiamzn avatar aws-rhsoln avatar awswinder avatar aws-anantsh avatar aws-rxgupta avatar awsjoshir avatar hannanjgaws avatar samir-souza avatar shebbur-aws avatar awshtokoyo avatar eshalakhotia avatar amazon-auto avatar hgt312 avatar kct22aws avatar aws-sadaf avatar samskalicky avatar shivamb25 avatar yangfei1990 avatar aws-yishanm avatar yudisun0201 avatar aws-donc avatar aws-rishyraj avatar jyang-aws avatar micwade-aws avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.