Comments (5)
Hi, thias42.
We’re going to look into this soon. Please double-check that you’ve updated all of the Neuron SDK and are running the latest components. Instructions for updating Neuron are available here: https://github.com/aws/aws-neuron-sdk/blob/master/docs/neuron-install-guide.md
from aws-neuron-sdk.
Yes, the Neuron SDK and tools are up-to-date.
Thanks thias42. We have reproduced your issue and are investigating it.
Currently, the compiler cannot compile certain configurations because of tensor size limitations. We are aware of these limitations and are working to remove them. For more information, please see the release notes at https://github.com/aws/aws-neuron-sdk/tree/master/release-notes. For now, if you change the input representation (input_repr) to "mel128" and use the compile option "no_fuse_ops=['melspectrogram/mul']", 99 of the 151 inference-optimized operations will be placed on the Neuron runtime:
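    import keras  # assumption: the Keras module your openl3 version builds on
    import openl3
    import tensorflow as tf
    import tensorflow.neuron as tfn

    model = openl3.models.load_audio_embedding_model(input_repr="mel128", content_type="music", embedding_size=512)
    tf.saved_model.simple_save(
        session=keras.backend.get_session(),
        export_dir=model_dir,  # model_dir and compiled_model_dir are paths you choose
        inputs={'input': model.inputs[0]},
        outputs={'output': model.outputs[0]})
    tfn.saved_model.compile(model_dir, compiled_model_dir, no_fuse_ops=["melspectrogram/mul"])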
You will see that the framework partitions the graph into three subgraphs, but only one of them compiles successfully (the portion after melspectrogram/mul):
INFO:tensorflow:fusing subgraph neuron_op_d4374713206b859f with neuron-cc
INFO:tensorflow:fusing subgraph neuron_op_2fdb7316ce04901d with neuron-cc
INFO:tensorflow:fusing subgraph neuron_op_bc8fe16a32d9e22a with neuron-cc
WARNING:tensorflow:Failed to fuse subgraph neuron_op_2fdb7316ce04901d with '/home/ubuntu/test_venv/bin/neuron-cc compile /tmp/tmp63uejh3m/neuron_op_2fdb7316ce04901d/graph_def.pb --framework TENSORFLOW --pipeline compile SaveTemps --output /tmp/tmp63uejh3m/neuron_op_2fdb7316ce04901d/graph_def.neff --io-config "{\"inputs\": {\"melspectrogram/transpose_20/_1:0\": [[1, 1, 199, 1025], \"float32\"], \"melspectrogram/unstack0/_2:0\": [[], \"int32\"]}, \"outputs\": [\"melspectrogram/Log:0\"]}"'
WARNING:tensorflow:Failed to fuse subgraph neuron_op_bc8fe16a32d9e22a with '/home/ubuntu/test_venv/bin/neuron-cc compile /tmp/tmp63uejh3m/neuron_op_bc8fe16a32d9e22a/graph_def.pb --framework TENSORFLOW --pipeline compile SaveTemps --output /tmp/tmp63uejh3m/neuron_op_bc8fe16a32d9e22a/graph_def.neff --io-config "{\"inputs\": {\"input_10/_3:0\": [[1, 1, 48000], \"float32\"]}, \"outputs\": [\"melspectrogram/transpose_2:0\", \"melspectrogram/Shape:0\"]}"'
INFO:tensorflow:Number of operations in TensorFlow session: 790
INFO:tensorflow:Number of operations after tf.neuron optimizations: 151
INFO:tensorflow:Number of operations placed on Neuron runtime: 99
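If you want to check these numbers on your own runs, a log like the one above can be summarized with a few lines of standard-library Python. This is only a sketch: the function name summarize_compile_log is hypothetical, the long neuron-cc command lines are abbreviated to '...', and the regular expressions assume the log messages are worded exactly as shown here.

```python
import re

# Log excerpt copied from above; the long neuron-cc command lines are
# abbreviated to '...' because only the subgraph names matter here.
LOG = """\
INFO:tensorflow:fusing subgraph neuron_op_d4374713206b859f with neuron-cc
INFO:tensorflow:fusing subgraph neuron_op_2fdb7316ce04901d with neuron-cc
INFO:tensorflow:fusing subgraph neuron_op_bc8fe16a32d9e22a with neuron-cc
WARNING:tensorflow:Failed to fuse subgraph neuron_op_2fdb7316ce04901d with '...'
WARNING:tensorflow:Failed to fuse subgraph neuron_op_bc8fe16a32d9e22a with '...'
INFO:tensorflow:Number of operations in TensorFlow session: 790
INFO:tensorflow:Number of operations after tf.neuron optimizations: 151
INFO:tensorflow:Number of operations placed on Neuron runtime: 99
"""


def summarize_compile_log(log_text):
    """Hypothetical helper: list compiled/failed subgraphs and the Neuron share."""
    attempted = re.findall(
        r"INFO:tensorflow:fusing subgraph (\S+) with neuron-cc", log_text)
    failed = re.findall(
        r"WARNING:tensorflow:Failed to fuse subgraph (\S+)", log_text)
    optimized = int(
        re.search(r"after tf\.neuron optimizations: (\d+)", log_text).group(1))
    on_neuron = int(
        re.search(r"placed on Neuron runtime: (\d+)", log_text).group(1))
    return {
        "attempted": attempted,
        "failed": failed,
        # A subgraph counts as compiled if no failure warning names it.
        "compiled": [name for name in attempted if name not in failed],
        "neuron_fraction": on_neuron / optimized,
    }


summary = summarize_compile_log(LOG)
print("compiled:", summary["compiled"])
print("failed:", summary["failed"])
print(f"{summary['neuron_fraction']:.1%} of optimized operations run on Neuron")
```

For the log above, this reports one compiled subgraph, two failed ones, and roughly two thirds (99 of 151) of the optimized operations placed on the Neuron runtime.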
Please let us know if you need further assistance. You can also file an AWS support ticket or contact us directly at [email protected].
Thank you! I'll stick to mel128 for now. Looking forward to a version without the tensor size limitations.