Converting the loaded model using to_neuron() method takes a long time. Is there any w

Any solution to save the converted model? (transformers-neuronx, open issue)

aws-neuron commented on June 30, 2024

Any solution to save the converted model?

from transformers-neuronx.

Comments (3)

hannanjgaws commented on June 30, 2024

Hi @aliseyfi:

We are working on adding serialization support for all models in an upcoming release. We will update this ticket when serialization support is available.

jimburtoft commented on June 30, 2024

Hey @aliseyfi, does model.save work for you?

Example code from https://huggingface.co/aws-neuron/Mistral-neuron:

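from transformers_neuronx.mistral.model import MistralForSampling

# neuron_config is assumed to be a NeuronConfig defined earlier, as on the linked model card
model_neuron = MistralForSampling.from_pretrained(
    'mistralai/Mistral-7B-Instruct-v0.1-split', batch_size=1,
    tp_degree=2, n_positions=256, amp='bf16', neuron_config=neuron_config)
model_neuron.to_neuron()  # compile the model for Neuron

# Save the compiled NEFF files out to the same directory
model_neuron.save("mistralai/Mistral-7B-Instruct-v0.1-split")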
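
To avoid paying the to_neuron() compile cost on every start, the save step above can be paired with a reuse check. The following is only a minimal sketch, not the library's own mechanism: `compile_or_reuse` is a hypothetical helper, and it assumes the model object exposes `to_neuron()`, `save(dir)`, and a matching `load(dir)` that is called before `to_neuron()`. The `load` call is an assumption not shown in this thread, so check it against the transformers-neuronx version you have installed.

```python
from pathlib import Path


def compile_or_reuse(model, artifact_dir):
    """Compile `model` for Neuron, reusing saved artifacts when they exist.

    Hypothetical helper. Assumes `model` exposes to_neuron(), save(dir) and
    load(dir); load() is an assumption and is not shown in this thread.
    Returns True if saved artifacts were reused, False if freshly compiled.
    """
    artifact_dir = Path(artifact_dir)
    # Treat a non-empty directory as "artifacts were saved on a previous run".
    reused = artifact_dir.is_dir() and any(artifact_dir.iterdir())
    if reused:
        # Point the model at the saved artifacts before calling to_neuron().
        model.load(str(artifact_dir))
    model.to_neuron()
    if not reused:
        # First run: persist the compiled artifacts for later runs.
        artifact_dir.mkdir(parents=True, exist_ok=True)
        model.save(str(artifact_dir))
    return reused
```

With the snippet above, this would be called as `compile_or_reuse(model_neuron, "./neuron_artifacts")` in place of the separate `to_neuron()` and `save()` calls. The directory name is a placeholder, and a dedicated artifact directory is used here so the emptiness check is not confused by the model weights.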

aliseyfi commented on June 30, 2024

Sorry, I don't work on that project anymore. Thanks for the update though.
