GithubHelp home page GithubHelp logo

Comments (5)

mrwyattii avatar mrwyattii commented on June 23, 2024

@aponte411 currently we expect the user to set resource group and subscription from the Azure-CLI like so:
az account set --subscription "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"

I agree that exposing more options and expanding the AML deployment capabilities would be nice. Let me know if you have some time to help test/debug/expand these capabilities!

from deepspeed-mii.

buswrecker avatar buswrecker commented on June 23, 2024

Hi, i resolved this - i had an issue with torch (had to pip uninstall nvidia_cublas_cu11) and also i wasn't on a GPU VM. Managed to build the folder with deploy.sh and deploying to a managed endpoint now

@aponte411 - did you make any progress? i'm getting the same error in Jupyter notebook.

deepspeed==0.8.2
deepspeed-mii==0.05+unknown
python==3.8.0
Ubuntu==20.04.1

from deepspeed-mii.

TahaBinhuraib avatar TahaBinhuraib commented on June 23, 2024

@buswrecker I can run deepspeed mii from the gpu vm but I still can't deploy, I get the same error:
subprocess.CalledProcessError: Command '['az', 'ml', 'workspace', 'show', '--query', 'container_registry']' returned non-zero exit status 2.

from deepspeed-mii.

jakemanger avatar jakemanger commented on June 23, 2024

I also could not get this working after following instructions in the readme. The only way I could use aml is after I overrode the get_acr_name() function to return my acr name instead of calling the az cli command. Is there a way to set a default --name argument for this command so this can be fixed and it returns the correct acr name?

The command I'm talking about is:

["az",
  "ml",
  "workspace",
   "show",
   "--query",
   "container_registry"],

Note, I also tried putting --name myworkspacename in as an argument and it just returned ------

from deepspeed-mii.

ShuntaIto avatar ShuntaIto commented on June 23, 2024

I'm facing same problem, on GPU VM.

Maybe, adding "shell=True" will resolve this problem?

acr_name = subprocess.check_output(
            ["az",
             "ml",
             "workspace",
             "show",
             "--query",
             "container_registry"],
            text=True, shell=True)

from deepspeed-mii.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.