I'm trying to train model as per guideline of cloud annotation cli.
Following steps has been done
But whenever i started the training it is failing.
I have my company account as well as personal account too.
On the both account training get failed with following error. (Error extracted from IBM Cloud storage bucket)
"status": {
"state": "error",
"finished_at": "2019-07-23T06:31:31.903Z",
"submitted_at": "2019-07-23T06:26:57.293Z",
"error": {
"trace": "",
"errors": [{
"code": "dl_job_failed (S100)",
"message": "Internal error (S100)",
"more_info": "http://watson-ml-api.mybluemix.net/"
}]
},
"message": "training-a072AJHZg: INSUFFICIENT_RESOURCES",
"metrics": [],
"current_at": "2019-07-23T06:31:32.784Z",
"error_cause": "system"
}
Training with training/test data at:
DATA_DIR: /mnt/data/prod.bucket
MODEL_DIR: /job/model-code
TRAINING_JOB:
TRAINING_COMMAND: cd "$(dirname "$(find . -name "start.sh" -maxdepth 2 | head -1)")" && ./start.sh 1000
Storing trained model at:
RESULT_DIR: /mnt/results/prod.bucket.train/training-Sw9aGBNZR
Wed Jul 24 06:58:31 UTC 2019: Running Tensorflow job
/usr/local/bin/train.sh: line 38: ./start.sh: Permission denied
Training exited with error code 126
Failed: learner_exit_code: 126
I did tried searching error in IBM docs but there is no luck and not getting any details about it.
Please respond as we are using IBM platform for production object detection purpose.