Error when training: "invalid literal for int() with base 10: ' '" (pria_lifechem, CLOSED)

gitter-lab commented on July 19, 2024

Error when training: "invalid literal for int() with base 10: ' '"


Comments (7)

chao1224 commented on July 19, 2024

Hi @xuzhang5788, the process_num in sklearn_randomforest.py specifies the hyperparameter id; these lines select the hyperparameter set for the given process id.

You can find all the possible hyperparameters for random forest in this json file, which includes 3*3*4*3 = 108 hyperparameter combinations in all. In other words, process_num can range from 0 to 107.
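To make the index-to-combination mapping concrete, here is a minimal sketch. The parameter names and values are placeholders, not the contents of sklearn_randomforest.json; only the grid sizes (3, 3, 4, 3) come from this thread. The ordering produced by itertools.product is also an assumption and may not match what sklearn_randomforest.py does internally.

```python
from itertools import product

# Hypothetical grid: names and values are placeholders, NOT the real JSON
# contents. Only the sizes 3 * 3 * 4 * 3 = 108 follow the thread.
grid = {
    "param_a": [0, 1, 2],
    "param_b": [0, 1, 2],
    "param_c": [0, 1, 2, 3],
    "param_d": [0, 1, 2],
}


def select_combination(grid, process_num):
    """Return the process_num-th combination of the grid as a dict."""
    keys = list(grid)
    combos = list(product(*(grid[k] for k in keys)))
    if not 0 <= process_num < len(combos):
        raise ValueError(
            f"process_num must be in [0, {len(combos) - 1}], got {process_num}"
        )
    return dict(zip(keys, combos[process_num]))


# Total number of combinations in the grid.
n_combinations = len(list(product(*grid.values())))
print(n_combinations)  # 108
print(select_combination(grid, 12))
```

Any integer outside 0 to 107 is rejected here with an explicit message, which is easier to diagnose than a failed int() conversion.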


xuzhang5788 commented on July 19, 2024

Thank you for your fast response. I still don't get it. Do I need to modify this command to run it?

python sklearn_randomforest.py --config_json_file=../../json/sklearn_randomforest.json --model_dir=$results_and_model_directory --dataset_dir=$path_to_dataset --process_num=$process --stage=0


chao1224 commented on July 19, 2024

Can you post the complete bash script? I'm not sure what "$process" is set to here. (If you set it to a value from 0 to 107, it should work.)


xuzhang5788 commented on July 19, 2024

The bash script is from your README file:
https://github.com/gitter-lab/pria_lifechem/tree/master/pria_lifechem/models


chao1224 commented on July 19, 2024

You need to set the variables in your bash environment before running the script:

export process=0
export path_to_dataset=/dir/to/the/dataset
export results_and_model_directory=/dir/to/store/results

python sklearn_randomforest.py \
--config_json_file=../../json/sklearn_randomforest.json \
--model_dir=$results_and_model_directory \
--dataset_dir=$path_to_dataset \
--process_num=$process \
--stage=1
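For context on the error message itself: if $process is unset or blank in the shell, the script receives a blank value for --process_num, and Python's int() cannot convert it. The sketch below reproduces the message and shows one way to fail with a clearer hint. parse_process_num is a hypothetical helper, not something in pria_lifechem, and the default of 108 combinations is taken from this thread.

```python
def parse_process_num(raw, n_combinations=108):
    """Hypothetical helper: convert the raw --process_num string to a valid index."""
    text = raw.strip()
    if not text:
        # A blank value usually means the shell variable was never exported.
        raise ValueError("process_num is empty; was $process exported in the shell?")
    value = int(text)
    if not 0 <= value < n_combinations:
        raise ValueError(
            f"process_num must be in [0, {n_combinations - 1}], got {value}"
        )
    return value


# Reproduce the unhelpful failure from the issue title.
message = None
try:
    int(" ")
except ValueError as exc:
    message = str(exc)
print(message)  # invalid literal for int() with base 10: ' '

print(parse_process_num("12"))  # 12
```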


Malnammi commented on July 19, 2024

Just to clarify: there are a total of 108 hyperparameter combinations for the RandomForest in sklearn_randomforest.json. The script sklearn_randomforest.py selects one of these using the process_num argument, so you need to set process_num to the index of the hyperparameter combination you want.

For example, one of the 8 RF hyperparameter combinations that were promoted to the CV stage was index 12. So, to run for index 12:

python sklearn_randomforest.py \
--config_json_file=../../json/sklearn_randomforest.json \
--model_dir=./folder_to_store_results/ \
--dataset_dir=./path_to_csv_files/ \
--process_num=12 \
--stage=1

Set model_dir to the folder where you'd like to store the results. Set dataset_dir to the folder that contains the [file_0.csv, file_1.csv, ..., file_4.csv] data files. Set stage to 0 for the hyperparameter stage, 1 for the CV stage, and 2 for the Prospective stage; the stage determines which test data is used for evaluation.
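If it helps, a quick pre-flight check along these lines can catch a wrong dataset_dir before launching a run. This is only a sketch: missing_data_files and STAGES are hypothetical names, and it assumes exactly five files named file_0.csv through file_4.csv, as described above.

```python
import os

# Stage ids as described in this thread; the labels are for readability only.
STAGES = {0: "hyperparameter", 1: "CV", 2: "prospective"}


def missing_data_files(dataset_dir, n_files=5):
    """Return the expected file_<i>.csv names that are absent from dataset_dir."""
    expected = [f"file_{i}.csv" for i in range(n_files)]
    return [
        name
        for name in expected
        if not os.path.isfile(os.path.join(dataset_dir, name))
    ]


if __name__ == "__main__":
    # Placeholder path taken from the example command above.
    missing = missing_data_files("./path_to_csv_files/")
    if missing:
        print("Missing data files:", ", ".join(missing))
    else:
        print("All data files found; stage 1 is the", STAGES[1], "stage")
```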


xuzhang5788 commented on July 19, 2024

@chao1224 @Malnammi
Thank you so much.
