dtypes for the scoring function #174

ryanma9629 · 2023-06-26T07:59:57Z

When the Python scoring function is generated for MM, the dtype of the input DataFrame defaults to object, as shown below:
input_array = pd.DataFrame([[LOAN, MORTDUE, VALUE, REASON, JOB, YOJ, DEROG, DELINQ, CLAGE, NINQ, CLNO, DEBTINC]], columns=["LOAN", "MORTDUE", "VALUE", "REASON", "JOB", "YOJ", "DEROG", "DELINQ", "CLAGE", "NINQ", "CLNO", "DEBTINC"], dtype=object)
However, classifiers such as LightGBM don't accept object dtypes, so scoring a LightGBM model in MM can fail with an error like this:
ValueError: DataFrame.dtypes for data must be int, float or bool. Did not expect the data types in the following fields: LOAN, MORTDUE, VALUE, REASON, JOB, YOJ, DEROG, DELINQ, CLAGE, NINQ, CLNO, DEBTINC
I don't know whether it is safe to set the dtype to float or None for all columns when generating the scoring function.
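To make the options concrete, here is a minimal sketch comparing the three dtype choices on a shortened version of the input row. The values are made up, and treating REASON and JOB as string columns is an assumption about the data, not something the generated code states.

```python
import pandas as pd

# Shortened, hypothetical row: two numeric inputs and two string inputs.
columns = ["LOAN", "MORTDUE", "REASON", "JOB"]
row = [[1100, 25860.0, "HomeImp", "Other"]]

# Current behaviour: every column is forced to object.
as_object = pd.DataFrame(row, columns=columns, dtype=object)

# dtype omitted (None): pandas infers a dtype per column.
inferred = pd.DataFrame(row, columns=columns)

# Casting only the columns assumed to be numeric avoids touching the strings.
numeric_cols = ["LOAN", "MORTDUE"]
typed = as_object.astype({col: float for col in numeric_cols})

print(as_object.dtypes)
print(inferred.dtypes)
print(typed.dtypes)
```

Forcing dtype=float on the whole frame looks unsafe whenever a string column such as REASON is present, because those values cannot be cast. Leaving dtype unset or casting only the numeric columns keeps the numeric inputs usable, while the string columns would still need whatever encoding the model was trained with.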


smlindauer · 2023-07-03T12:27:58Z

@ryanma9629:
I was running into a deprecation error from pandas when setting all the values to float, but it may be better to just let pandas infer the types. My worry was that MM can't accept NumPy values, but we can check for that in the output of the prediction function.

I will run through some other model types to see how they handle not setting the dtype in the input_array.
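A rough sketch of what that output check could look like, assuming the goal is to hand plain Python values back from the scoring function. The helper name to_native and the sample values are hypothetical and are not part of the generated score code.

```python
import numpy as np


def to_native(value):
    """Convert NumPy scalars and arrays to plain Python values.

    Anything that is not a NumPy type is returned unchanged.
    """
    if isinstance(value, np.generic):
        # NumPy scalar (np.int64, np.float32, ...) -> int, float, bool, ...
        return value.item()
    if isinstance(value, np.ndarray):
        # NumPy array -> (nested) list of plain Python values
        return value.tolist()
    return value


# Hypothetical prediction outputs
label = np.int64(1)
probabilities = np.array([0.12, 0.88])

print(type(to_native(label)))
print(to_native(probabilities))
```

With a check like this on the return values, the dtype of input_array could be left to pandas without NumPy types leaking into the output.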

ryanma9629 · 2023-07-07T02:10:07Z

Is that due to the deprecation of np.int and np.float since NumPy version 1.20?
https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations

ryanma9629 added the bug label Jun 26, 2023

smlindauer self-assigned this Jul 3, 2023
