stop converting characters to factors in `pkAnalysesAsDataFrame()`? #673

IndrajeetPatil · 2021-12-04T04:33:41Z

Parameters, units, etc. are not categorical data, and treating them as such makes it difficult to work with them further.

If you agree, the calls to readr::col_factor() here can be replaced with readr::col_character():

OSPSuite-R/R/utilities-pk-analysis.R

Lines 74 to 80 in 624745e

    
           colTypes <- list( 
        
             IndividualId = readr::col_integer(), 
        
             QuantityPath = readr::col_factor(), 
        
             Parameter = readr::col_factor(), 
        
             Value = readr::col_double(), 
        
             Unit = readr::col_factor() 
        
           )

The only downside to this is that character/string columns will take up more space than factors, which are integers in disguise, so the returned dataframes will be slightly heavier.

The text was updated successfully, but these errors were encountered:

msevestre · 2021-12-04T14:31:41Z

Is this a breaking change?

msevestre · 2021-12-04T14:32:30Z

I agree with this change. The only reason we have it as factor is because....euh....wait... no idea

IndrajeetPatil · 2021-12-13T14:47:29Z

Is this a breaking change?

I have no clue.

@PavelBal Do you have any sense about this? Do you find yourself using this function?

PavelBal · 2021-12-13T15:07:58Z

All pk analyses workflow is not straightforward to me, so go ahead.

msevestre · 2021-12-14T14:18:29Z

All pk analyses workflow is not straightforward to me

What does this mean? If something is not clear, it should be mentioned and discussed don't you think?

As far as breaking change is concerned, my question is simple, independently from how this is being used:
If someone was relying on the dataframe from v10 with factors to do some stuff, and in v11 it is not factor anymore, will it break the code or not? If yes, we need to mark this as a breaking change in our release note

@IndrajeetPatil Can you create a label: Breaking change so that we can tag those changes that we are adding in v11 (for instance I would not export all the validatexxx again)

IndrajeetPatil · 2021-12-15T11:30:55Z

If someone was relying on the dataframe from v10 with factors to do some stuff, and in v11 it is not factor anymore, will it break the code or not?

This is hard to predict because of the coercion rules, but since, in most contexts, R will convert factor to character and vice versa, I don't think it will be a breaking change. But I will make a note of it as such, just to be safe.

x <- factor(c("z", "y", "x"), levels = c("x", "y", "z"))
as.character(x)
#> [1] "z" "y" "x"

y <- c("z", "y", "x")
as.factor(y)
#> [1] z y x
#> Levels: x y z

^{Created on 2021-12-15 by the reprex package (v2.0.1)}

Can you create a label: Breaking change

Done.

IndrajeetPatil added the Discussion 🦜 label Dec 13, 2021

IndrajeetPatil added a commit that referenced this issue Dec 13, 2021

closes #673

4a6793e

msevestre closed this as completed in 2ecc5bc Dec 15, 2021

IndrajeetPatil mentioned this issue Dec 15, 2021

don't re-export validate* functions #703

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stop converting characters to factors in `pkAnalysesAsDataFrame()`? #673

stop converting characters to factors in `pkAnalysesAsDataFrame()`? #673

IndrajeetPatil commented Dec 4, 2021

msevestre commented Dec 4, 2021

msevestre commented Dec 4, 2021

IndrajeetPatil commented Dec 13, 2021

PavelBal commented Dec 13, 2021

msevestre commented Dec 14, 2021

IndrajeetPatil commented Dec 15, 2021

stop converting characters to factors in pkAnalysesAsDataFrame()? #673

stop converting characters to factors in pkAnalysesAsDataFrame()? #673

Comments

IndrajeetPatil commented Dec 4, 2021

msevestre commented Dec 4, 2021

msevestre commented Dec 4, 2021

IndrajeetPatil commented Dec 13, 2021

PavelBal commented Dec 13, 2021

msevestre commented Dec 14, 2021

IndrajeetPatil commented Dec 15, 2021

stop converting characters to factors in `pkAnalysesAsDataFrame()`? #673

stop converting characters to factors in `pkAnalysesAsDataFrame()`? #673