Weak Dataset Sampling - Possible Discrepancy Between Implementation and Arxiv Paper? #31

nmichlo · 2020-10-20T11:09:28Z

I have been examining the implementation of generating the weakly supervised dataset from the paper Weakly-Supervised Disentanglement Without Compromises, however I think there is a discrepancy between the function simple_dynamics in disentanglement_lib/methods/weak/train_weak_lib and what is described in Section 5 from version 3 the Arxiv paper.

Examined Function

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 41 to 57 in 86a644d

    
           def simple_dynamics(z, ground_truth_data, random_state, 
        
                               return_index=False, k=gin.REQUIRED): 
        
             """Create the pairs.""" 
        
             if k == -1: 
        
               k_observed = random_state.randint(1, ground_truth_data.num_factors) 
        
             else: 
        
               k_observed = k 
        
             index_list = random_state.choice( 
        
                 z.shape[1], random_state.choice([1, k_observed]), replace=False) 
        
             idx = -1 
        
             for index in index_list: 
        
               z[:, index] = np.random.choice( 
        
                   range(ground_truth_data.factors_num_values[index])) 
        
               idx = index 
        
             if return_index: 
        
               return z, idx 
        
             return z, k_observed

Excerpt & Corresponding Code

The following is an excerpt from the Experimental Setup subsection from Section 5 of the paper Weakly-Supervised Disentanglement Without Compromises, split into sections that I assume should correspond to the above code:

To create data sets with weak supervision from the existing disentanglement data sets, we first sample from the discrete z according to the ground-truth generative model (1)–(2).

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 41 to 42 in 86a644d

    
           def simple_dynamics(z, ground_truth_data, random_state, 
        
                               return_index=False, k=gin.REQUIRED):

Then, we sample k factors of variation that should not be shared by the two images...

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 44 to 49 in 86a644d

    
           if k == -1: 
        
             k_observed = random_state.randint(1, ground_truth_data.num_factors) 
        
           else: 
        
             k_observed = k 
        
           index_list = random_state.choice( 
        
               z.shape[1], random_state.choice([1, k_observed]), replace=False)

... and re-sample those coordinates to obtain z˜. This ensures that each image pair differs in at most k factors of variation.

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 50 to 54 in 86a644d

    
           idx = -1 
        
           for index in index_list: 
        
             z[:, index] = np.random.choice( 
        
                 range(ground_truth_data.factors_num_values[index])) 
        
             idx = index

For k we consider the range from 1 to d − 1. This last setting corresponds to the case where all but one factor of variation are re-sampled. We study both the case where k is constant across all pairs in the data set and where k is sampled uniformly in the range [d − 1] for every training pair (k = Rnd in the following). Unless specified otherwise, we aggregate the results for all values of k.

Problem With 2?

From the above excerpt there seems to be a problem with line:

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 48 to 49 in 86a644d

    
           index_list = random_state.choice( 
        
               z.shape[1], random_state.choice([1, k_observed]), replace=False)

In particular the expression random_state.choice([1, k_observed]). Instead of keeping k fixed half of the time k will be set to 1.

I may be misunderstanding things from the excerpt, but to me this seems odd that this is happening.

Fix?

Based on this, should lines 48 and 49 not be the following?

index_list = random_state.choice(z.shape[1], k_observed, replace=False)

Problem With 4?

Based on the following excerpt it seems as though factors in the sampled pairs should always differ.

...We study both the case where k is constant across all pairs in the data set...

disentanglement_lib/disentanglement_lib/methods/weak/train_weak_lib.py

Lines 52 to 53 in 86a644d

    
           z[:, index] = np.random.choice( 
        
               range(ground_truth_data.factors_num_values[index]))

However, based on lines 52-53 this is not the case. There is a chance for the re-sampled factor to be the same. It is not guaranteed to be different.

This probability of being the same will only increase if the ground truth dimensionality/size of that factor is small.

Fix?

Sampling with the original value for the particular differing z removed from the range.

Untested possible code for 1 input factor:

choices = set(range(ground_truth_data.factors_num_values[index])) - {z[0, index]}
z[0, index] = np.random.choice(choices)

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weak Dataset Sampling - Possible Discrepancy Between Implementation and Arxiv Paper? #31

Weak Dataset Sampling - Possible Discrepancy Between Implementation and Arxiv Paper? #31

nmichlo commented Oct 20, 2020 •

edited

Loading

Weak Dataset Sampling - Possible Discrepancy Between Implementation and Arxiv Paper? #31

Weak Dataset Sampling - Possible Discrepancy Between Implementation and Arxiv Paper? #31

Comments

nmichlo commented Oct 20, 2020 • edited Loading

Examined Function

Excerpt & Corresponding Code

Problem With 2?

Fix?

Problem With 4?

Fix?

nmichlo commented Oct 20, 2020 •

edited

Loading