GitHub - LeeeeeLy/Clustered-ImageNet64-with-path-fixer: Decode and store ImageNet64 images in PNG format labeled with the ImageNet synset IDs. Cluster images into user-picked clustered labels to tailor to training requirements.

Preparing ImageNet64 Dataset with Clustered Label

Download ImageNet64 from ImageNet Download Page.
Extract the *.zip files to unveil the dataset batches, including train_data_batch_1 to train_data_batch_10 and val_data.

Organize the extracted data into:

Install the required dependencies to ensure the scripts run smoothly:

pip install -r requirements.txt

Extracting Images: Run extractimages.py to decode and store images in PNG format, categorizing them into directories named after ImageNet synset IDs.
Data Clustering: Execute clustereddata.py to organize images based on clustered labels. For instance, all cat images (n02124075, n02123394, n02123159, n02123597, n02123045, n02127052) are clustered, enhancing dataset manageability for training purposes. The script also balances the dataset by equalizing the number of images across clusters.

For comprehensive label mappings and insights into data clustering, refer to the following resources:

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
clustereddata.py		clustereddata.py
extractimages.py		extractimages.py
labelmapping.txt		labelmapping.txt
requirements.txt		requirements.txt