sklearn_logclassification_globalrw

Jul 19, 2023

792896a · Jul 19, 2023

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
generate_configs.py		generate_configs.py

README.md

Dataset Setup

Currently, for Logistic Regression Global Reweighing we support following datasets:

Adult
Compas
German

Running Scikitlearn Logistic Classifier in FL with Global Reweighing with Differential Privacy

This example explains how to run federated learning on a Logistic Classifier, implemented with Scikit-Learn with Global Reweighing with Differential Privacy. Global Reweighing with Differential Privacy is presented in Mitigating Bias in Federated Learning.

The following preprocessing was performed in AdultSklearnDataHandler on the original dataset:

Drop following features: workclass, fnlwgt, education, marital-status, occupation, relationship, capital-gain, capital-loss, hours-per-week, native-country
Map race, sex and class values to 0/1
Split age and education columns into multiple columns based on value

Further details in documentation of preprocess() in AdultSklearnDataHandler.

The following preprocessing was performed in CompasSklearnDataHandler on the original dataset:

Map sex values to 0/1 based on underprivileged/privileged groups
Filter out rows with values outside of specific ranges for days_b_screening_arrest, is_recid, c_charge_degree, score_text, race
Quantify length_of_stay from c_jail_out and c_jail_in
Quanitfy priors-count, length_age_cat, score_text, age_cat
Drop following features: id, name, first, last, compas_screening_date, dob, age, juv_fel_count, decile_score, juv_misd_count, juv_other_count, priors_count, days_b_screening_arrest, c_jail_in, c_jail_out, c_case_number, c_offense_date, c_arrest_date, c_days_from_compas, c_charge_desc, is_recid, r_case_number, r_charge_degree, r_days_from_arrest, r_offense_date, r_charge_desc, r_jail_in, r_jail_out, violent_recid, is_violent_recid, vr_case_number, vr_charge_degree, vr_offense_date, vr_charge_desc, type_of_assessment, decile_score.1, screening_date, v_type_of_assessment, v_decile_score, v_score_text, v_screening_date, in_custody, out_custody, priors_count.1, start, end, event
Split age-cat, priors_count and c_charge_degree columns into multiple columns based on value

Further details in documentation of preprocess() in CompasSklearnDataHandler.

The following preprocessing was performed in 'GermanSklearnDataHandler` on the original dataset:

Drop following features: checking-status, duration-months, purpose, credit-amount, installment-rate, other-debtors, residence, property, installment-plans,housing, credits, job, number-liable, telephone, foreign
Map age, sex and class values to 0/1
- '>= 25': 1, '<25': 0
- 'A91': 1, 'A93': 1, 'A94: 1, 'A92': 0, 'A95': 0
Split credit-history, savings and employment columns into multiple columns based on value

Further details in documentation of preprocess() in GermanSklearnDataHandler.

The value of epsilon is, at default, set to 1, and can be modified via the data handler. No other preprocessing is performed.

Set FL environment by running:

./setup_environment.sh <env_name>  # setup environment

Activate a new FL environment by running:

conda activate <env_name>          # activate environment

Split data by running:

python examples/generate_data.py -n <num_parties> -d <dataset_name> -pp <points_per_party>

Generate config files by running:

python examples/generate_configs.py -n <num_parties> -f sklearn_logclassification_rw -d <dataset_name> -p <path>

In a terminal running an activated FL environment, start the aggregator by running:
```
python -m ibmfl.aggregator.aggregator <agg_config>
```
Type START and press enter to start accepting connections
In a terminal running an activated FL environment, start each party by running:
```
python -m ibmfl.party.party <party_config>
```
Type START and press enter to start accepting connections.

Type REGISTER and press enter to register the party with the aggregator.
Finally, start training by entering TRAIN in the aggregator terminal.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

sklearn_logclassification_globalrw

sklearn_logclassification_globalrw

README.md

Dataset Setup

Running Scikitlearn Logistic Classifier in FL with Global Reweighing with Differential Privacy

Files

sklearn_logclassification_globalrw

Directory actions

More options

Directory actions

More options

Latest commit

History

sklearn_logclassification_globalrw

Folders and files

parent directory

README.md

Dataset Setup

Running Scikitlearn Logistic Classifier in FL with Global Reweighing with Differential Privacy