Merge branch 'master' into refactor/SK-936

scaleoutsystems · Jul 16, 2024 · d774208
2 parents e95e248 + 1ed374b
Showing 5 changed files with 111 additions and 182 deletions.
diff --git a/examples/FedSimSiam/README.rst b/examples/FedSimSiam/README.rst
@@ -1,18 +1,23 @@
+   **Note: If you are new to FEDn, we recommend that you start with the MNIST-Pytorch example instead: https://github.com/scaleoutsystems/fedn/tree/master/examples/mnist-pytorch**
+
 FEDn Project: FedSimSiam on CIFAR-10
 ------------------------------------
 
-This is an example FEDn Project that runs the federated self-supervised learning algorithm FedSimSiam on 
-the CIFAR-10 dataset. This is a standard example often used for benchmarking. To be able to run this example, you 
-need to have GPU access. 
+This is an example FEDn Project that trains the federated self-supervised learning algorithm FedSimSiam on 
+the CIFAR-10 dataset. CIFAR-10 is a popular benchmark dataset that contains images of 10 different classes, such as cars, dogs, and ships.
+In short, FedSimSiam trains an encoder to learn useful feature embeddings for images, without the use of labels. 
+After the self-supervised training stage, the resulting encoder can be downloaded and trained for a downstream task (e.g., image classification) via supervised learning on labeled data.
+To learn more about self-supervised learning and FedSimSiam, have a look at our blog post: https://www.scaleoutsystems.com/post/federated-self-supervised-learning-and-autonomous-driving
+
+To run the example, follow the steps below. For a more detailed explanation, follow the Quickstart Tutorial: https://fedn.readthedocs.io/en/stable/quickstart.html
 
-   **Note: We recommend all new users to start by following the Quickstart Tutorial: https://fedn.readthedocs.io/en/stable/quickstart.html** 
+**Note: To be able to run this example, you need to have GPU access.**
 
 Prerequisites
 -------------
 
--  `Python 3.8, 3.9, 3.10 or 3.11 <https://www.python.org/downloads>`__
--  `A FEDn Studio account <https://fedn.scaleoutsystems.com/signup>`__   
--  Change the dependencies in the 'client/python_env.yaml' file to match your cuda version.
+-  `Python >=3.8, <=3.12 <https://www.python.org/downloads>`__
+-  `A project in FEDn Studio  <https://fedn.scaleoutsystems.com/signup>`__   
 
 Creating the compute package and seed model
 -------------------------------------------
@@ -36,90 +41,31 @@ Create the compute package:
 
    fedn package create --path client
 
-This should create a file 'package.tgz' in the project folder.
+This creates a file 'package.tgz' in the project folder.
 
-Next, generate a seed model (the first model in a global model trail):
+Next, generate the seed model:
 
 .. code-block::
 
    fedn run build --path client
 
-This will create a seed model called 'seed.npz' in the root of the project. This step will take a few minutes, depending on hardware and internet connection (builds a virtualenv).  
-
-Using FEDn Studio
------------------
-
-Follow the instructions to register for FEDN Studio and start a project (https://fedn.readthedocs.io/en/stable/studio.html).
-
-In your Studio project:
-
-- Go to the 'Sessions' menu, click on 'New session', and upload the compute package (package.tgz) and seed model (seed.npz).
-- In the 'Clients' menu, click on 'Connect client' and download the client configuration file (client.yaml)
-- Save the client configuration file to the FedSimSiam example directory (fedn/examples/FedSimSiam)
-
-To connect a client, run the following command in your terminal:
-
-.. code-block::
-
-   fedn client start -in client.yaml --secure=True --force-ssl
-
-
-Running the example
--------------------
+This will create a model file 'seed.npz' in the root of the project. This step will take a few minutes, depending on hardware and internet connection (builds a virtualenv).  
 
-After everything is set up, go to 'Sessions' and click on 'New Session'. Click on 'Start run' and the example will execute. You can follow the training progress on 'Events' and 'Models', where you 
-can monitor the training progress. The monitoring is done using a kNN classifier that is fitted on the feature embeddings of the training images that are obtained by
-FedSimSiam's encoder, and evaluated on the feature embeddings of the test images. This process is repeated after each training round.
+Running the project on FEDn Studio
+----------------------------------
 
-This is a common method to track FedSimSiam's training progress, as FedSimSiam aims to minimize the distance between the embeddings of similar images.
-A high accuracy implies that the feature embeddings for images within the same class are indeed close to each other in the
-embedding space, i.e., FedSimSiam learned useful feature embeddings.
+To learn how to set up your FEDn Studio project and connect clients, follow the Quickstart Tutorial: https://fedn.readthedocs.io/en/stable/quickstart.html
 
 
-Running FEDn in local development mode:
----------------------------------------
-
-Follow the steps above to install FEDn, generate 'package.tgz' and 'seed.tgz'.
-
-Start a pseudo-distributed FEDn network using docker-compose:
-.. code-block::
-
-   docker compose \
-    -f ../../docker-compose.yaml \
-    -f docker-compose.override.yaml \
-    up
-
-This starts up local services for MongoDB, Minio, the API Server, one Combiner and two clients. 
-You can verify the deployment using these urls: 
-
-- API Server: http://localhost:8092/get_controller_status
-- Minio: http://localhost:9000
-- Mongo Express: http://localhost:8081
-
-Upload the package and seed model to FEDn controller using the APIClient:
-
-.. code-block::
-
-   from fedn import APIClient
-   client = APIClient(host="localhost", port=8092)
-   client.set_active_package("package.tgz", helper="numpyhelper")
-   client.set_active_model("seed.npz")
-
-
-You can now start a training session with 100 rounds using the API client:
-
-.. code-block::
-
-   client.start_session(rounds=100)
-
-Clean up 
---------
-
-You can clean up by running
-
-.. code-block::
+When running the example in FEDn Studio, you can follow the training progress of FedSimSiam under 'Models'. 
+After each training round, a kNN classifier is fitted to the feature embeddings of the training images obtained 
+by FedSimSiam's encoder and evaluated on the feature embeddings of the test images. 
+This is a common method to track FedSimSiam's training progress, 
+as FedSimSiam aims to minimize the distance between the embeddings of similar images. 
+If training progresses as intended, the kNN accuracy increases because the feature embeddings of
+images within the same class move closer to each other in the embedding space.
+The figure below shows the kNN accuracy increasing over the training rounds,
+which indicates that FedSimSiam is learning useful feature embeddings.
 
-   docker-compose \
-   -f ../../docker-compose.yaml \
-   -f docker-compose.override.yaml \
-   down -v
+.. image:: figs/fedsimsiam_monitoring.png
+   :width: 50%
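The kNN monitoring described in the FedSimSiam README above can be illustrated with a small sketch. This is not the code used by the example: the function names, the choice of cosine similarity, the value of `k`, and the toy two-dimensional embeddings are all assumptions made for illustration. A real setup would run the trained encoder on CIFAR-10 images and would more likely use NumPy or scikit-learn than plain Python.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors (lists of floats)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def knn_predict(train_embeddings, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest neighbours."""
    ranked = sorted(
        range(len(train_embeddings)),
        key=lambda i: cosine_similarity(train_embeddings[i], query),
        reverse=True,
    )
    votes = Counter(train_labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


def knn_accuracy(train_embeddings, train_labels, test_embeddings, test_labels, k=3):
    """Fraction of test embeddings whose kNN prediction matches the true label."""
    correct = sum(
        knn_predict(train_embeddings, train_labels, emb, k) == label
        for emb, label in zip(test_embeddings, test_labels)
    )
    return correct / len(test_labels)


# Hypothetical embeddings standing in for encoder outputs.
train_embeddings = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
train_labels = ["car", "car", "ship", "ship"]
test_embeddings = [[0.8, 0.1], [0.1, 0.7]]
test_labels = ["car", "ship"]

accuracy = knn_accuracy(train_embeddings, train_labels, test_embeddings, test_labels)
print(f"kNN accuracy: {accuracy:.2f}")
```

Labels are used only for this evaluation step, not for training the encoder, which is why the accuracy can serve as a proxy for embedding quality during self-supervised training.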
diff --git a/examples/FedSimSiam/figs/fedsimsiam_monitoring.png b/examples/FedSimSiam/figs/fedsimsiam_monitoring.png
diff --git a/examples/huggingface/README.rst b/examples/huggingface/README.rst
@@ -1,3 +1,6 @@
+
+   **Note: If you are new to FEDn, we recommend that you start with the MNIST-Pytorch example instead: https://github.com/scaleoutsystems/fedn/tree/master/examples/mnist-pytorch**
+
 Hugging Face Transformer Example
 --------------------------------
 
@@ -11,20 +14,21 @@ Federated learning is a privacy preserving machine learning technique that enabl
 Fine-tuning large language models (LLMs) on various data sources enhances both accuracy and generalizability.
 In this example, the Enron email spam dataset is split among two clients. The BERT-tiny model is fine-tuned on the client data using 
 federated learning to predict whether an email is spam or not.
-Execute the following steps to run the example:
 
-Prerequisites
--------------
+In FEDn Studio, you can visualize the training progress by plotting test loss and accuracy, as shown in the plot below.
+After running the example for only a few rounds in FEDn Studio, the BERT-tiny model, fine-tuned via federated learning,
+is able to detect spam emails on the test dataset with high accuracy.
 
-Using FEDn Studio:
+.. image:: figs/hf_figure.png
+   :width: 50%
 
--  `Python 3.8, 3.9, 3.10 or 3.11 <https://www.python.org/downloads>`__
--  `A FEDn Studio account <https://fedn.scaleoutsystems.com/signup>`__   
+To run the example, follow the steps below. For a more detailed explanation, follow the Quickstart Tutorial: https://fedn.readthedocs.io/en/stable/quickstart.html 
 
-If using pseudo-distributed mode with docker-compose:
+Prerequisites
+-------------
 
--  `Docker <https://docs.docker.com/get-docker>`__
--  `Docker Compose <https://docs.docker.com/compose/install>`__
+-  `Python >=3.8, <=3.12 <https://www.python.org/downloads>`__
+-  `A project in FEDn Studio  <https://fedn.scaleoutsystems.com/signup>`__   
 
 Creating the compute package and seed model
 -------------------------------------------
@@ -48,100 +52,17 @@ Create the compute package:
 
    fedn package create --path client
 
-This should create a file 'package.tgz' in the project folder.
+This creates a file 'package.tgz' in the project folder.
 
-Next, generate a seed model (the first model in a global model trail):
+Next, generate the seed model:
 
 .. code-block::
 
    fedn run build --path client
 
-This will create a seed model called 'seed.npz' in the root of the project. This step will take a few minutes, depending on hardware and internet connection (builds a virtualenv).  
-
-
-
-Using FEDn Studio (recommended)
--------------------------------
-
-Follow the instructions to register for FEDN Studio and start a project (https://fedn.readthedocs.io/en/stable/studio.html).
-
-In your Studio project:
-
-- Go to the 'Sessions' menu, click on 'New session', and upload the compute package (package.tgz) and seed model (seed.npz).
-- In the 'Clients' menu, click on 'Connect client' and download the client configuration file (client.yaml)
-- Save the client configuration file to the huggingface example directory (fedn/examples/huggingface)
-
-To connect a client, run the following command in your terminal:
-
-.. code-block::
-
-   fedn client start -in client.yaml --secure=True --force-ssl
-   
-
-Alternatively, if you prefer to use Docker, run the following:
-
-.. code-block::
-
-   docker run \
-   -v $PWD/client.yaml:/app/client.yaml \
-   -e CLIENT_NUMBER=0 \
-   -e FEDN_PACKAGE_EXTRACT_DIR=package \
-   ghcr.io/scaleoutsystems/fedn/fedn:0.9.0 client start -in client.yaml --secure=True --force-ssl
-
-
-Running the example
--------------------
-
-After everything is set up, go to 'Sessions' and click on 'New Session'. Click on 'Start run' and the example
-will execute. You can follow the training progress on 'Events' and 'Models', where you can view the calculated metrics.
-
+This will create a model file 'seed.npz' in the root of the project. This step will take a few minutes, depending on hardware and internet connection (builds a virtualenv).  
 
+Running the project on FEDn
+----------------------------
 
-Running FEDn in local development mode:
----------------------------------------
-
-Create the compute package and seed model as explained above. Then run the following command:
-
-
-.. code-block::
-
-   docker-compose \
-   -f ../../docker-compose.yaml \
-   -f docker-compose.override.yaml \
-   up
-
-
-This starts up local services for MongoDB, Minio, the API Server, one Combiner and two clients. You can verify the deployment using these urls:
-
-- API Server: http://localhost:8092/get_controller_status
-- Minio: http://localhost:9000
-- Mongo Express: http://localhost:8081
-
-
-Upload the package and seed model to FEDn controller using the APIClient:
-
-.. code-block::
-
-    from fedn import APIClient
-    client = APIClient(host="localhost", port=8092)
-    client.set_active_package("package.tgz", helper="numpyhelper")
-    client.set_active_model("seed.npz")
-
-
-You can now start a training session with 5 rounds (default) using the API client:
-
-.. code-block::
-
-    client.start_session()
-
-Clean up 
---------
-
-You can clean up by running 
-
-.. code-block::
-
-   docker-compose \
-   -f ../../docker-compose.yaml \
-   -f docker-compose.override.yaml \
-   down -v
+To learn how to set up your FEDn Studio project and connect clients, follow the Quickstart Tutorial: https://fedn.readthedocs.io/en/stable/quickstart.html
diff --git a/examples/huggingface/figs/hf_figure.png b/examples/huggingface/figs/hf_figure.png
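Review note on `fedn/cli/run_cmd.py`: the `validate` and `train` subcommands added in this commit differ only in the entry point name, so the shared steps (check that the entry point exists in `fedn.yaml`, then build the command string passed to the dispatcher) could live in one helper. The sketch below is only an illustration of that idea, not code from the repository: `EntryPointError`, `build_entry_point_command`, and the shape of the example config dictionary are hypothetical, and the FEDn `Dispatcher` is deliberately left out so the snippet stays self-contained.

```python
class EntryPointError(Exception):
    """Raised when the config does not define the requested entry point."""


def build_entry_point_command(config, entry_point, input_path, output_path):
    """Return the command string for an entry point, e.g. 'train in.npz out.npz'.

    `config` is assumed to be the parsed fedn.yaml as a dictionary.
    """
    # Using .get() tolerates a missing 'entry_points' key instead of raising KeyError.
    entry_points = config.get("entry_points") or {}
    if entry_point not in entry_points:
        raise EntryPointError(f"No {entry_point} command defined in fedn.yaml")
    return f"{entry_point} {input_path} {output_path}"


# Hypothetical config; the real fedn.yaml layout may differ.
example_config = {"entry_points": {"train": {"command": "python train.py"}}}
print(build_entry_point_command(example_config, "train", "seed.npz", "update.npz"))
```

Each subcommand would then reduce to a call to such a helper followed by `dispatcher.run_cmd(...)`. Judging from the options declared in the diff, the new subcommands would presumably be invoked along the lines of `fedn run train --path client --input seed.npz --output <output file>`.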
diff --git a/fedn/cli/run_cmd.py b/fedn/cli/run_cmd.py
@@ -4,7 +4,6 @@
 
 import click
 import yaml
-
 from fedn.common.exceptions import InvalidClientConfig
 from fedn.common.log_config import logger
 from fedn.network.clients.client import Client
@@ -44,7 +43,70 @@ def run_cmd(ctx):
     """:param ctx:
     """
     pass
+@run_cmd.command("validate")
+@click.option("-p", "--path", required=True, help="Path to package directory containing fedn.yaml")
+@click.option("-i", "--input", required=True, help="Path to input model")
+@click.option("-o", "--output", required=True, help="Path to write the output JSON containing validation metrics")
+@click.pass_context
+def validate_cmd(ctx, path, input, output):
+    """Execute 'validate' entrypoint in fedn.yaml.
+
+    :param ctx:
+    :param path: Path to folder containing fedn.yaml
+    :type path: str
+    """
+    path = os.path.abspath(path)
+    yaml_file = os.path.join(path, "fedn.yaml")
+    if not os.path.exists(yaml_file):
+        logger.error(f"Could not find fedn.yaml in {path}")
+        exit(-1)
+
+    config = _read_yaml_file(yaml_file)
+    # Check that validate is defined in fedn.yaml under entry_points
+    if "validate" not in config["entry_points"]:
+        logger.error("No validate command defined in fedn.yaml")
+        exit(-1)
+
+    dispatcher = Dispatcher(config, path)
+    _ = dispatcher._get_or_create_python_env()
+    dispatcher.run_cmd("validate {} {}".format(input, output))
+
+    # delete the virtualenv
+    if dispatcher.python_env_path:
+        logger.info(f"Removing virtualenv {dispatcher.python_env_path}")
+        shutil.rmtree(dispatcher.python_env_path)
+@run_cmd.command("train")
+@click.option("-p", "--path", required=True, help="Path to package directory containing fedn.yaml")
+@click.option("-i", "--input", required=True, help="Path to input model parameters")
+@click.option("-o", "--output", required=True, help="Path to write the updated model parameters")
+@click.pass_context
+def train_cmd(ctx, path, input, output):
+    """Execute 'train' entrypoint in fedn.yaml.
 
+    :param ctx:
+    :param path: Path to folder containing fedn.yaml
+    :type path: str
+    """
+    path = os.path.abspath(path)
+    yaml_file = os.path.join(path, "fedn.yaml")
+    if not os.path.exists(yaml_file):
+        logger.error(f"Could not find fedn.yaml in {path}")
+        exit(-1)
+
+    config = _read_yaml_file(yaml_file)
+    # Check that train is defined in fedn.yaml under entry_points
+    if "train" not in config["entry_points"]:
+        logger.error("No train command defined in fedn.yaml")
+        exit(-1)
+
+    dispatcher = Dispatcher(config, path)
+    _ = dispatcher._get_or_create_python_env()
+    dispatcher.run_cmd("train {} {}".format(input, output))
+
+    # delete the virtualenv
+    if dispatcher.python_env_path:
+        logger.info(f"Removing virtualenv {dispatcher.python_env_path}")
+        shutil.rmtree(dispatcher.python_env_path)
 @run_cmd.command("startup")
 @click.option("-p", "--path", required=True, help="Path to package directory containing fedn.yaml")
 @click.pass_context