We recently completed a major refactor of the entire IF codebase, moving from an object oriented to functional programmign style.
If you are a IF user or developer, there are some changes you must be aware of resulting from a recent refactor of the IF codebase. This guide will help you to update your processes to integrate your work with the newly refactored IF.
There have been some name changes to the CLI, specifically:
-
impact-engine
-->if-run
The command line tool has been renamed fromimpact-engine
to simplyif-run
. This means that to invoke the Impact Framework on the command line you simply useif-run ...
-
impl
-->manifest
We have deprecated the originalimpl
andompl
terminology across all our repositories and on the command line. Now, to pass a manifest file to IF, you use the--manifest
command, as follows:if-run --manifest <path-to-manifest>
-
ompl
-->output
We have deprecated the original
impl
andompl
terminology across all our repositories and on the command line. Now, to define a savepath for your output file, you use the--output
command, as follows:if-run --manifest <path-to-manifest> --output <savepath>
As before, you can install IF from our npm package using
npm i @grnsft/if
Then run IF using the following command:
if-run --manifest <path-to-manifest>
This will dump the output to the console. If you want to save the output to a yaml file, provide a savepath to the --output
command:
if-run --manifest <path-to-manifest> --output <savepath>
If you want to clone the source code and install and run a local copy of IF, you can do so using:
git clone https://github.com/Green-Software-Foundation/if &&
cd if &&
npm i
Then run IF using the following command:
npm run if-run -- --manifest <path-to-manifest>
There have also been some changes to the structure of manifest files. Some of these are simple renaming changes, others are more functional.
-
Rename
graph
->tree
Thegraph
section of the manifest file is now renamed totree
. This is just to help us stay consistent in our metaphors and provide a more familiar naming convention for the data beneath. -
Use plugin name as key in
Initialize
block In the previous version of IF, the plugins were organized into an array each having aname
key, with the plugin name as the value. In the refactored IF, we use the name as the key identifying the plugin. For example, this is the OLD way:initialize: plugins: - name: ccf model: CloudCarbonFootprint path: if-plugins
This is the new way:
initialize: plugins: 'sci-e': path: '@grnsft/if-plugins' method: SciE
-
Rename
model
tomethod
inInitialize
block Each plugin in the initialize blockEach plugin in the
Initialize
block has a field where the name of the exported function representing that plugin is defined. Previously, these were class names and they were defined using themodel
key. Now, they are functions, and they are defined using themethod
key. We usemethod
instead offunction
becausefunction
is a reserved keyword in Typescript.For example:
'sci-embodied': path: 'builtin' method: SciEmbodied
-
Global config We have introduced the concept of global config to the plugins. This is truly global configuration data that should be kept constant regardless of where the plugin is invoked across the manifest file.
A good example is the
interpolation
method to use in the Teads curve plugin - this is not expected to vary from component to component and can therefore be defined in global config. The plugin code itself must expect the global config. Then, the config can be passed in theInitialize
block, for example:initialize: plugins: 'time-sync': method: TimeSync path: 'builtin' global-config: start-time: '2023-12-12T00:00:00.000Z' end-time: '2023-12-12T00:01:00.000Z' interval: 5 allow-padding: true
-
Node level config
We have also introduced the concept of node-level config. This is designed for pluin configuration that might vary between components in the tree. For example, for each child in the tree you might wish to use the
groupby
plugin to group the outputs according to a different set of keys.tree: children: child-1: pipeline: - teads-curve - sci-e - sci-embodied - sci-o - time-sync - sci config: group-by: group: - region - cloud/instance-type
-
Defaults
We have also introduced the concept of
defaults
. This is a section in each component's definition that can be used to provide fallbacks for missing input data. For example, perhaps you have a value arriving from an external API that should be present in every observation in your inputs array, but for soem reason the API fails to deliver a value for some timestamps. In this case, IF would fallback to the value provided for that metric in thedefaults
section of the manifest for that component.You can also use
defaults
as a quick way to add values to everyobservation in your input array if those values are expected to be constant over time (e.g. some of the lifespan values for embodied carbon calculations could be appropriate to include in defaults). This saves you from having to enter the value in every observation in the input array, instead IF can automatically grab it fromdefaults
for every timestamp.tree: children: child-1: pipeline: - teads-curve - sci-e - sci-embodied - sci-o - time-sync - sci defaults: cpu/thermal-design-power: 100 grid/carbon-intensity: 800 device/emissions-embodied: 1533.120 # gCO2eq time-reserved: 3600 # 1hr in seconds device/expected-lifespan: 94608000 # 3 years in seconds resources-reserved: 1 resources-total: 8
Technically time-sync is not a new feature as it was present in IF before the refactor, but there are some tweaks to how the plugin is configured that are worth explaining here. Time sync snaps all input arrays across an entire graph to a common time grid.
This means you have to define a global start time, end time and interval to use everywhere. There is also a boolean to toggle whether you should allow the time sync model to pad the start and end of your time series with zeroes. You should default to true
unless you have a specific reason not to. In the refactored IF we expect this information to be provided in global config, as follows:
initialize:
plugins:
'time-sync':
method: TimeSync
path: 'builtin'
global-config:
start-time: '2023-12-12T00:00:00.000Z'
end-time: '2023-12-12T00:01:00.000Z'
interval: 5
allow-padding: true
The aggregate plugin aggregates data in two ways: first it condenses individual time series down into a single value (in many cases, this will be the total across the observation period for each metric) and aggregating multiple time series from several components into a single time series (in many cases this means the sum of the metric across multiple componments for each timestep).
This is a builtin feature of IF, meaning it does not have to be initialized as a plugin. Instead, you just have to include a short config block in the top of the manifest file. There are two pieces of information required:
-
metrics
: which metrics do you want to aggregate? Every metric you provide here must exist in the output array.method
: the aggregation method for the specied metric
-
type
: the options arehorizontal
,vertical
or both. Horizontal aggregation is the type that condenses each time series into a single summary value. Vertical aggregation is aggregated across components.
Here's what the config block should look like:
aggregation:
metrics:
'carbon':
method: 'sum'
type: 'both'
Groupby allows you to regroup your outputs according to keys you define. For example, maybe you want to group your outputs by region (show me all the outputs for applications run in uk-south
etc). Groupby is a plugin that needs to be initialized in the manifest.
You can initialize the plugin as follows:
initialize:
plugins:
'group-by':
path: builtin
method: GroupBy
Then you configure groupby for each component in the node level config. In the following example we will regroup the outputs by the region
:
tree:
children:
child-1:
pipeline:
- teads-curve
- sci-e
- sci-embodied
- sci-o
- time-sync
- group-by
- sci
config:
group-by:
group:
- region
We have introduced exhaust
as an IF feature. This is a wrapper around export plugins and it allows community contributors to create plugins for exporting to different formats.
Details tbc...
The plugins themselves require some changes to keep them compatible with the refactored IF.
Instead of the old class-based model, plugins are now functions. They conform to the following interface:
export type PluginInterface = {
execute: (
inputs: PluginParams[],
config?: Record<string, any>
) => PluginParams[];
metadata: {
kind: string;
};
[key: string]: any;
};
The plugin still requires an execute function. This is where you implement the plugin logic.
Here's a minimal example for a plugin that sums some inputs defined in global config - see inline comments for some important notes:
// Here's the function definition - notice that global config is passed in here!
export const Sum = (globalConfig: SumConfig): PluginInterface => {
const inputParameters = globalConfig['input-parameters'] || [];
const outputParameter = globalConfig['output-parameter'];
// we also return metadata now too - you can add more or just use this default
const metadata = {
kind: 'execute',
};
/**
* Calculate the sum of the input metrics for each timestamp.
*/
const execute = async (inputs: PluginParams[]): Promise<PluginParams[]> => {
inputs.map(input => {
return calculateSum(input, inputParameters, outputParameter);
});
return inputs;
};
/**
* Calculates the sum of the energy components.
*/
const calculateSum = (
input: PluginParams,
inputParameters: string[],
outputParameter: string
) => {
input[outputParameter] = inputParameters.reduce(
(accumulator, metricToSum) => {
return accumulator + input[metricToSum];
},
0
);
};
// return the metadata and the execute function
return {
metadata,
execute,
};
};