PyPSA-USA: Near Term Emission Targets

Uncertainty Analysis Workflow for PyPSA-USA

Intro

This repo contains the code used for the paper "xxx". Broadly, the workflow allows users to run a uncertainty analysis over PyPSA-USA networks. Included is 1. Uncertainty Parameterization. 2. Global Sensitivity Analysis (GSA). 3. Uncertainty Analysis (UA). This readme walks through how to configure and run the workflow.

Install

Installation requires uses to clone the GitHub repository and install required dependencies.

Clone the Repository

Users can clone the repository using HTTPS, SSH, or GitHub CLI. See GitHub docs for information on the different cloning methods. Run one of the following command to clone the repository. Once cloned, open the pypsa-gsa project directory.

via HTTPS

git clone https://github.com/DeltaE/pypsa-gsa.git

via SSH

If it your first time cloning a repository through ssh, you will need to set up your git with an ssh-key by following these directions.

git clone [email protected]:DeltaE/pypsa-gsa.git

via GitHub CLI

gh repo clone DeltaE/pypsa-gsa

Dependencies

Users can install dependencies via Anaconda or UV.

via Conda

Install mamba (a drop in replacement for conda) following the instructions here. Once installed, activate the environment (pypsa-gsa) with the following command.

conda env create --file pypsa-gsa.yaml
conda activate pypsa-gsa

via UV

Install uv following the instructions here. Once installed, activate the environment with the following command.

uv venv
source .venv/bin/activate

How to Configure

This section will walk through how configure the workflow. The results in this paper can be replicated by supplying the network files found in the Zenodo deposit. All configuration file options can be found in the config/ directory.

Scenario

To run a new scenario (ie. the GSA and UA workflow), give a new scenario name in the config/config.yaml file. Additionally, for the scenario, you can select if CH4 is counted against the emission budget. As CH4 can lead to eaisly lead to infeasabilities if running with a CO2 limit, users have the option to track CH4, but not count it against the emission limit.

scenario: 
  name: "caiso"
  ch4: false # if true, ch4 counts against emission budget. Else, ch4 still tracked, but not counted against emission budget.

Network File

This workflow uses PyPSA-USA to create a base network of different regions in the USA. The network file and the population layout data (generated during the PyPSA-USA workflow) must be placed in the config/pypsa-usa/ directory. In the config/config.yaml file, update the filepath names for pypsa-usa. The era5_year parameter extracts the minimum and maximum operating conditions for both natural gas and electrical trading. If running a sector study, this must be 2018.

pypsa_usa:
  network: *.nc # network file name
  pop_layout: *.csv # population file name
  era5_year: 2018 # year for energy trade limits

Uncertainty Parameterization

All uncertain parameters are defined in the config/parameters.csv file. For each uncertain parameter, the user must provide the information in the following table. The template file config/parameters.csv gives an example of the data schema.

Column Header	Description
name	Unique name to track the uncertain parameter
group	Name of group for the parameter to be included in for the sensitivity analysis
nice_name	Plotting name for the group
component	PyPSA component name. If applying to the timeseries value, ensure you include the '_t' (ie. 'link' and 'link_t' are treated seperatly)
carrier	PyPSA-USA carrier to filter the component by
attribute*	PyPSA attribute to apply the uncertainty range to
range*	Either 'percent' or 'absolute'. Percent will apply a relative percent change to the reference value. Absolute will disregard the reference value and apply the range specified.
unit*	Unit of input value. If using a 'percent' range, put 'percent'. All units are converted to PyPSA-USA base units
min_value	Minimum value to sample
max_value	Maximum value to sample
source	(Optional) Source of the data
notes	(Optional) Any additional info on the data

*Constraint uncertainty (for example, renewable portfolio standards or electric vehicle policies) are treated a little different (see config/parameters.csv for examples). The following constraint uncertainities are supported.

Transport electrification limits
Renewable portfolio standards
Clean energy standards
Technology capacity targets
Carbon limits
Natural gas import/exports to/from outside model scope
Electrical import/exports to/from outside model scope

When running the workflow, numerous checks are in place to ensure data is inputted corerctly. Moreover, the user can write out metadata associated with the sample to ensure final ranges are reasonable. This is specified in the configuration file as given below. metadata.csv and metadata.yaml will print out the same data, just in different formats.

metadata:
  csv: True
  yaml: True
  networks: True # keep solved networks

Gloabl Sensitivity Analysis

The following GSA configuration options are available:

gsa:
  parameters: config/parameters.csv 
  results: config/results_gsa.csv # results to run SA over
  replicates: 10 
  scale: True # Scale Elementary Effect
  rankings: # get union of most impactful parameters
    top_n: 3
    results:
    - marginal_cost_energy
    - marginal_cost_elec
    - carbon

The gsa.parameters should point to the uncertainty parameterization file described in the previous section. The gsa.results points to a file describing what results to run the GSA over (see table below). gsa.replicates and gsa.scale are options for the Method of Morris described here. If running a GSA over results of different units, it is important to scale the results! gsa.rankings is a post processing step to easily extract the what uncertainities are the most impactful on the specified results.

The following table shows how to configure the GSA results. See the file config/results_gsa.csv for an example:

Column Header	Description
name	Unique name to track the gsa result
nice_name	Nice name for the result
component	PyPSA component name. If applying to the timeseries value, the average is taken (ie. the average marginal cost)
carrier	PyPSA-USA carrier to filter the component by
variable	PyPSA attribute to apply the sensitivity to
plots	What plots for the variable to be applied to (ie. if you want to plot multiple results against each other)
unit	Unit for the plot

Uncertainty Analysis

The following UA configuration options are available:

uncertainity:
  sample: lhs # (lhs|sobol)
  replicates: 600
  parameters: # **index names** from parameters csv to include in sample
  - capex_com_elec_water_heater
  - lpg_cost
  - ng_leakage
  results: config/results_ua.csv # results to extract from ua
  plots: config/plots_ua.csv

The uncertainty.sample is the sampling method to use. Only Latin Hypercube Sampling (LHS) and Sobol Sampling are supported, and are generated using SALib. uncertainty.replicates modifies the number of samples, with information on how it is used here for LHS and here for Sobol. uncertainty.parameters specifies what parameters to include in the sample; these values will typically be the most impactful parameters identified from the GSA (and written out to the file results/{scenario}/gsa/rankings_top_n.csv). All other parameters in the config/parameters.csv file will take on their average value. uncertainty.results and uncertainty.plots describes the results and plots to extract from the UA. Details on how to format these files is given below.

The following table shows how to configure the UA results. See the file config/results_ua.csv for an example:

Column Header	Description
name	Unique name to track the uncertainty result
nice_name	Nice name for the result
component	PyPSA component name. If applying to the timeseries value, the average is taken (ie. the average marginal cost)
carrier	PyPSA-USA carrier to filter the component by
variable	PyPSA attribute to apply the sensitivity to
barplot*	What barplots to group the result on
unit*	Unit for the barplot

The following table shows how to configure plots generated from the UA. See the file config/plots_ua.csv for an example:

Column Header	Description
name	Unique name to track the uncertainty result
nice_name	Nice name for the result
type	Type of plot (`scatter`
xaxis	Result to plot on xaxis
xlabel	Label of xaxis
yaxis	Result to plot on yaxis (for scatter plot)
ylabel	Label of yaxis
plot	Name of plotted file
group	Subplot group
xhue	Legend Title (for scatter) or y label (for bar)

*TODO: Move this to the dedicated UA plotting config.

Solver

Four solving options are provided; cbc, HiGHS, CPLEX, and Gurobi. The user selects what solver and solver profile they want with the following config/config.yaml options.

# Choose a solver
solver:
  name: gurobi # (cbc|gurobi|cplex|highs)
  options: gurobi-default # see solving config

Solver profiles are configured in the config/solving.yaml file. An example of a solving profile is given below. This profile is taken from the main PyPSA-USA project here, which is in turn taken from the PyPSA-Eur project here.

solving:
  solver_options:
    gurobi-default:
      threads: 8
      method: 2 # barrier
      crossover: 0
      BarConvTol: 1.e-4
      OptimalityTol: 1.e-4
      FeasibilityTol: 1.e-3
      Seed: 123
      AggFill: 0
      PreDual: 0
      GURO_PAR_BARDENSETHRESH: 200

How to Run

snakemake is used for workflow orchastration. This section will walk through how to use snakemake to execute the workflow. Information on how to tune resources for both local and HPC execution can be found at the end of the section.

Generate Network Specific Data

Uncertain parameters to the specific network first need to be generated. These are parameters like CO2 targets and technology targets that use growth rates. To generate this data, first update the following configuration options in config/config.yaml. If you do not want to apply CO2 targets, set the values to False.

# config options for data that is generated
generated:
  co2L_min: False # 40 # as percentage of 2005 level emissions
  co2L_max: False # 50 # as percentage of 2005 level emissions
  ccgtccs_max: 10 # as a percentage of max natgas capacity

Then run the following command. A file called results/{scenario}/generated/config/parameters.csv will be generated that includes your original parameters and new paremeters appened to the bottom. Do not manually modify this file! (Note, this can not be added to the main Snakemake workflow due to the apply_*_sample_to_network rule exporting all networks in one rule rather than running the rule once for each sample. Its MUCH quicker to just export all at once to redule i/o operations.)

snakemake -s workflow/Snakefile.generate

Global Sensitivity Analysis

Next, do a dry run of the global sensitivity analysis with the following command:

snakemake gsa -n

You should see many hundreds or thouands of steps be prompted. If everything looks correct, run the workflow (for real!) with the command:

snakemake gsa

If you are running on an HPC and want to test the resources required for each solve, you can run the workflow through to one solve with the following command, then check the resouces required with the seff command and the job number.

snakemake test_solve

$ seff 54867276
Job ID: 54867276
Cluster: 
User/Group: 
State: TIMEOUT (exit code 0)
Cores: 1
CPU Utilized: 03:51:58
CPU Efficiency: 96.61% of 04:00:06 core-walltime
Job Wall-clock time: 04:00:06
Memory Utilized: 84.31 GB
Memory Efficiency: 5.43% of 1.52 TB (1.52 TB/node)

All results will be available in the results/{scenario}/gsa/ directory.

Uncertainty Analysis

Once the GSA has completed, you can check what parameters are the most influential in the results/{scenario}/gsa/rankings_top_n.csv file. These are the parameters to copy over the to uncertainty.parameters config option. All other uncertain will be locked to their average value, while these uncertain parameters will be included in the UP.

Once the uncertain parameters have been inputted, do a dry run of the uncertainity propogation with the following command:

snakemake ua -n

You should see many hundreds or thouands of steps be prompted. If everything looks correct, run the workflow (for real!) with the command:

snakemake ua

All results will be available in the results/{scenario}/ua/ directory.

Tuning Resources

Two snakemake profiles are provided by default; one to run locally and one to run on a High Performance Computer (HPC). Information on snakemake profiles can be found here.

Local Profile

If running locally, the default snakemake configuration options will likley be fine. If you do want to tune the snakemake resources for local execution, update the workflow/profiles/default/config.yaml file.

HPC Profile

If you are deploying the workflow to a HPC, some manual resource tuning will need to happen. Default resources can be found in the workflow/profiles/slurm/config.yaml file. Depending on your PyPSA network, the resouceses allocated to the solve rule will likley need to change. Once the input data has been generated, and before you run the GSA, you can run a single model to see the efficiency of the resources allocation. Moreover, ensure to specify the slurm profile in the snakemake call.

snakemake test_solve --workflow-profile workflow/profiles/slurm/

Once complete, inspect the resources either through benchmark files:

(gsa) [trevor23@login1 pypsa-gsa]$ cat benchmarks/solve/az_gsa_testing.txt
s       h:m:s   max_rss max_vms max_uss max_pss io_in   io_out  mean_load       cpu_time
292.8917        0:04:52 17330.54        24511.36        17270.87        17289.28        13.11   0.05    154.39  452.49

Or directly from the scheduler using the job number:

(gsa) [trevor23@login1 pypsa-gsa]$ seff 16085566
Job ID: 16085566
Cluster: fir
User/Group: trevor23/trevor23
State: COMPLETED (exit code 0)
Nodes: 1
Cores per node: 2
CPU Utilized: 00:07:41
CPU Efficiency: 76.32% of 00:10:04 core-walltime
Job Wall-clock time: 00:05:02
Memory Utilized: 20.90 GB
Memory Efficiency: 66.87% of 31.25 GB (15.62 GB/core)

If resource tuning needs to happen, change the threads, runtime, and mem_mb_per_cpu parameters directly in the workflow/profiles/slurm/config.yaml file.

set-threads:
  solve_network: 2
  test_solve_network: 2

set-resources:
  solve_network:
    mem_mb_per_cpu: 16000
    runtime: 12
  test_solve_network:
    mem_mb_per_cpu: 16000
    runtime: 12

Result Dashboard

Parsing through static images to understand the GSA and UA results can be very difficult; as there is so much data! A dashboard is available to help users decipher and understand their results. To locally run the dashboard for your results, run the following commands. Note, this dashboard is designed to analyze data at an ISO level, and is not compatiable for smaller zones.

# move to the dashboard directory
cd dashboard

# extract data from the results for the dashboard
python collect_data.py

# run the dashboard
python app.py

References

This work uses the following tools:

T. Brown, J. Hörsch, D. Schlachtberger, PyPSA: Python for Power System Analysis, 2018, Journal of Open Research Software, 6(1), arXiv:1707.09913, DOI:10.5334/jors.188

Tehranchi, K., Barnes, T., Frysztacki, M., Hofmann, F., & Azevedo, I. L. PyPSA-USA: An Open-Source Energy System Optimization Model for the United States (Version 0.0.1) [Computer software]. https://doi.org/10.5281/zenodo.10815964

Iwanaga, T., Usher, W., & Herman, J. (2022). Toward SALib 2.0: Advancing the accessibility and interpretability of global sensitivity analyses. Socio-Environmental Systems Modelling, 4, 18155. https://doi.org/10.18174/sesmo.18155

This work is heavily inspired by the following literature:

US Energy-Related Greenhouse Gas Emissions in the Absence of Federal Climate Policy. Hadi Eshraghi, Anderson Rodrigo de Queiroz, and Joseph F. DeCarolis Environmental Science & Technology 2018 52 (17), 9595-9604. https://doi.org/10.1021/acs.est.8b01586

Usher W, Barnes T, Moksnes N and Niet T. Global sensitivity analysis to enhance the transparency and rigour of energy system optimisation modelling [version 1; peer review: 1 approved, 2 approved with reservations]. Open Res Europe 2023, 3:30. https://doi.org/10.12688/openreseurope.15461.1

Moret, S., Gironès, V. C., Bierlaire, M., & Maréchal, F. (2017). Characterization of input uncertainties in strategic energy planning models. Applied Energy, 202, 597–617. https://doi.org/10.1016/j.apenergy.2017.05.106

Name		Name	Last commit message	Last commit date
Latest commit History 351 Commits
config		config
dashboard		dashboard
resources		resources
results		results
workflow		workflow
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PyPSA-USA: Near Term Emission Targets

Intro

Install

Clone the Repository

via HTTPS

via SSH

via GitHub CLI

Dependencies

via Conda

via UV

How to Configure

Scenario

Network File

Uncertainty Parameterization

Gloabl Sensitivity Analysis

Uncertainty Analysis

Solver

How to Run

Generate Network Specific Data

Global Sensitivity Analysis

Uncertainty Analysis

Tuning Resources

Local Profile

HPC Profile

Result Dashboard

References

About

Uh oh!

Releases

Packages

Languages

License

DeltaE/pypsa-gsa

Folders and files

Latest commit

History

Repository files navigation

PyPSA-USA: Near Term Emission Targets

Intro

Install

Clone the Repository

via HTTPS

via SSH

via GitHub CLI

Dependencies

via Conda

via UV

How to Configure

Scenario

Network File

Uncertainty Parameterization

Gloabl Sensitivity Analysis

Uncertainty Analysis

Solver

How to Run

Generate Network Specific Data

Global Sensitivity Analysis

Uncertainty Analysis

Tuning Resources

Local Profile

HPC Profile

Result Dashboard

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages