Skip to content

Commit 5f61f17

Browse files
authored
Merge pull request #151 from nextstrain/open-data
Swap to open data for public forecasts
2 parents 353feae + f0dd868 commit 5f61f17

File tree

4 files changed

+15
-17
lines changed

4 files changed

+15
-17
lines changed

.github/workflows/update-ncov-open-clade-counts.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -57,9 +57,7 @@ jobs:
5757
5858
trigger_model_runs:
5959
needs: [open_clade_counts]
60-
# Disable the run-model workflow due to low sequence counts
61-
# <https://github.com/nextstrain/forecasts-ncov/issues/119>
62-
if: false
60+
if: ${{ !github.event.inputs.trial_name }}
6361
runs-on: ubuntu-latest
6462
steps:
6563
- run: gh workflow run run-models.yaml --repo nextstrain/forecasts-ncov -f data_provenance=open

config/config.yaml

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,5 @@
11
# See ingest/README.md for all available data
22
data_provenances:
3-
- gisaid
43
- open
54
variant_classifications:
65
- nextstrain_clades
@@ -39,10 +38,10 @@ prepare_data:
3938
nextstrain_clades:
4039
global:
4140
included_days: 150
42-
location_min_seq: 1000
41+
location_min_seq: 200
4342
location_min_seq_days: 150
4443
excluded_locations: "defaults/global_excluded_locations.txt"
45-
clade_min_seq: 500
44+
clade_min_seq: 200
4645
clade_min_seq_days: 150
4746
usa:
4847
included_days: 150
@@ -53,12 +52,12 @@ prepare_data:
5352
pango_lineages:
5453
global:
5554
included_days: 150
56-
location_min_seq: 1000
55+
location_min_seq: 200
5756
location_min_seq_days: 150
5857
excluded_locations: "defaults/global_excluded_locations.txt"
5958
clade_min_seq: 1
6059
clade_min_seq_days: 150
61-
collapse_threshold: 350
60+
collapse_threshold: 200
6261

6362
# Params for the model run scripts
6463
models:

viz/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -38,8 +38,8 @@ The command `npm run start:local` will both serve the website and provide access
3838

3939

4040
```bash
41-
VITE_CLADES_PATH="gisaid/nextstrain_clades/global/mlr/YYYY-MM-DD_results.json" \
42-
VITE_LINEAGES_PATH="gisaid/pango_lineages/global/mlr/YYYY-MM-DD_results.json" \
41+
VITE_CLADES_PATH="open/nextstrain_clades/global/mlr/YYYY-MM-DD_results.json" \
42+
VITE_LINEAGES_PATH="open/pango_lineages/global/mlr/YYYY-MM-DD_results.json" \
4343
npm run start:local
4444
```
4545

viz/src/App.jsx

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,13 +6,13 @@ const mlrCladesConfig = {
66
modelName: "mlr_clades",
77
modelUrl: customAddress ?
88
`${import.meta.env.VITE_DATA_HOST}/${import.meta.env.VITE_CLADES_PATH}` :
9-
`https://nextstrain-data.s3.amazonaws.com/files/workflows/forecasts-ncov/gisaid/nextstrain_clades/global/mlr/latest_results.json`,
9+
`https://nextstrain-data.s3.amazonaws.com/files/workflows/forecasts-ncov/open/nextstrain_clades/global/mlr/latest_results.json`,
1010
}
1111
const mlrLineagesConfig = {
1212
modelName: "mlr_lineages",
1313
modelUrl: customAddress ?
1414
`${import.meta.env.VITE_DATA_HOST}/${import.meta.env.VITE_LINEAGES_PATH}` :
15-
`https://nextstrain-data.s3.amazonaws.com/files/workflows/forecasts-ncov/gisaid/pango_lineages/global/mlr/latest_results.json`,
15+
`https://nextstrain-data.s3.amazonaws.com/files/workflows/forecasts-ncov/open/pango_lineages/global/mlr/latest_results.json`,
1616
}
1717

1818
function App() {
@@ -32,17 +32,18 @@ function App() {
3232

3333
<div className="warningContainer">
3434
<h2>
35-
On Oct 1, 2025, we received an email from GISAID stating that they will no longer be updating the flat file
36-
of EpiCoV data they've historically provisioned to Nextstrain since Feb 2020. Updates to these analyses are on
37-
hold while we sort out alternative strategies to pull in new data from GISAID.
35+
On Oct 1, 2025, we received an email from GISAID stating that they will no longer be updating the flat file of
36+
EpiCoV data they've historically provisioned to Nextstrain since Feb 2020. Updates to GISAID-based analyses are
37+
currently on hold. Please use this analysis based on open data instead, updates to which are continuing at their
38+
regular weekly cadence.
3839
</h2>
3940
</div>
4041

4142
<div id="mainPanelsContainer">
4243
<h2>Clade frequencies over time</h2>
4344
<p>
4445
Each line represents the estimated frequency of a particular clade through time. Equivalent Pango lineage is given
45-
in parenthesis, eg clade 23A (lineage XBB.1.5). Only locations with more than 1000 sequences from samples collected
46+
in parenthesis, eg clade 23A (lineage XBB.1.5). Only locations with more than 200 sequences from samples collected
4647
in the previous 150 days are included. Results last updated {mlrCladesData?.modelData?.get('updated') || 'loading'}.
4748
</p>
4849
<div id="cladeFrequenciesPanel" class="panelDisplay"> {/* surrounding div(s) used for static-images.js script */}
@@ -64,7 +65,7 @@ function App() {
6465
<h2>Lineage frequencies over time</h2>
6566
<p>
6667
Each line represents the estimated frequency of a particular Pango lineage through time. Lineages with fewer
67-
than 350 observations are collapsed into parental lineage. Only locations with more than 1000 sequences from
68+
than 200 observations are collapsed into parental lineage. Only locations with more than 200 sequences from
6869
samples collected in the previous 150 days are included. Results last updated
6970
{mlrLineagesData?.modelData?.get('updated') || 'loading'}.
7071
</p>

0 commit comments

Comments
 (0)