Skip to content

Commit 7949a3b

Browse files
authored
update data policies with data use agreement (#99)
* update data policies with data use agreement * edits * revisions
1 parent cabfad7 commit 7949a3b

2 files changed

Lines changed: 80 additions & 111 deletions

File tree

1.56 MB
Binary file not shown.

datapolicies.md

Lines changed: 80 additions & 111 deletions
Original file line numberDiff line numberDiff line change
@@ -2,125 +2,94 @@
22
layout: base
33
permalink: /datapolicies/
44
---
5-
# Data Policies -- DRAFT
6-
7-
This page contains a **DRAFT** data policies proposal.
8-
9-
This page describes the disclosure levels for observational (generation
10-
and environmental) and forecast data submitted to the framework, as well
11-
as data derived from the process of comparing forecast fields to
12-
observations (e.g. summary statistics).
13-
14-
An **organization** is an entity that owns data or obtains license to submit
15-
data to the framework. A **user** is an individual working for an organization
16-
that performs the tasks of submitting data to the framework, granting other
17-
users or organizations rights to view data, and downloads data from the
18-
framework.
19-
20-
21-
Guiding Principles
22-
------------------
23-
24-
The following principles guide our data policies.
25-
26-
1. Organizations retain ownership of the data they upload to the framework.
27-
2. Users upload data to the framework on behalf of organizations.
28-
2. Users have complete control over how their organization's data may be
29-
accessed by other users within organizations.
30-
2. Users may delete their organization's data from the framework at any time.
31-
2. The framework will not sell, donate, or otherwise disclose the data that it
32-
controls without express written consent of the data owner.
33-
2. All non-public data will be securely deleted by the conclusion of the DOE
34-
funding period (June 30, 2021) unless express written permission to
35-
retain it is obtained.
36-
37-
38-
Contributing data
39-
-----------------
5+
# Data Policies
6+
7+
This page summarizes the data sharing and privacy policies of the
8+
Solar Forecast Arbiter.
9+
10+
An **organization** is an entity that owns data or obtains license to
11+
submit data to the framework. A **user** is an individual working for an
12+
organization that performs tasks such as submitting data to the
13+
framework and downloading data from the framework. An **organization
14+
administrator** is a user that has can also grant permissions to view or
15+
modify data to other users, including users outside of the
16+
administrator's own organization.
17+
18+
The [Data Use Agreement](/assets/45864_UAZ_Solar_Forecast_Arbiter_Final.pdf)
19+
(DUA) is a **non-negotiable** legal document that all parties are bound to.
20+
The DUA must be signed by an authorized representative
21+
of an organization before its employees will be allowed to upload data,
22+
view data contributed by other users, or generate summary statistics.
23+
24+
The data policies can be summarized as:
25+
26+
* Signing the DUA does **not** obligate an organization to upload data,
27+
nor does it obligate an organization to share uploaded data.
28+
* Organizations retain ownership of the data they upload to the framework.
29+
* Users upload data to the framework on behalf of organizations.
30+
* Organization administrators have complete control over how their
31+
organization's data may be accessed by other users.
32+
* Organization administrators may delete their organization's data from the
33+
Arbiter at any time.
34+
* Uploading data does **not** give Solar Forecast Arbiter team members
35+
the ability to study it. Sharing data with project team members
36+
follows the same procedures as sharing data with any other user.
37+
* All data will be securely deleted within 30 days of the termination of
38+
the project (anticipated late 2021).
39+
40+
The DUA describes two types of data that participants may contribute:
41+
*Open Project Data* and *Limited Project Data*. Limited Project Data is
42+
proprietary data for which access controls are required. Most of the
43+
data policies are structured around addressing concerns about Limited
44+
Project Data. Open Project Data is data that users contribute to the
45+
project's reference data set. This data immediately benefits the whole
46+
community, but organizations lose control over who can access it.
47+
48+
## Contributing data
4049

4150
Each organization contributing data to the framework retains ownership
4251
and control of its data. Each framework user is a member of an
4352
organization and contributes data on behalf of the organization.
4453

4554
Through the framework web interface or API calls, the user associates
4655
data with data access policies. These policies allow specific
47-
organizations/users to access to the data. Example policies are shown
48-
below. The web interface will show an organization a list of all of the
49-
submitted metadata/data it owns and the users/organizations that can
50-
access each data source.
51-
52-
Under some use cases, anonymized time series data and or summary statistics
53-
derived from the data are owned by the framework. This ensures trial fairness,
54-
transparency, and reproducibility. The framework is not allowed
55-
to sell, donate, or otherwise disclose anonymized data or statistics.
56+
users to access to the data. The web interface shows
57+
an organization administrator a list of all of the submitted metadata/data
58+
it owns and the access roles given to other users. Please see the
59+
[Data Access Control documentation](/data-access-workflow/) for more
60+
information.
5661

62+
Under a forecast trial use case, anonymized time series data and/or
63+
summary statistics derived from the data are owned by the framework.
64+
This ensures trial fairness, transparency, and reproducibility. However,
65+
the framework operators are not allowed to view or disclose anonymized
66+
data or statistics unless given permission using the data sharing
67+
features of the web service.
5768

58-
Deleting data
59-
-------------
69+
## Deleting data
6070

6171
An organization may delete its data from the framework at any time, or
62-
an organization may ask the framework administrators to delete its data
63-
from the framework.
64-
65-
All non-public data will be securely deleted at the conclusion of the
66-
DOE funding period (June 30, 2021) unless express written consent is
67-
granted by both the data contributor and the post-DOE-funding framework
68-
administrator.
69-
70-
71-
Data access policies
72-
--------------------
73-
74-
A data access policy defines the ways in which the data can accessed by users.
75-
The framework supports the following data access policies, ordered from least
76-
to most permissible.
77-
78-
* Owner-only
79-
* Default access level for data uploaded by users.
80-
* Peer-to-peer
81-
* A single user, working on behalf of an organization, grants another user
82-
working on behalf of another organization permission to view data.
83-
* Multiparty Confidential Access anonymized
84-
* Groups of organizations/users will be defined in response to each need
85-
e.g. a particular forecast trial.
86-
* Anonymized data is accessible to all users within a specific group.
87-
* Anonymized data is not accessible by users outside of the group (or general public).
88-
* True owners of anonymized data are not known by framework administrators (see note below).
89-
* Multiparty Confidential Access
90-
* Groups of organizations/users will be defined in response to each need
91-
e.g. a particular trial.
92-
* Data is accessible to all users within a specific group.
93-
* Data is not accessible by users outside of the group (or general public).
94-
* Public (with required sign on)
95-
* Reference data such as SURFRAD, SOLRAD, Sandia, NREL MIDC, U Oregon,
96-
DOE Data Acquisition and Archive Portal (DAP), etc.
97-
98-
99-
100-
Non-Disclosure Agreements
101-
-------------------------
102-
103-
The creator of an organization account must agree to common terms before
104-
the account may be activated. The terms describe the data policies and
105-
procedures outlined in this document. The representative of the
106-
organization that agrees to the terms must have the authority to do so.
107-
Only then can the user view data or add new members to the organization.
108-
New members will be prompted to review and agree to the terms when creating a
109-
user account.
110-
111-
In the event that the terms change, organization administrators will be
112-
prompted to reverify their compliance.
113-
114-
115-
Other
116-
-----
117-
118-
The framework will use national standards and best practices for
119-
security of stored data and data transmission.
120-
121-
On anonymity from framework administrator/operator... with some study of
122-
IP logs the framework operator could potentially determine who
123-
contributed what data. This would require super user access on the
124-
servers. This access will be restricted to the smallest number of people
125-
possible. Data contributors that want to prevent this possibility can
126-
use a VPN.
72+
an organization may ask the framework operators to delete its data from
73+
the framework.
74+
75+
All non-public data will be securely deleted within 30 days of the
76+
termination of the project (anticipated late 2021).
77+
78+
## Other
79+
80+
The Solar Forecast Arbiter uses national standards and best practices
81+
for security of stored data and data transmission.
82+
83+
Stakeholders have questioned the ability for framework operators to
84+
access user-contributed data. Accessing user-contributed data requires
85+
highly-restricted super user access on the servers. This access is
86+
restricted to the smallest number of people possible. The DUA expressly
87+
forbids the framework operators from analyzing or examining
88+
user-contributed data except when required to solve service issues.
89+
90+
Stakeholders have also questioned the ability for framework operators to
91+
deanonymize data contributed to anonymous forecast trials. With some
92+
study of IP logs the framework operator could potentially determine who
93+
contributed what data. This too would require highly-restricted super
94+
user access on the servers. Data contributors that want to prevent this
95+
possibility can use a VPN.

0 commit comments

Comments
 (0)