You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: docs/50_data_publication/00_data_publishing.mdx
+17-11
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ slug: "/data_publishing"
4
4
id: "data_publishing"
5
5
---
6
6
7
-
importuseBaseUrlfrom'@docusaurus/useBaseUrl';
7
+
importuseBaseUrlfrom"@docusaurus/useBaseUrl";
8
8
9
9
# Data Publishing
10
10
@@ -14,17 +14,22 @@ This page applies to all researchers who want to publish their data.
14
14
15
15
## Motivation
16
16
17
-
In chemical research, we strive to share results with others, commonly through articles in renowned scientific journals. To be able to actually work with and build upon these results, the scientific community also requires the data that the results were based on.
17
+
In chemical research, we strive to share results with others, commonly through articles in renowned scientific journals. To be able to actually work with and build upon these results, the scientific community also requires the data that the results were based on.
18
18
19
-
Publishing and therefore sharing these chemistry research data in a [FAIR](/docs/fair) manner by considering aspects such as [persistent identifiers](/docs/pid/), rich [metadata](/docs/metadata), [provenance information](/docs/provenance/), [data formats standards](/docs/format_standards) for analytical data, information on the [license](/docs/licenses/) applied, and by providing [machine-readable chemical structures](/docs/machine-readable_chemical_structures) adds value to the research results and enables discovery and reuse.
19
+
Publishing and therefore sharing these chemistry research data in a [FAIR](/docs/fair) manner by considering aspects such as [persistent identifiers](/docs/pid/), rich [metadata](/docs/metadata), [provenance information](/docs/provenance/), [data formats standards](/docs/format_standards) for analytical data, information on the [license](/docs/licenses/) applied, and by providing [machine-readable chemical structures](/docs/machine-readable_chemical_structures) adds value to the research results and enables discovery and reuse.
20
20
21
21
To publish data is essential to ensure that findings are transparent and reproducible. Moreover, it prevents duplicate efforts to generate data, hence, data publishing is also a measure of sustainability.
22
22
23
23
## Benefits of data publishing
24
24
25
25
There are direct benefits for researchers who publish their data. Data publications increase your career recognition, enable new collaborations, and provide a citation advantage compared to articles without an associated and linked dataset in a research data repository.
26
26
27
-
<imgalign="center"src={useBaseUrl('/img/data_pub/data_publication_brian_hole_CC_BY_40.png')}alt="Data Publication Brian Hole CC BY 4.0"width="80%" />
(Source: Brian Hole, [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/), [Slideshare](https://www.slideshare.net/brianhole/the-journal-of-open-archaeology-data-and-prime-incentivising-open-data-archiving))
30
35
@@ -34,8 +39,8 @@ Additionally, the research community benefits here as, for example, new research
34
39
35
40
There are two main ways to publish research data:
36
41
37
-
- publish data in a [research data repository](/docs/repositories/)
38
-
- publish a separate [data article](/docs/data_articles/) with the corresponding dataset published in a [research data repository](/docs/repositories/)
42
+
-publish data in a [research data repository](/docs/repositories/)
43
+
-publish a separate [data article](/docs/data_articles/) with the corresponding dataset published in a [research data repository](/docs/repositories/)
39
44
40
45
:::info Info:
41
46
Field-specific repositories should be the first choice as these repositories enhance the FAIRness of data on behalf of the submitters. To retain the same level of FAIRness, data publishing in generic repositories requires manual FAIRification.
@@ -53,10 +58,11 @@ Still confused about how to publish chemistry data? We do provide a consulting a
53
58
54
59
## Sources and further information
55
60
56
-
-[The citation advantage of linking publications to research data](https://doi.org/10.1371/journal.pone.0230416)
57
-
-[NSF Public Access Plan 2.0](https://nsf-gov-resources.nsf.gov/2023-06/NSF23104.pdf)
58
-
-[DFG Code of Conduct, Guideline 13: Providing public access to research results](https://wissenschaftliche-integritaet.de/en/code-of-conduct/providing-public-access-to-research-results/)
59
-
-[How does Open Science practice differ between research disciplines?](https://theplosblog.plos.org/2023/07/open-science-by-discipline/)
61
+
-[The citation advantage of linking publications to research data](https://doi.org/10.1371/journal.pone.0230416)
62
+
-[NSF Public Access Plan 2.0](https://nsf-gov-resources.nsf.gov/2023-06/NSF23104.pdf)
63
+
-[DFG Code of Conduct, Guideline 13: Providing public access to research results](https://wissenschaftliche-integritaet.de/en/code-of-conduct/providing-public-access-to-research-results/)
64
+
-[How does Open Science practice differ between research disciplines?](https://theplosblog.plos.org/2023/07/open-science-by-discipline/)
65
+
66
+
---
60
67
61
-
----
62
68
Main authors: [ORCID:0000-0002-6243-2840](https://orcid.org/0000-0002-6243-2840) and [ORCID:0000-0003-4480-8661](https://orcid.org/0000-0003-4480-8661)
### Use ORCID iD to Identifiy Authors and ROR to Identifiy Institutions
7
+
8
+
:::tip Standard
9
+
*Authors should provide their ORCID iD to identify the authors/creators and contributors, and their ROR identifier to identify the institution to which they are affiliated.*
10
+
:::
11
+
12
+
ORCID iDs allow authors to be uniquely identified, whereas author names may not be unique, may change, or may have different ordering conventions depending on cultural differences. Similarly, affiliations can vary according to style and granularity. In order to uniquely identify the institutions to which authors are affiliated, the ROR identifier should be provided. Please note that the ROR identifier is not intended to resolve down to the departmental level. Nevertheless, both identifiers improve findability.
13
+
14
+
### How to use Dataset PIDs in Scientific Articles
15
+
16
+
During deposition of research data, a [persistent identifier (PID)](/docs/pid) is assigned to the data. Authors should use PIDs in their scientific article for interlinking and referencing in **two main ways**:
17
+
18
+
#### How to use Dataset PIDs of own Datasets in Scientific Articles
19
+
20
+
21
+
:::tip Standard
22
+
*Authors should add the PID of their corresponding dataset(s) to the data availability statement and should add PIDs of dataset(s) to the reference section in order to specifically cite dataset(s).*
23
+
:::
24
+
25
+
For corresponding data, i.e. directly underlying the results reported in the article, add the PID to the article's [data availability statement](/docs/data_availability_statement), or a similarly termed section. In order to specifically reference and cite your data within the text, add your dataset to the references as well.
26
+
27
+
:::danger Notice:
28
+
This distinction is important, because **the link** to the dataset in CrossRef's DOI metadata of scientific articles **is differently set**, depending on whether the dataset is a directly related source of information or a specifically referenced resource.
29
+
:::
30
+
31
+
#### How to use Dataset PIDs for Datasets by other Researchers in Scientific Articles
32
+
33
+
34
+
:::tip Standard
35
+
*Authors should include PIDs for datasets published by other researchers that have been reused in the references, rather than citing the corresponding article.*
36
+
:::
37
+
38
+
For datasets published by other researchers and reused in a study, include the dataset PID in the reference section of the manuscript and cite within the text accordingly.
39
+
40
+
### How to Link Datasets to their Corresponding Scientific Article
41
+
42
+
:::tip Standard
43
+
*Researchers should link their datasets to be published to their corresponding articles by adding the article DOI to the dataset's DataCite metadata as a related identifier.*
44
+
:::
45
+
46
+
Research data repositories offer the option to add a related identifier to link datasets to related resources, such as a corresponding article. This considerably enhances the [FAIRness](/docs/fair/) of datasets, mainly the findability ([F2](/docs/fair/#f2-data-are-described-with-rich-metadata-defined-by-r1-below)) as well as the interoperability ([I3](/docs/fair/#i3-metadata-include-qualified-references-to-other-metadata)).
47
+
48
+
### Usage of Relation Type for Linking Datasets with Scientific Articles
49
+
50
+
:::tip Standard
51
+
*Researchers should link their datasets to be published to their corresponding articles using the relation type `IsSupplementTo`.*
52
+
:::
53
+
54
+
According to the [DataCite Metadata Schema](https://datacite-metadata-schema.readthedocs.io/en/4.5/appendices/appendix-1/relationType/), [`IsCitedBy`](https://datacite-metadata-schema.readthedocs.io/en/4.5/appendices/appendix-1/relationType/#iscitedby) and [`IsSupplementTo`](https://datacite-metadata-schema.readthedocs.io/en/4.5/appendices/appendix-1/relationType/#issupplementto) are both recommended for discovery. For published articles, [CrossRef's documentation on relationships](https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/relationships/) recommends that `isSupplmenetTo` should be used to link datasets generated as part of research results.
55
+
56
+
### Usage of Collection DOIs
57
+
58
+
:::tip Standard
59
+
*Researchers should use the Collection DOI provided by a repository in the data availability statements of their corresponding manuscript to wrap research data objects that are relevant to that of an article to be published.*
60
+
:::
61
+
62
+
Field-specific research data repositories may provide DOIs to reference individual chemical reactions, molecules, and their analytical data. Generic, multidisciplinary research data repositories provide DOIs for a whole published datasets, while more than one published dataset may be relevant to study results published via an article. In other words, many DOIs may be relevant to a published article, whereas **a data availability statement may provide some DOIs but not many DOIs**.
63
+
64
+
To facilitate the process of manuscript submission and article publication, researchers should add the **Collection DOI** provided by the repository to the data availability statement. This will ensure that all data underlying a published manuscript can be linked to an article in the data availability statement and the CrossRef metadata of that article.
65
+
66
+
If individual reactions, molecules, or analytical data should be referenced, add the DOIs of these individual research objects to the reference section and cite within the text accordingly.
67
+
68
+
<!--StandardonhowtopublishNMRdata,tbd-->
69
+
70
+
----
71
+
Main authors: [ORCID:0000-0003-4480-8661](https://orcid.org/0000-0003-4480-8661), [ORCID: 0000-0002-6243-2840](https://orcid.org/0000-0002-6243-2840)
0 commit comments