Data Management: Data Release, Sharing, and Publication
The ability to prepare, release, and share (or disseminate) quality data with the public and other agencies is an important part of the data lifecycle. Data should be medium- and agent-independent, with the understanding that transfer may occur through automated or non-automated mechanisms. Data must be shared, but with controls that protect proprietary and pre-decisional data and the integrity of the data themselves. Shared data also need complete metadata to be useful to the people who receive them.
What the White House Says:
The Office of Science and Technology Policy (OSTP) February 22, 2013, Memorandum entitled "Increasing Access to the Results of Federally Funded Scientific Research" requires public access to digital datasets resulting from federally funded research, including datasets used to support scholarly publications.
The Office of Management and Budget (OMB) May 9, 2013, Memorandum M-13-13 entitled "Open Data Policy-Managing Information as an Asset" requires agencies to collect or create information in a way that supports downstream information processing and dissemination activities. This includes using machine-readable and open formats, data standards, and common core and extensible metadata for all new information creation and collection efforts.
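As an illustration of what "machine-readable and open formats" with accompanying metadata can look like in practice, the sketch below serializes a few records to CSV and a metadata record to JSON. It is a minimal example under stated assumptions: the function name `export_dataset`, the sample records, and the metadata field names are all hypothetical and are not prescribed by the memorandum.

```python
import csv
import io
import json


def export_dataset(rows, fieldnames, metadata):
    """Return (csv_text, metadata_json) for a list of record dictionaries."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()      # column names make the file self-describing
    writer.writerows(rows)    # one line per record, in an open text format
    # Sorted keys keep the metadata output stable between runs.
    return buffer.getvalue(), json.dumps(metadata, indent=2, sort_keys=True)


# Hypothetical sample records, for illustration only.
rows = [
    {"site_id": "A001", "date": "2013-05-09", "stream_temp_c": 12.4},
    {"site_id": "A002", "date": "2013-05-09", "stream_temp_c": 11.9},
]

# Hypothetical metadata fields; a real release would follow the
# metadata standard required by the agency.
metadata = {
    "title": "Example stream temperature observations",
    "description": "Illustrative records used to demonstrate export.",
    "keyword": ["stream temperature", "example"],
    "modified": "2013-05-09",
    "publisher": "Example Science Center",
}

csv_text, metadata_json = export_dataset(
    rows, ["site_id", "date", "stream_temp_c"], metadata
)
```

Keeping the data and the metadata in separate plain-text files means either one can be read by common tools without proprietary software.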
What the U.S. Geological Survey Manual Requires:
The USGS Survey Manual chapter SM 502.8, Fundamental Science Practices: Review and Approval of Scientific Data for Release, requires timely public access at no cost to scientific data that are developed or funded by the Bureau’s information and research programs.
SM 502.8 further specifies details on what is required to obtain approval to publish or release data to the public. Until data are approved for release, data are considered provisional or preliminary and subject to revision. Reviews of data and associated metadata are required before data are approved for release. Among other requirements are an appropriate disclaimer statement, a persistent identifier, and procedures to ensure long-term preservation.
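The requirements named above can be tracked as a simple pre-release checklist. The sketch below is illustrative only: the class `ReleaseCandidate`, its field names, and the status strings are assumptions made for this example, and it covers only the items listed in this section, not the full set of SM 502.8 requirements. Approval itself remains a human decision.

```python
from dataclasses import dataclass


@dataclass
class ReleaseCandidate:
    """Hypothetical record of a dataset's release preparation."""
    data_reviewed: bool = False
    metadata_reviewed: bool = False
    disclaimer: str = ""
    persistent_identifier: str = ""
    preservation_plan: str = ""


def missing_requirements(candidate):
    """List the checklist items that are not yet satisfied."""
    checks = [
        ("data review", candidate.data_reviewed),
        ("metadata review", candidate.metadata_reviewed),
        ("disclaimer statement", bool(candidate.disclaimer.strip())),
        ("persistent identifier", bool(candidate.persistent_identifier.strip())),
        ("long-term preservation procedure", bool(candidate.preservation_plan.strip())),
    ]
    return [name for name, satisfied in checks if not satisfied]


def release_status(candidate):
    """Data stay provisional until every listed item is in place."""
    if missing_requirements(candidate):
        return "provisional"
    return "eligible for approval"


# Example: reviews are complete, but the remaining items are missing.
draft = ReleaseCandidate(data_reviewed=True, metadata_reviewed=True)
print(release_status(draft))
print(missing_requirements(draft))
```

Returning the list of missing items, rather than a single yes or no, shows the data owner exactly what remains before approval can be requested.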
Chapter 1100.3 - U.S. Geological Survey Publication Series discusses the publication of information products, which include data products.
Chapter 1100.6 - Use of Copyrighted Material in USGS Information Products covers the use of copyrighted material in information products, which include data products:
"Policy: Copyrighted material will not knowingly be incorporated into any information product prepared by or for the USGS, except with the written consent of the copyright owner. . . . Purchase agreements should refer to both the material that is copyrighted and the copyright permission because they are distinct and both need to be acquired."
Chapter 1100.4 - Use of Outside Publications, Including Abstracts addresses the issue of copyright in external publications.
Chapter SM 502.2 - Fundamental Science Practices: Planning and Conducting Data Collection and Research describes how data collection and research are planned and conducted.
Chapter SM 502.5 - Fundamental Science Practices: Safeguarding Unpublished U.S. Geological Survey Data, Information, and Associated Scientific Materials describes the safeguarding and release of USGS data and information:
“It is the policy of the USGS to conduct its activities and to make the results of its scientific and engineering investigations available in a manner that will best serve the whole public rather than the interest or benefit of any particular group, corporation, individual, or other entity. Preliminary release of USGS data or information, including early release of USGS publications, is not made exclusively to any group or individual.”