nasa / smd-open-science-guidelines Goto Github PK
View Code? Open in Web Editor NEWLicense: Other
License: Other
Expand publications guidance to include:
Per https://github.com/nasa/smd-open-science-guidelines/blob/main/request_for_comment/draft/rfc_002_how_to_make_data_fair.md#first-steps-2, it states:
Ensure an identifier is included in the DOI metadata directing to detailed product metadata. Use the RelatedIdentifier field with relationType IsDescribedBy.
When we say identifier here, what are we talking about? Is this an internal system PID? If so, what should our relatedIdentifierType
be? For instance, in the Planetary Data System (PDS), we have an internal PID for all products,. Which of these types should we use?
From SPD-41a implementation plan:
CSDO: Provide further examples on Scientific Information as part of the Open Science Guidelines. This would expand on Appendix E and Appendix F.
Discuss with @nasacrawford - What types of examples are desired here?
SPD-41a "SMD-funded data collections shall be indexed as part of the NASA catalog of data."
Technically this is data.NASA.gov. However, data hosted at a NASA repository or indexed by ADS will be, eventually through the SDE, automatically listed there. If proposers follow the guidance on selecting an appropriate repository for their research data, then they do not need to take any additional action to meet this requirement.
Develop guidance on sharing training data, including:
Following web modernization, update links to the division OSDMP templates to ensure that any future updates will be captured. The best approach may be to link to template pages on science.nasa.gov: example
Not included in SPD-41a, but consider adding some basic guidance on considerations for open hardware
See CERN Open Science policy and/or notes from CERN open source sessions
Section F3 states that:
It is a reminder to curators that the globally unique, persistent identifier *must *be propagated to the metadata for the target data itself, not merely cataloged in an institutional database.
Does that imply the DOI needs to be included in the data files themselves (e.g. header in a fits file)?
What happens to a file that is associated with multiple DOIs. For example, an image file from a telescope observation can have a DOI that points to it as a data product, but that image file is also part of an observation that have its own DOI.
How does sharing technology openly intersect with an investigator's ability to patent their work?
For proposers who work with tribal nations and have concerns about data sovereignty and SMD's data sharing requirements, what guidance is available?
It is not currently clear what is an appropriate Code of Conduct document for a codebase on NASA's GitHub organization. Can a standard — or at least, guidance on a — Code of Conduct document for repositories in NASA's GitHub be established? I believe it could be beneficial to streamline this for repository contributors, and it would be helpful for other folks visiting and interacting with repositories on NASA's GitHub.
Preparing code repositories (including on NASA's GitHub org.) to follow Open Science guidelines typically includes adding a Code of Conduct — as a file named CODE_OF_CONDUCT.md
. In general, Codes of Conduct set expectations for how people interact with the project and with each other, and they are expected to be present for certain applications. For example, pyOpenSci explicitly asks for a CODE-of-CONDUCT.md to be included in a submitted repository.
Lab experiments are different than the observations and theory research which are done in many of our divisions. It would be helpful to provide more guidance on what data or software would be useful to make available and how to make it available.
Examples:
The DOI guidance says NASA employees but it probably should say NASA credentials.
Expand current guidance on data management and sharing to include:
Develop flow charts to guide researchers/proposers/reviewers through set of questions to ask when considering whether an OSDMP is compliant with the policy at the proposal stage, and/or whether actions taken to manage and share scientific information are compliant with the policy.
PSD created a nice set of these that have been shared internally within the OSSI. A public version of this information may be a helpful addition to this guidance.
See SPD-41a FAQ on this topic and develop guidance in terms of FAIR principles
The RFC you are commenting on: Registration of DOIs for data citation
Type of comment:
Section(s) of document referenced: C.1.
Comment/Question: Reference to SPD-41a should be updated when the policy is approved.
The original guidance on publications
smd-open-science-guidelines/guidance/research_publications.md
has now been replaced with
smd-open-science-guidelines/OSS_Guidance/Publications.md
We should remove the old guidance or somehow indicate that it has been replaced since it is less comprehensive than the new version. For review by @nasacrawford
This is a really useful document by Tracy Chen and collaborators on best practices for data in astronomical literature. It would be useful to link to in either the data pages or resources pages to provide further information.
As for links of the best practices, use the ADS link to the article (https://ui.adsabs.harvard.edu/abs/2022ApJS..260....5C/abstract), or the link to the NED BP page (http://ned.ipac.caltech.edu/Documents/Guides/BestPractices) with the convenient checklist included.
Per https://github.com/nasa/smd-open-science-guidelines/blob/main/request_for_comment/draft/rfc_002_how_to_make_data_fair.md#first-steps-3, more specifically:
Provide auto generated citation text in common formats like bibtex, RIS, and txt on data set landing pages for authors to use when citing the data.
Is there guidance or a how-to on how to do this easily within out website?
This repository can link to the NASA Transform to Open Science (TOPS) initiative information so that these guidelines are provided within a broader context.
Expand upon existing guidance on restricted information given in data and software sections.
Add more background on the restrictions that generate exceptions to information sharing (e.g. ITAR, EAR, HIPAA).
Search open grants list for relevant examples
Add background on relevant NASEM reports: example.
Per https://github.com/nasa/smd-open-science-guidelines/blob/main/request_for_comment/draft/rfc_002_how_to_make_data_fair.md#first-steps-4, are there any ideas for more concrete direction for how we can provide an integrated look at how DOIs can be accessed across SMD? Maybe a page on SciX for how to access all the information about DOIs across the various systems?
e.g.
System | Web Search Access | API Access |
---|---|---|
System X | link TBD | link TBD |
System Y | link TBD | link TBD |
As requested in https://mastodon.social/@crawfordsm/109676705818114890, I'm creating this issue to suggest that while the current guidelines for data generally require making data CC0 licensed, and encourage metadata and identifiers that make the data citable, the guidance does not actually require or encourage citation of the data when it is used. One could argue that existing scholarly norms already do this, but it would be useful if NASA at least encouraged this as well.
Expand current guidance on sharing materials from scientific events to include broader recommendations on how to hold events openly (focus on accessibility and inclusion).
See if TOPS OS 101 has any guidance on this that we can link to once it's released.
The below link is currently broken in this documentation:
https://github.com/nasa/smd-open-science-guidelines/blob/main/OSS_Guidance/data.nasa.gov
This may be due to the NASA web migration switch, as the transition from beta to primary occurred last week. So, there may be other links that are broken as well.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.