GithubHelp home page GithubHelp logo

workflowhub-eu / about Goto Github PK

View Code? Open in Web Editor NEW
9.0 13.0 17.0 5.88 MB

Website about the project WorkflowHub

Home Page: https://about.workflowhub.eu/

License: BSD 3-Clause "New" or "Revised" License

Ruby 0.05% HTML 99.36% Common Workflow Language 0.24% SCSS 0.35%
workflow repository open-science reproducibility seek biohackcovid20 cwl researchobject bioschemas metadata

about's People

Contributors

alaninmcr avatar bedroesb avatar carolegoble avatar fbacall avatar gannebamm avatar hexylena avatar jmfernandez avatar paulbrack avatar rabuono avatar simleo avatar simonbray avatar stain avatar stuzart avatar supernord avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

about's Issues

Invisible menu on webpage

When you view the webpage https://workflowhub.eu/ in a window with small enough to make the little menu button appear on the top right, the menu does not display. Clicking on the menu shows the search box only. Tested on Linux, Chrome and Firefox and Opera.

Advice on how workflows should markup with bioschemas and synch with workflowhub

BioSchemas/specifications#581 raises this point

What's the ideal way for this to work, we'll probably be submitting workflows to workflowhub ourselves.

    Do we annotate our WFs with ld+json, knowing WFHub will do it too, resulting in "two copies" for anyone scanning for workflows?
    Do we not annotate ours, knowing the "canonical" version can be kept in WF Hub?
    Do we just annotate ours, and somehow WF Hub will discover these and pick them up, saving us from doing anything?

I keep looking into working with wf hub but still haven't found good options for syncing all of our workflows programmatically, and I'm very interested in anything that helps us automate that!

Workflow RO-Crate: clarify spec on workflow types

The current spec says:

The Crate MUST contain a data entity of type ["File", "SoftwareSourceCode", "ComputationalWorkflow"] as the Main Workflow.

This could be interpreted as saying that the types must be exactly ["File", "SoftwareSourceCode", "ComputationalWorkflow"], while WorkflowHub actually allows for more types to appear on the list. This is required by the Provenance Run Crate profile (which adds HowTo), so I propose we change the spec as follows:

The Crate MUST contain a Main Workflow data entity whose types MUST include File, SoftwareSourceCode and ComputationalWorkflow.

Evaluate wfhub for best practices for research software registries

From Nine Best Practices for Research Software Registries and Repositories: A Concise Guide:

For examples, see https://scicodes.net/best-practices-for-software-registries-and-repositories/

I think we have most of these already, but may just need to link to our policies/schemas/docs, and perhaps make them more explicit here on these About pages and from the main hub.

Workflow RO Crate profile example does not appear to comply with the profile

The example crate does not contain an entity of @type ["File", "SoftwareSourceCode", "ComputationalWorkflow"]

Thought the profile here https://about.workflowhub.eu/Workflow-RO-Crate/ says it should

the Main entity does not have @type computational workflow:

    {
      "@id": "example_workflow.cwl",
      "@type": [
        "File",
        "SoftwareSourceCode",
        "HowTo"
      ],
      "programmingLanguage": {
        "@id": "https://w3id.org/workflowhub/workflow-ro-crate#cwl"
      },
      "name": "Example Workflow",
      "image": {
        "@id": "diagram.svg"
      }
    },

List UoM as the Data Controller for WorkflowHub

https://workflowhub.eu/home/terms says

The Data Controller for the WorkflowHub is EOSC-Life WorkflowHub work package, contact at [email protected]. If the WorkflowHub is accessed at a different URL, the data controller is likely someone else

but the GDPR regulations says

‘controller’ means the natural or legal person, public authority, agency or other body which, alone or jointly with others, determines the purposes and means of the processing of personal data; where the purposes and means of such processing are determined by Union or Member State law, the controller or the specific criteria for its nomination may be provided for by Union or Member State law;

EOSC-Life WorkflowHub work package is not the correct work package name, and in addition a work package is not a natural or legal person. Do we want to claim it as an “other body”? Another problem is that the contact list given does not go to the work package or work package leaders.

I suggest to list The University of Manchester, UK as the legal entity that is the Data Controller, as employers of the individuals of that list, and as the formal consortium agreement member of EOSC-Life and the hosting provider of the virtual machine that runs the workflowhub.eu instance. So effectively UoM is already the data controller.

The controller has legal responsibilities, so to avoid individuals working on this potentially be legally responsible, it should be the employer, as the legal entity, that is the Data Controller

WorkflowHub documentation comments [WIP]

The following points are based on going through the wiki:

  • there is no homepage for the user guide (or none I could find), you either access a project page (https://github.com/workflowhub-eu/about/wiki) or a guide on uploading workflows (https://github.com/workflowhub-eu/about/wiki)

  • a bit of introduction on the homepage would be interesting, and another one in the "workflow upload"

  • the difference between uploading and referencing workflows should be explained

  • the workflow type should be explained

  • for "uploading galaxy workflows", I would guess that specifying how to retrieve the ".ga" file would be useful information to some users.

Issue on page: Complete workflow metadata

I would like to report an issue on the Complete workflow metadata page at /docs/complete-workflow-metadata/. Description of the issue: appears that links are broken

They are

For some reason, many markdown links like [Title](#bookmark=kix.gqofhtf06fx) are surrounded in <code> tags resulting in the raw markdown in the final webpage. Though the links wouldn't work. (Similar to #34 ?)

usage analytics

A requirement for reporting to funders and for future funding bids are usage analytics and contribution/download/access dashboards

  • accesses
  • downloads and views
  • access launches

it is not sufficient to just report how many workflows etc- the funders and the project management needs data for evidence of use.

Drop-down list for Related Links

Building on from #12 then instead of just a free-text field for "Related Links" (former "Discussion Channel") it would be nice to have a suggested list of link relations for the user to select from.

Part of this idea is that given a list of suggested links like documentation the user is more likely to fill it in than if it's all free-form.

These kind of links would be part of the value added by Workflow Hub - in theory people can put them in their GitHub README but frequently they don't.

Zenodo calls this "Related Identifiers" - see right margin on https://zenodo.org/record/4012124 for example.

Our drop-down list could also be based on (or be compatible with) DataCite schema's relationType although some of these may not make sense on workflows. (They may make sense on other SEEK types)

Mockup (based on Zenodo's form):

related-links

New in latest DataCite is also typing the resource at the other end - this is often just as useful:

type-of-export

However we may need "Other" as well with a text-field to fill in.

Here is the list Zenodo use:

relation label
datacite:isCitedBy cites this upload
datacite:cites is cited by this upload
datacite:isSupplementTo is supplemented by this upload
datacite:isSupplementedBy is a supplement to this upload
datacite:references is referenced by this upload
datacite:isReferencedBy references this upload
datacite:isNewVersionOf is previous version of this upload
datacite:isPreviousVersionOf is new version of this upload
datacite:isContinuedBy continues this upload
datacite:continues is continued by this upload
datacite:isPartOf has this upload as part
datacite:hasPart is part of this upload
datacite:isReviewedBy reviews this upload
datacite:reviews is reviewed by this upload
datacite:isDocumentedBy documents this upload
datacite:documents is documented by this upload
datacite:compiles is compiled/created by this upload
datacite:isCompiledBy compiled/created this upload
datacite:isDerivedFrom is the source this upload is derived from
datacite:isSourceOf has this upload as its source
datacite:isIdenticalTo is identical to this upload
datacite:isAlternateIdentifier is an alternate identifier of this upload

For us this upload is this workflow. See https://schema.datacite.org/meta/kernel-4.3/doc/DataCite-MetadataKernel_v4.3.pdf#page=46 (page 46) for full DataCite list.

As commented in #12 if we have the types of links we can not just present them in nicer ways but also provide them programmatically in the RO-Crate, BioSchemas markup and in DataCite export.

Other more generic link relations registries are IANA link relations which is used in <link rel="cite-as" href="https://doi.org/10.1234/abcd"> and <a rel="preview" href="thumbnail.png"> in HTML and equivalent Link: HTTP header although they are probably more important for internal site navigation.

Code of Conduct

Looking at the current CoC again, this is fine for me. The only thing for me is the email for reporting. [email protected] is probably not the right one (I have no idea who this goes to), and we likely need/want more than one person looking at it ?

Looking for speaker to present wfhub

Dear all,
thanks so much for making workflowhub a real platform where scientists and others can share their workflows in a citable fashion. In the LEAPS community, we would love to share some more knowledge of automated workflow engines with interested scientists. For this, we are setting up a co-working sprint early in 2023.
It would be great if someone from the core wfhub team could volunteer to present wfhub on Jan 27, 2023, to us (20-25 min talk, sometime between 10-12am CET).
Please reach out to me, in case you are interested. We would love to have you!
Best,
Peter

Rename "Discussion Channels" to "Related Links"

The BioExcel workflows have multiple websites, documentation, tutorials, source code and live Jupyter Notebook deployments.

While it is possible to list these within the rich-text of the Description, this is cumbersome to get consistent. It is also not easy to access such links programmatically (e.g. for inclusion in RO-Crate or DataCite metadata)

The existing "Discussion Channels" almost does the job, e.g. from example https://dev.workflowhub.eu/workflows/116

discussionchannels

Links can be added with a label, as shown above. This issue simply asks to rename "Discussion Channels" to "Related Links" of which a discussion channel would be one of possible links.

I'll make a separate issue about having drop-down selection for type of link.

Update items with my api key - API not working

I am trying to use the api documentation but the links are not working. It sees we have to use fairdomhub.org as entry point for calls but it returns a message of redirecting. I would like to send the ro-crate file parsed and mined from readme descriptions in order to automatize migration to workflowhub.

Help needed for asset management

Currently there is no help at all for asset management and its opaque. This impacts on the permissions of what can be done when by whom.

For example,
image

what is a housekeeper? what is a gatekeeper? does it matter? must I assign these? just some simple words here would help, instean of making me read a complicated document.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.