Comments (10)
Oh, interesting, I was independently wondering to myself if we should disconnect ontology production from the pipeline runs, and do GitHub releases similar to other OBO ontologies. We should probably talk this through.
from go-ontology.
Is this a project? Likely, "yes", although possibly a short one if it went ahead. That said, I really want to be careful in the use cases and coordination here--there is a reason we have historically "locked" these things to annotation and it's an orthogonal (although very real) problem that we're having issues with the production pipelines.
Some important orienting notes on this:
- we already fully build the ontology 3 times a day (http://skyhook.berkeleybop.org/go-ontology-dev/), but the product is transient
- archiving that so it is not volatile, to a public URL, is "easy"
- noctua/minerva are updated every two weeks and, as far as we can tell (geneontology/pipeline#363), use the absolute latest, as it's self-contained
So the questions I have look like:
- exactly who is going to be using a high-frequency ontology and in what use cases? I'm assuming Alex here, but who else?
- where do PURLs go?
- assuming that we get snapshots back to where we want them to be, say daily-ish, where would this stand?
- counting down getting the new machine [https://github.com/geneontology/pipeline/issues/351] and some fixes I think may help [https://github.com/geneontology/pipeline/issues/352#issuecomment-2000772991]), what is the preferred use; does it change?
from go-ontology.
@kltm what do you mean by 'absolute latest' here?
noctua/minerva are updated every two weeks and, as far as we can tell (geneontology/pipeline#363), use the absolute latest, as it's self-contained
Is 'snapshot' considered the 'absolute latest'? Or do you use one of the 'volatile' daily products to build NEO?
from go-ontology.
There's nothing "volatile" per se, but it is "made-to"order" from GH--it is self-contained and has nothing to do with anything else. Ontology building is easy an occurs many places, depending on use.
from go-ontology.
Talking to @pgaudet this morning, we worked out the following points:
- a "rolling" ontology product would be useful right now
- a given use case was to give other curation groups access to a built ontology product
- this would not be for "general consumption", as we want to generally steer people towards products that can be referenced
- this can be revisited when
snapshots
(and releases) are consistent again - there would have to be good communication / documentation on this
from go-ontology.
Initial testing (just populating bucket go-data-product-ontology-build
) on 4pm PT run.
from go-ontology.
This is a thing now: https://ontology-build.geneontology.org
If go-ontology-dev
is successful, it will update. Currently, that means up to three times a day.
I'm now running through some tests and looking at maintenance.
As it stands now, if go-ontology-dev
fails for two weeks, it will go dark (as the cleaning routines take passes through).
Assuming this is about right, we can turn over the conversation to documentation and communication about expectations.
from go-ontology.
Excellent!! Thanks !
from go-ontology.
Moved this to a new Project.
from go-ontology.
@pgaudet I updated the project metadata to remove mention of Noctua (this has nothing to do with noctua) and "daily".
from go-ontology.
Related Issues (20)
- Rhea alignment of glucosidase II HOT 4
- Fix remaining cases of terms with duplicated MetaCyc xref HOT 3
- GO:000903, Rhea reaction has been updated
- Term definitions and scope of GO:0052695 cellular glucuronidation and GO:0019390 glucuronoside biosynthetic process HOT 9
- Fix remaining terms with >1 (apparent) EC xref
- Taxon constraint: GO:0009089 | lysine biosynthetic process via diaminopimelate
- Catalytic activity terms with no cross references - terms in scope for GO
- Add check for database definition cross references to also be a general cross reference HOT 2
- Catalytic activity (MF) terms with multiple MetaCyc mappings HOT 9
- Pathway (BP) terms with multiple MetaCyc mappings HOT 17
- carbonyl reductase (NADPH) activity (GO:0004090)
- alcohol dehydrogenase/aldo-keto reductase hierarchy HOT 8
- typo in term nameGO:0103045 | methione N-acyltransferase activity
- Obsoletion request: GO:0035629 N-terminal protein amino acid N-linked glycosylation; HOT 20
- Obsoletion request: GO:0071170 | site-specific DNA replication termination GO:0071946 | cis-acting DNA replication termination HOT 2
- Missing parent: polyphosphate import into vacuole HOT 1
- Obsoletion request: GO:0030242 autophagy of peroxisome HOT 3
- Obsoletion request: GO:0102028 cystathionine gamma-synthase activity (acts on O-phosphohomoserine) HOT 1
- True path violations under NON proteinogenic amino acid metabolic process HOT 6
- Obsoletion request: Wnt signaling pathway involved in x HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from go-ontology.