GithubHelp home page GithubHelp logo

MongoDB data repair and schema loosening. DON'T FORGET TO RESOTRE THE `id`'s `strucutred_patterns` like 'syntax: "{id_nmdc_prefix}:'... about nmdc-schema HOT 10 OPEN

turbomam avatar turbomam commented on July 20, 2024
MongoDB data repair and schema loosening. DON'T FORGET TO RESOTRE THE `id`'s `strucutred_patterns` like 'syntax: "{id_nmdc_prefix}:'...

from nmdc-schema.

Comments (10)

turbomam avatar turbomam commented on July 20, 2024

Added the following targets to project.Makefile

  • local/mongodb-collection-report.txt
  • local/selected_mongodb_contents.json
  • dump-validate-report-convert-mongodb
  • mongodb-cleanup
  • local/selected_mongodb_contents_jsonschema_check.txt
  • linkml-validate-mongodb
  • local/selected_mongodb_contents.yaml
  • local/selected_mongodb_contents.ttl

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Looser alternative identifier pattern in src/schema/basic_slots.yaml

--- a/src/schema/basic_slots.yaml
+++ b/src/schema/basic_slots.yaml
@@ -98,7 +98,7 @@ slots:
multivalued: true
description: >-
A list of alternative identifiers for the entity.

old: pattern: '^[a-zA-Z_][a-zA-Z0-9_]:[a-zA-Z_][a-zA-Z0-9_]$'
new: pattern: '^[a-zA-Z0-9][a-zA-Z0-9_.]+:[a-zA-Z0-9_][a-zA-Z0-9_-/.]*$'

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Removed id's structured_patterns on several classes in src/schema/core.yaml. For lenience with legacy ids in MongoDB. Should be reintroduced for production/main/release.

Added a pattern for id in PlannedProcess

slot_usage:
   id:
     pattern: '^[a-zA-Z0-9][a-zA-Z0-9_\.]+:[a-zA-Z0-9_][a-zA-Z0-9_\-\/\.]*$'

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Loosened patterns in src/schema/external_identifiers.yaml

old: pattern: '^[a-zA-Z0-9][a-zA-Z0-9_.]+:[a-zA-Z0-9_][a-zA-Z0-9_/.]*$'

new: pattern: '^[a-zA-Z0-9][a-zA-Z0-9_.]+:[a-zA-Z0-9_][a-zA-Z0-9_-/.]*$'

renamed external identifier type to external_identifier

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Removed id's structured_patterns on several classes in src/schema/nmdc.yaml. For lenience with legacy ids in MongoDB. Should be reintroduced for production/main/release.

Added dubious emsl prefix with http://example.org/emsl_in_mongodb/ expansion because it appears in MongoDB records.

  • which records in which collections?

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Removed id's structured_patterns on several classes in src/schema/prov.yaml. For lenience with legacy ids in MongoDB. Should be reintroduced for production/main/release.

changed range of started_at_time and ended_at_time from datetime to (implicit?) string.

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Removed id's structured_patterns on several classes in src/schema/sample_prep.yaml. For lenience with legacy ids in MongoDB. Should be reintroduced for production/main/release.

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

Removed id's structured_patterns on several classes in src/schema/workflow_execution_activity.yaml. For lenience with legacy ids in MongoDB. Should be reintroduced for production/main/release.

made was_informed_by optional in class ???

  • revert that?

gave used in classed MetabolomicsAnalysisActivity, MetaproteomicsAnalysisActivity and NomAnalysisActivity an implicit string range, instead of Instrument. This current usage ofused is more in the spirit of instrument_name, for which a new enum has been created but not applied.

from nmdc-schema.

turbomam avatar turbomam commented on July 20, 2024

to find the structured_patterns of ids that need to be restores, search for

syntax: "{id_nmdc_prefix}:

from nmdc-schema.

aclum avatar aclum commented on July 20, 2024

Is this complete?

from nmdc-schema.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.