Make class `DataGeneration` abstract #74

aclum · 2024-01-19T19:17:49Z

Make DataGeneration an abstract class, only subclasses should be used. Remove datgen as a valid typecode. Move omprc as a vaild typecode for subclasses.

This PR is needed so we can migrate slots off of other classes onto either NucleotideSequencing or MassSpectrometry

Make class DataGeneration abstract, updated allowable typecodes for subclasses.

…class

brynnz22 · 2024-01-22T17:58:49Z

This looks good to me. The only thing I noticed, I commented each I found above. In the example data files, sometimes the prefix dgms is used and sometimes dgns is used. Based on the changes to the syntax in the schema, I think we just want dgns correct? Otherwise, it looks good to me.

aclum · 2024-01-24T18:46:45Z

dgms is the typecode for Class MassSpectrometry, dgns is for NucleotideSequencing so we need both typecodes

turbomam

thanks for the good schema changes and accompanying changes to the valid and invalid example files.

I have some questions bout whether this class really should be abstract or not. But I might just be confused. We should be able to talk through it pretty quickly.

turbomam · 2024-01-24T18:59:09Z

src/data/invalid/ChromatographicSeparationProcess_wrong_associated_study.yaml

@@ -38,6 +38,6 @@ ordered_mobile_phases:
 chromatographic_category: gas_chromatography


these are great invalid file names

how is nmdc-schema supposed to know that the wrong study was associated?

I didn't make this file originally, what is meant by that is the value for associated_study is not the correct typecode/Class.

turbomam · 2024-01-24T19:01:07Z

src/schema/nmdc.yaml

@@ -152,7 +152,7 @@ classes:
    slot_usage:
      id:


I would like to move away from asserting id patterns in slot_usages because it implies that the range is a string, not a class

I will deal with this in the LinkML repo

turbomam · 2024-01-24T19:03:40Z

src/schema/nmdc.yaml

      has_input:
        required: true
        pattern: "^nmdc:(bsm|procsm)-[0-9][a-z]{0,6}[0-9]-[A-Za-z0-9]{1,}(\\.[A-Za-z0-9]{1,})*(_[A-Za-z0-9_\\.-]+)?$"
        comments:
          - pattern should allow typecode for Biosample and ProcessedSample
      part_of:
        range: DataGeneration
-        pattern: "^nmdc:datgen-[0-9][a-z]{0,6}[0-9]-[A-Za-z0-9]{1,}(\\.[A-Za-z0-9]{1,})*(_[A-Za-z0-9_\\.-]+)?$" # better applied as a structured_pattern, but even that might get out of sync from the asserted Study structured_pattern


is this the pattern for a DataGeneration's id? if DataGeneration is abstract, then it can't be instantiated. So an id pattern is irrelevant. In that case, we only have to worry about asserting the patterns for the subclasses.

I'm starting to think that we don't really want DataGeneration to be abstract

mslarae13 · 2024-03-27T20:41:23Z

Requires migration, but done in parallel with another PR

aclum and others added 2 commits January 19, 2024 08:47

Update nmdc.yaml

a9f5453

Make class DataGeneration abstract, updated allowable typecodes for subclasses.

Fixing invalid and valid data after making DataGeneration abstract

4513410

aclum changed the title ~~1617 data generation typecodes~~ 1617 Make class DataGeneration abstract Jan 19, 2024

aclum linked an issue Jan 19, 2024 that may be closed by this pull request

keep omprc typcode for DataGeneration subclasses MassSpectrometry and NucleotideSequencing and make DataGeneration abstract microbiomedata/nmdc-schema#1617

Closed

aclum added 2 commits January 19, 2024 11:52

Changing the test file names since DataGeneration is now an abstract …

ebaa754

…class

fix typocode typo

a4aad43

aclum requested review from turbomam and brynnz22 January 19, 2024 20:17

aclum self-assigned this Jan 19, 2024

turbomam reviewed Jan 24, 2024

View reviewed changes

turbomam self-requested a review January 24, 2024 19:30

turbomam approved these changes Jan 24, 2024

View reviewed changes

Improving invalid example data file name and description.

7792c2f

aclum merged commit dbe5b23 into main Jan 24, 2024
2 checks passed

aclum deleted the 1617-DataGeneration-typecodes branch January 24, 2024 23:16

brynnz22 changed the title ~~1617 Make class DataGeneration abstract~~ 1617 Make class DataGeneration abstract Mar 27, 2024

brynnz22 changed the title ~~1617 Make class DataGeneration abstract~~ Make class DataGeneration abstract Mar 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make class `DataGeneration` abstract #74

Make class `DataGeneration` abstract #74

aclum commented Jan 19, 2024

brynnz22 commented Jan 22, 2024

aclum commented Jan 24, 2024

turbomam left a comment

turbomam Jan 24, 2024

turbomam Jan 24, 2024

aclum Jan 24, 2024

turbomam Jan 24, 2024

turbomam Jan 24, 2024

turbomam Jan 24, 2024

mslarae13 commented Mar 27, 2024

		@@ -38,6 +38,6 @@ ordered_mobile_phases:
		chromatographic_category: gas_chromatography

Make class DataGeneration abstract #74

Make class DataGeneration abstract #74

Conversation

aclum commented Jan 19, 2024

brynnz22 commented Jan 22, 2024

aclum commented Jan 24, 2024

turbomam left a comment

Choose a reason for hiding this comment

turbomam Jan 24, 2024

Choose a reason for hiding this comment

turbomam Jan 24, 2024

Choose a reason for hiding this comment

aclum Jan 24, 2024

Choose a reason for hiding this comment

turbomam Jan 24, 2024

Choose a reason for hiding this comment

turbomam Jan 24, 2024

Choose a reason for hiding this comment

turbomam Jan 24, 2024

Choose a reason for hiding this comment

mslarae13 commented Mar 27, 2024

Make class `DataGeneration` abstract #74

Make class `DataGeneration` abstract #74