
Support for Base Editor Data#395

Merged
bencap merged 37 commits into release-2025.2.0 from
feature/bencap/317/base-editor-data
May 8, 2025

Conversation

@bencap
Collaborator

@bencap bencap commented Mar 3, 2025

This change:

  • Refactors dataframe validation logic and its tests to better reflect the intended dependency separation between mavedb-api[server] and the core package (this is a fairly involved change).
  • Adds base editor validation logic to dataframe validation and to the underlying database models for accession-based score sets.

@bencap bencap requested review from jstone-dev and sallybg March 3, 2025 20:23
@bencap bencap linked an issue Mar 3, 2025 that may be closed by this pull request
@bencap bencap force-pushed the feature/bencap/317/base-editor-data branch 5 times, most recently from ecae79e to 5c1a51a Compare March 13, 2025 21:44
@bencap bencap changed the base branch from release-2025.1.1 to release-2025.2.0 March 19, 2025 17:12
@bencap bencap force-pushed the feature/bencap/317/base-editor-data branch 2 times, most recently from 31e1e47 to 05a70ca Compare March 19, 2025 21:59
@bencap bencap force-pushed the feature/bencap/317/base-editor-data branch 2 times, most recently from 2f2f4b4 to 767b0a5 Compare April 4, 2025 13:15
@bencap
Collaborator Author

bencap commented Apr 28, 2025

Tests will pass after the release of VariantEffect/mavehgvs#40 and VariantEffect/mavehgvs#41.

bencap added 15 commits May 6, 2025 08:43
Refactors dataframe validation logic into three component files: column.py, dataframe.py, and variant.py.
This simplifies the validation structure and logically separates validation functions by the part of the dataframe they operate on.
Refactors most of the test suite to better identify dependency separation problems. Validation tests
may now be run with only core (and dev) dependencies installed, and fixtures that depend on server dependencies are loaded conditionally, based on which modules are installed.
With this change, it is much more straightforward to identify dependency 'leaks': server dependencies that have mistakenly made their way into validation code.
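As a rough illustration of conditional fixture loading, the sketch below picks which fixture modules to register depending on whether server-only packages can be found. It is not the code from this PR: the module names in `SERVER_ONLY_MODULES`, the helper names, and the plugin paths in the usage note are all hypothetical.

```python
import importlib.util

# Hypothetical examples of packages that only the server extra would install.
SERVER_ONLY_MODULES = ("fastapi", "sqlalchemy")


def has_modules(names):
    """Return True only if every named top-level module can be located."""
    return all(importlib.util.find_spec(name) is not None for name in names)


def select_fixture_plugins(core_plugins, server_plugins, server_modules=SERVER_ONLY_MODULES):
    """Always load core fixtures; add server fixtures only when their dependencies exist."""
    plugins = list(core_plugins)
    if has_modules(server_modules):
        plugins.extend(server_plugins)
    return plugins
```

In a top-level conftest.py, the result could be assigned to `pytest_plugins`, for example `pytest_plugins = select_fixture_plugins(["tests.fixtures.core"], ["tests.fixtures.server"])`, where both plugin paths are placeholders.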
bencap and others added 14 commits May 6, 2025 09:21
This allows the use of the VS Code pytest extension while still preventing external connections. Enabling this socket makes it easier to run tests from within the code editor.
The hgvs package cannot parse allelic variation (multi-variants denoted by brackets), which
is often a key variant string in base editor data. We work around this by:
- Parsing the multi-variant into MaveHGVS without any target info to check that it is syntactically valid
- Parsing each subvariant against the provided transcript to check that it is valid against that transcript
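The second step of this workaround can be sketched in plain Python. This is only an illustration under stated assumptions: it does not call mavehgvs or hgvs, the function names are hypothetical, and `validate_single` stands in for whatever transcript-aware parser is actually used. The syntactic check on the whole bracketed string (the first step) is assumed to have already happened.

```python
import re

# Matches a bracketed multi-variant such as "c.[1A>G;5C>T]".
MULTI_VARIANT = re.compile(r"^(?P<prefix>[a-z])\.\[(?P<body>[^\[\]]+)\]$")


def split_multi_variant(variant):
    """Split a bracketed multi-variant into single variants that keep the prefix."""
    match = MULTI_VARIANT.match(variant)
    if match is None:
        # Not a multi-variant: treat it as a single variant.
        return [variant]
    prefix = match.group("prefix")
    return [f"{prefix}.{part}" for part in match.group("body").split(";")]


def find_invalid_subvariants(variant, accession, validate_single):
    """Return the subvariants that `validate_single` rejects for this accession.

    `validate_single` is a hypothetical callable that takes a fully qualified
    string such as "ACCESSION:c.1A>G" and returns True when it is valid.
    """
    return [
        sub
        for sub in split_multi_variant(variant)
        if not validate_single(f"{accession}:{sub}")
    ]
```

Calling `split_multi_variant("c.[1A>G;5C>T]")` yields the two single variants `c.1A>G` and `c.5C>T`, each of which can then be checked on its own against the transcript.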
Adds tests for multi-variant validation for accession-based variants. As part of this change,
an additional transcript was added so that tests cover genomic-based protein variants in addition to
nucleotide-based variants.
Prior to this, we were (unintentionally) not really using SeqRepo for transcript resolution. Note that to use SeqRepo in this manner, a new environment variable, `HGVS_SEQREPO_DIR`, should be set.
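A deployment might want to fail early when `HGVS_SEQREPO_DIR` is missing or points at the wrong place. The helper below is a minimal sketch of such a check, not part of this PR; the function name and the choice to return `None` when the variable is unset are assumptions.

```python
import os
from pathlib import Path


def seqrepo_dir_from_env(environ=None):
    """Return the configured SeqRepo directory, or None if the variable is unset.

    Raises FileNotFoundError when the variable is set but is not a directory.
    """
    environ = os.environ if environ is None else environ
    value = environ.get("HGVS_SEQREPO_DIR")
    if not value:
        return None
    path = Path(value)
    if not path.is_dir():
        raise FileNotFoundError(f"HGVS_SEQREPO_DIR is not a directory: {value}")
    return path
```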
This change will need to pull in the new MaveHGVS version.
@jstone-dev jstone-dev force-pushed the feature/bencap/317/base-editor-data branch from 767b0a5 to 6226c93 Compare May 6, 2025 16:52
@bencap bencap force-pushed the feature/bencap/317/base-editor-data branch from e47c591 to 9c29649 Compare May 8, 2025 20:47
SQLAlchemy (SA) complains about uninitialized names for models when some are not imported. These imports
usually happen in the server_main file, but this becomes an issue when testing with only core dependencies,
since not all models are loaded and that file is no longer imported by the test suite. Do this
step up front in the top-level conftest file instead, to ensure we load all model objects into
the SA namespace for testing.
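One generic way to make sure every model module has been imported before tests run is to import each submodule of the models package up front. The helper below is a sketch of that idea using only the standard library. It is not the code from this PR, and the package name in the usage note is an assumption.

```python
import importlib
import pkgutil


def import_submodules(package_name):
    """Import every direct submodule of a package and return their names.

    Importing a module that defines ORM models registers those classes, so
    string references between models can be resolved later.
    """
    package = importlib.import_module(package_name)
    imported = []
    for info in pkgutil.iter_modules(package.__path__, prefix=f"{package_name}."):
        importlib.import_module(info.name)
        imported.append(info.name)
    return imported
```

A top-level conftest.py could then call something like `import_submodules("mavedb.models")` before any fixtures are defined, assuming that is where the model modules live.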
@bencap bencap merged commit dc6559a into release-2025.2.0 May 8, 2025
8 checks passed
@bencap bencap deleted the feature/bencap/317/base-editor-data branch May 8, 2025 23:09
@bencap bencap mentioned this pull request Jun 11, 2025

Development

Successfully merging this pull request may close these issues.

Storing Base Editor Data in MaveDB

2 participants