Skip to content

Optimize Clinvar Control Refresh Job #649

@bencap

Description

@bencap

With the introduction of #627, the ClinVar control refresh job will be invoked at a per score-set level with a specific month and year version to refresh. We should make an enhancement to this job routine that allows the job to be invoked with a list of versions to refresh. Since variants in ClinGen will only have one associated ClinVar control, we should only need to make one request to the service to fetch it. The current workflow though will make a request for each version.

An updated workflow might look like:

  • Fetch each variant-summary TSV from ClinVar
  • Make a request to ClinGen to fetch the associated ClinVar accession for a variant
  • Associate this accession with our mapped variant based on each TSV

This would significantly reduce the job time by eliminating N-1 ClinGen requests.

Metadata

Metadata

Assignees

No one assigned

    Labels

    app: workerTask implementation touches the workertype: enhancementEnhancement to an existing feature

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions