Skip to content

Fixes based on observations during v2605 anaTuple production #243

Merged
kandrosov merged 10 commits into
cms-flaf:mainfrom
kandrosov:v2605_fixes
May 10, 2026
Merged

Fixes based on observations during v2605 anaTuple production #243
kandrosov merged 10 commits into
cms-flaf:mainfrom
kandrosov:v2605_fixes

Conversation

@kandrosov
Copy link
Copy Markdown
Contributor

@kandrosov kandrosov commented May 9, 2026

  • Re-implemented anaTuple file list creation logic
  • Implemented the run token server to limit load on AFS
  • Detect empty nanoAOD files and skip them
  • Added LO DY for 10 < m_ll < 50 GeV

@kandrosov
Copy link
Copy Markdown
Contributor Author

@cms-flaf-bot please test

  • HH_bbWW_version=PR_75
  • HH_bbtautau_version=PR_58

@cms-flaf-bot
Copy link
Copy Markdown
Collaborator

pipeline#14733800 started

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates FLAF’s anaTuple production workflow based on v2605 observations by adding a run token server to rate-limit job starts (reducing AFS load), improving anaTuple file list / merge-plan creation (including run-aware merging for data), skipping empty nanoAOD inputs, and extending DY dataset coverage with LO DY samples in the 10–50 GeV mass window.

Changes:

  • Added a TCP token server + client and integrated it into HTCondor job bootstrap to throttle job starts per analysis path.
  • Reworked anaTuple merge planning to be run-aware for data (emit per-item runs and apply run filtering during merge) and enhanced file list creation (skip unavailable/empty inputs).
  • Added LO DY (10 < m_ll < 50 GeV) dataset entries across Run3 eras and updated dataset exception patterns.

Reviewed changes

Copilot reviewed 17 out of 17 changed files in this pull request and generated 6 comments.

Show a summary per file
File Description
test/test_run_token_server.py Adds a small client script to smoke-test the token server protocol.
test/hello_world_task.py Adds a minimal LAW task for basic workflow testing.
run_tools/run_token_server.py Implements the TCP token server that rate-limits tokens per path.
run_tools/law_customizations.py Wires token-server settings into HTCondor job rendering and ships the token client script with jobs.
run_tools/get_run_token.py Implements the token client used by bootstrap.sh before sourcing the environment.
bootstrap.sh Requests a run token before sourcing env.sh to reduce AFS stampede effects.
AnaProd/tasks.py Skips empty nanoAOD files; updates merge-plan creation call; propagates runs into merge step.
AnaProd/MergeAnaTuples.py Applies a per-merge-item run filter for data before duplicate-event filtering.
AnaProd/AnaTupleFileList.py Re-implements file clustering and merge schema planning, including run-aware contamination avoidance for data.
config/Run3_2024/datasets.yaml Adds LO DY 10–50 samples (per-flavor) for Run3_2024.
config/Run3_2023BPix/datasets.yaml Adds LO DY 10–50 sample for Run3_2023BPix.
config/Run3_2023/datasets.yaml Adds LO DY 10–50 sample for Run3_2023.
config/Run3_2022EE/datasets.yaml Adds LO DY 10–50 sample for Run3_2022EE.
config/Run3_2022/datasets.yaml Adds LO DY 10–50 sample for Run3_2022.
config/dataset_exceptions.yaml Extends dataset consistency exceptions for the new DY LO samples.
config/law.cfg Registers the test.hello_world_task module with LAW.

Comment thread run_tools/run_token_server.py
Comment thread config/Run3_2024/datasets.yaml
Comment thread config/dataset_exceptions.yaml
Comment thread run_tools/get_run_token.py Outdated
Comment thread test/test_run_token_server.py Outdated
Comment thread run_tools/run_token_server.py
@cms-flaf-bot
Copy link
Copy Markdown
Collaborator

pipeline#14733800 failed

@kandrosov
Copy link
Copy Markdown
Contributor Author

@cms-flaf-bot please test

  • HH_bbWW_version=PR_75
  • HH_bbtautau_version=PR_58

@cms-flaf-bot
Copy link
Copy Markdown
Collaborator

pipeline#14734901 started

@cms-flaf-bot
Copy link
Copy Markdown
Collaborator

pipeline#14734901 failed

@cms-flaf-bot
Copy link
Copy Markdown
Collaborator

pipeline#14734901 passed

@kandrosov kandrosov merged commit 4f2aed1 into cms-flaf:main May 10, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants