Fixes based on observations during v2605 anaTuple production by kandrosov · Pull Request #243 · cms-flaf/FLAF

kandrosov · 2026-05-09T08:38:17Z

Re-implemented anaTuple file list creation logic
Implemented the run token server to limit load on AFS
Detect empty nanoAOD files and skip them
Added LO DY for 10 < m_ll < 50 GeV
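
For illustration, the empty-input skip could look roughly like the sketch below. It is not the code from `AnaProd/tasks.py`: `split_empty_inputs` and the `count_events` callable are hypothetical names, and in a real job the count would presumably come from reading the `Events` tree of each nanoAOD file rather than from a lookup table.

```python
from typing import Callable, Iterable, List, Tuple


def split_empty_inputs(
    files: Iterable[str],
    count_events: Callable[[str], int],
) -> Tuple[List[str], List[str]]:
    """Return (usable, skipped); a file is skipped when it holds no events."""
    usable: List[str] = []
    skipped: List[str] = []
    for path in files:
        if count_events(path) > 0:
            usable.append(path)
        else:
            skipped.append(path)
    return usable, skipped


# Hypothetical event counts standing in for a real read of each file.
fake_counts = {"nano_1.root": 1200, "nano_2.root": 0, "nano_3.root": 57}
usable, skipped = split_empty_inputs(fake_counts, fake_counts.__getitem__)
print("usable:", usable)
print("skipped:", skipped)
```

Passing the counter as a callable keeps the selection logic independent of the I/O library, so it can be exercised without any ROOT files at hand.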

kandrosov · 2026-05-09T08:50:25Z

@cms-flaf-bot please test

HH_bbWW_version=PR_75
HH_bbtautau_version=PR_58

cms-flaf-bot · 2026-05-09T08:50:38Z

pipeline#14733800 started

Copilot

Pull request overview

This PR updates FLAF’s anaTuple production workflow based on observations from the v2605 production. It adds a run token server that rate-limits job starts to reduce AFS load, improves anaTuple file list and merge-plan creation (including run-aware merging for data), skips empty nanoAOD inputs, and extends DY dataset coverage with LO DY samples in the 10–50 GeV mass window.

Changes:

Added a TCP token server + client and integrated it into HTCondor job bootstrap to throttle job starts per analysis path.
Reworked anaTuple merge planning to be run-aware for data (emit per-item runs and apply run filtering during merge) and enhanced file list creation (skip unavailable/empty inputs).
Added LO DY (10 < m_ll < 50 GeV) dataset entries across Run3 eras and updated dataset exception patterns.
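
As a rough sketch of the throttling idea (not the actual `run_tools/run_token_server.py`), the per-path rate limit can be modeled as a small scheduler that hands each request the next free time slot for its path. The class name `PathRateLimiter`, the `min_interval` parameter, and the path labels are assumptions made for this example; the TCP layer and the real protocol are left out.

```python
import time
from typing import Callable, Dict


class PathRateLimiter:
    """Grant at most one token per `min_interval` seconds for each path."""

    def __init__(
        self,
        min_interval: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.min_interval = min_interval
        self.clock = clock
        # Earliest time at which the next token for a path may be used.
        self._next_free: Dict[str, float] = {}

    def wait_time(self, path: str) -> float:
        """Reserve the next slot for `path`; return the delay the caller should wait."""
        now = self.clock()
        slot = max(now, self._next_free.get(path, now))
        self._next_free[path] = slot + self.min_interval
        return slot - now


# A frozen clock makes the behaviour easy to follow: three jobs for the
# same (hypothetical) path arriving at once are spaced 2 s apart.
limiter = PathRateLimiter(min_interval=2.0, clock=lambda: 100.0)
waits_a = [limiter.wait_time("analysis_A") for _ in range(3)]
wait_b = limiter.wait_time("analysis_B")
print(waits_a, wait_b)
```

Because slots are tracked per path, a burst of jobs for one analysis path does not delay jobs that source their environment from a different one.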

Reviewed changes

Copilot reviewed 17 out of 17 changed files in this pull request and generated 6 comments.


| File | Description |
| --- | --- |
| `test/test_run_token_server.py` | Adds a small client script to smoke-test the token server protocol. |
| `test/hello_world_task.py` | Adds a minimal LAW task for basic workflow testing. |
| `run_tools/run_token_server.py` | Implements the TCP token server that rate-limits tokens per path. |
| `run_tools/law_customizations.py` | Wires token-server settings into HTCondor job rendering and ships the token client script with jobs. |
| `run_tools/get_run_token.py` | Implements the token client used by `bootstrap.sh` before sourcing the environment. |
| `bootstrap.sh` | Requests a run token before sourcing `env.sh` to reduce AFS stampede effects. |
| `AnaProd/tasks.py` | Skips empty nanoAOD files; updates merge-plan creation call; propagates `runs` into merge step. |
| `AnaProd/MergeAnaTuples.py` | Applies a per-merge-item run filter for data before duplicate-event filtering. |
| `AnaProd/AnaTupleFileList.py` | Re-implements file clustering and merge schema planning, including run-aware contamination avoidance for data. |
| `config/Run3_2024/datasets.yaml` | Adds LO DY 10–50 samples (per-flavor) for Run3_2024. |
| `config/Run3_2023BPix/datasets.yaml` | Adds LO DY 10–50 sample for Run3_2023BPix. |
| `config/Run3_2023/datasets.yaml` | Adds LO DY 10–50 sample for Run3_2023. |
| `config/Run3_2022EE/datasets.yaml` | Adds LO DY 10–50 sample for Run3_2022EE. |
| `config/Run3_2022/datasets.yaml` | Adds LO DY 10–50 sample for Run3_2022. |
| `config/dataset_exceptions.yaml` | Extends dataset consistency exceptions for the new DY LO samples. |
| `config/law.cfg` | Registers the `test.hello_world_task` module with LAW. |
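
The ordering described for `AnaProd/MergeAnaTuples.py` (run filter first, duplicate-event filter second) can be illustrated with plain Python. This is only a sketch under assumptions: `select_events` is a hypothetical helper, events are modeled as dictionaries keyed by the nanoAOD branch names `run`, `luminosityBlock` and `event`, and the run numbers are made up; the real merge step works on the anaTuple files themselves.

```python
from typing import Dict, Iterable, List, Optional, Set, Tuple

Event = Dict[str, int]  # expected keys: run, luminosityBlock, event


def select_events(
    events: Iterable[Event],
    runs: Optional[Set[int]] = None,
) -> List[Event]:
    """Keep events from the allowed runs, then drop repeated event IDs."""
    seen: Set[Tuple[int, int, int]] = set()
    selected: List[Event] = []
    for ev in events:
        # Step 1: per-merge-item run filter (no filter when runs is None).
        if runs is not None and ev["run"] not in runs:
            continue
        # Step 2: duplicate filter on the (run, lumi, event) triple.
        key = (ev["run"], ev["luminosityBlock"], ev["event"])
        if key in seen:
            continue
        seen.add(key)
        selected.append(ev)
    return selected


# Made-up events: one duplicate and one event from a run outside the item.
events = [
    {"run": 1001, "luminosityBlock": 1, "event": 10},
    {"run": 1001, "luminosityBlock": 1, "event": 10},  # duplicate
    {"run": 1002, "luminosityBlock": 4, "event": 77},  # other run
    {"run": 1001, "luminosityBlock": 2, "event": 11},
]
kept = select_events(events, runs={1001})
print([(e["run"], e["luminosityBlock"], e["event"]) for e in kept])
```

Filtering by run before deduplicating means events from runs that belong to another merge item never enter the `seen` set, so each output file only has to track the event IDs it can actually contain.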

cms-flaf-bot · 2026-05-09T09:22:59Z

pipeline#14733800 failed

kandrosov · 2026-05-09T14:08:41Z

@cms-flaf-bot please test

HH_bbWW_version=PR_75
HH_bbtautau_version=PR_58

cms-flaf-bot · 2026-05-09T14:09:01Z

pipeline#14734901 started

cms-flaf-bot · 2026-05-09T14:47:52Z

pipeline#14734901 failed

cms-flaf-bot · 2026-05-09T15:46:54Z

pipeline#14734901 passed

kandrosov added 9 commits May 7, 2026 15:55

Detect empty nanoAOD files and skip them

d874535

CreateMergeSchema: limit number of steps to 10k

52ab83f

Implemented run token server.

97d568c

Fixed run token logic

d86465b

Added DYto2L_MLL_10to50_madgraphMLM dataset

9c6fd82

Re-implemented anaTuple file list creation logic

ae55dad

formatting

723da93

added exceptions for LO low mass DY

74aeed7

added 2025

25dc769

kandrosov requested a review from Copilot May 9, 2026 08:51

Copilot started reviewing on behalf of kandrosov May 9, 2026 08:52

Copilot AI reviewed May 9, 2026


Comment thread run_tools/run_token_server.py

Comment thread config/Run3_2024/datasets.yaml

Comment thread config/dataset_exceptions.yaml

Comment thread run_tools/get_run_token.py Outdated

Comment thread test/test_run_token_server.py Outdated

Comment thread run_tools/run_token_server.py

bugfixes

df162da

kandrosov merged commit 4f2aed1 into cms-flaf:main May 10, 2026
5 checks passed

kandrosov merged 10 commits into cms-flaf:main from kandrosov:v2605_fixes

3 participants
