docs: CLAUDE.md proposals for 5 repos (EMR migration, identity-graph, geo-pipeline, agent-v3, batch-delivery)#7
Open
shrivastavakapil2000 wants to merge 2 commits into
Open
Conversation
Proposals generated from review of merged PRs by SayaliPat, shrivastavakapil2000, JoeVsVolcano, mike-brant, and nathan-resonate. Each file documents what to add or create as CLAUDE.md in the corresponding repo: - step-function-workflow-orchestrator: decommissioned pipelines (fusion-behavior-preprocess, cookiejar-sample-export), EMR 7.12/Spark 3 migration (7 pipelines), geo-location changes - batch-expression-modeling: BlockGraph delivery config, formatter path layout change, formatter metrics lambda, batch stitch throttling (CDP-118913/118857/118972) - identity-graph: new CLAUDE.md for PRISM identity system (11 Spark jobs + prism_dbt v1.0) - kshrivastava: new CLAUDE.md for V3 multi-agent orchestration framework (Temporal + roles + skills) - dos-data-pipeline: new CLAUDE.md for geo-location ETL (district_source, 4-CSV mapping, backfill rewrite) https://claude.ai/code/session_014wxhfqnUnnc6kcDYEgWDx3
There was a problem hiding this comment.
Pull request overview
This PR adds draft CLAUDE.md content under claude-md-proposals/ for manual copy/application into five separate repositories, based on recent merged PR activity and operational learnings (EMR/Spark migrations, identity-graph, geo-location pipeline, multi-agent framework, and batch delivery).
Changes:
- Adds proposed
CLAUDE.mdappend-only updates forstep-function-workflow-orchestratorandbatch-expression-modeling. - Adds proposed new
CLAUDE.mdfiles foridentity-graph,kshrivastava, anddos-data-pipeline. - Captures repo-specific operational guidance (Spark/EMR versions, deployment rules, config keys, and recent bug-fix notes).
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.
Show a summary per file
| File | Description |
|---|---|
| claude-md-proposals/step-function-workflow-orchestrator.CLAUDE.md | Proposed CLAUDE.md append sections covering decommissioned pipelines, EMR 7/Spark 3 migration notes, and geo-location recent changes. |
| claude-md-proposals/batch-expression-modeling.CLAUDE.md | Proposed CLAUDE.md append section documenting vendor-config additions and batch delivery operational changes. |
| claude-md-proposals/identity-graph.CLAUDE.md | Proposed new CLAUDE.md describing PRISM architecture, repo layout, and Snowflake/dbt deployment rules. |
| claude-md-proposals/kshrivastava.CLAUDE.md | Proposed new CLAUDE.md describing the Temporal-based V3 multi-agent orchestration framework and conventions. |
| claude-md-proposals/dos-data-pipeline.CLAUDE.md | Proposed new CLAUDE.md describing DOS geo-location pipeline architecture and Spark 2 constraints. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
…trator proposal - Correct header: "two sections" → "three sections" (Decommissioned Pipelines, EMR 7.12.0/Spark 3 Migration, and Geo-Location Pipeline Recent Changes) - Fix arg name: myZipDistrictMappingsBasePath → zipDistrictMappingsBasePath (consistent with dos-data-pipeline.CLAUDE.md and PR description) https://claude.ai/code/session_014wxhfqnUnnc6kcDYEgWDx3
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
CLAUDE.md maintenance pass triggered by recent merged PR activity from:
SayaliPat,shrivastavakapil2000,JoeVsVolcano,mike-brant,nathan-resonate.This PR adds proposed CLAUDE.md content for 5 repos under
claude-md-proposals/. Because this session only has write access toresonate/.github, the proposals are stored here for manual application to each repo.Proposals
1.
step-function-workflow-orchestrator→ append to existing CLAUDE.mdFile:
claude-md-proposals/step-function-workflow-orchestrator.CLAUDE.mdNew sections to append:
fusion-behavior-preprocess(PR #734) andcookiejar-sample-export(PR #748) — removed from Terraform, AWS resources still need cleanup2.
batch-expression-modeling→ append to existing CLAUDE.mdFile:
claude-md-proposals/batch-expression-modeling.CLAUDE.mdNew section to append:
stitch_columns(plural),audience_bitmap_path = person_jar,stitch_table_name = person_identity_graph_betadate=*/vendor=*/method=*/akey=*format.count,format.aggregate) viabatch-delivery-formatter-publish-metricsMaxConcurrency=2, stagger byMap.Item.Index3.
identity-graph→ create new CLAUDE.mdFile:
claude-md-proposals/identity-graph.CLAUDE.mdComplete new CLAUDE.md covering:
prism_dbtv1.0 — 4 primitives:waterfall_match,identifier_expand,persons_project,lookupperson_id ≠ RID, ZIP11 routing4.
kshrivastava→ create new CLAUDE.mdFile:
claude-md-proposals/kshrivastava.CLAUDE.mdComplete new CLAUDE.md covering:
consultations_needed→ specialist →SpecialistOutputthreaded backavailable_specialistscatalog injected into every rolediscover_skills(),SKILL.mdrouting key)5.
dos-data-pipeline→ create new CLAUDE.mdFile:
claude-md-proposals/dos-data-pipeline.CLAUDE.mdComplete new CLAUDE.md covering:
district_sourceprovenance column:L2_CONFIRMED/L2_UNCONFIRMED/IP_INFERRED/nullL2_CONFIRMED(CDP-118947 fix)zipDistrictMappingsBasePathGeoLocationFullBackfillrewrite as daily-on-multi-day-pixels (CDP-118946)How to Apply
For each proposal file, copy the content to the corresponding repo. For
step-function-workflow-orchestratorandbatch-expression-modeling, append the content to the existing CLAUDE.md. Foridentity-graph,kshrivastava, anddos-data-pipeline, create a new CLAUDE.md at the repo root (strip the header comment line# This is a NEW file...).https://claude.ai/code/session_014wxhfqnUnnc6kcDYEgWDx3
Generated by Claude Code