
Multipoles analysis #157

Open

p-slash wants to merge 46 commits into master from r-mu-binning-merge-master
Conversation


p-slash (Collaborator) commented Mar 24, 2026

This PR adds a version of the multipoles analysis code. I resolved some heavy merge conflicts, and I can reproduce my analysis with this code; however, the baseline analysis should still be checked. One notable shortcoming: small scales cannot be marginalized while also applying multipole compression.

p-slash added 30 commits June 23, 2025 14:04

codecov bot commented Mar 24, 2026

Codecov Report

❌ Patch coverage is 28.34138% with 445 lines in your changes missing coverage. Please review.
✅ Project coverage is 35.59%. Comparing base (88b62bd) to head (c86d244).

Files with missing lines Patch % Lines
vega/metals.py 30.37% 187 Missing and 17 partials ⚠️
vega/data.py 19.31% 65 Missing and 6 partials ⚠️
vega/coordinates.py 31.64% 53 Missing and 1 partial ⚠️
vega/output.py 9.37% 29 Missing ⚠️
vega/vega_interface.py 30.00% 20 Missing and 1 partial ⚠️
vega/postprocess/fit_results.py 13.63% 19 Missing ⚠️
vega/utils.py 17.64% 14 Missing ⚠️
vega/power_spectrum.py 50.00% 4 Missing and 4 partials ⚠️
vega/scale_parameters.py 27.27% 7 Missing and 1 partial ⚠️
vega/analysis.py 0.00% 7 Missing ⚠️
... and 3 more
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #157      +/-   ##
==========================================
- Coverage   36.61%   35.59%   -1.03%     
==========================================
  Files          30       30              
  Lines        4296     4557     +261     
  Branches      815      854      +39     
==========================================
+ Hits         1573     1622      +49     
- Misses       2555     2755     +200     
- Partials      168      180      +12     


Copilot AI (Contributor) left a comment

Pull request overview

This PR integrates a “multipoles analysis” workflow into Vega, adding support for r–μ binning, multipole compression, and covariance corrections (Hartlap/Percival), plus updating outputs/readers to carry the extra metadata needed to reproduce the analysis.

Changes:

  • Add multipole (ℓ) support end-to-end: r–μ coordinates, Legendre bin averaging, P(k)→ξ transforms, and model/data/covariance conversions.
  • Add global-covariance Hartlap scaling and a Percival correction factor (propagated into FITS bestfit covariance/errors).
  • Update FITS output and postprocessing readers to include correlation sizes and multipole metadata.
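As a rough illustration of the Legendre bin-averaging / compression step described above (a minimal sketch with hypothetical names, not Vega's actual API), bin-averaged ξ(r, μ) on a μ ∈ [0, 1] grid can be projected onto even multipoles like this:

```python
import numpy as np
from numpy.polynomial.legendre import Legendre

def compress_to_multipoles(xi_rmu, mu_edges, ells=(0, 2, 4)):
    """Project xi(r, mu) onto Legendre multipoles via bin-averaged quadrature.

    xi_rmu has shape (n_r, n_mu); mu_edges has length n_mu + 1.
    Assumes mu in [0, 1] and symmetry in mu (even ells only), so the
    normalisation is (2*ell + 1) * integral over [0, 1].
    Returns an array of shape (len(ells), n_r).
    """
    mu = 0.5 * (mu_edges[:-1] + mu_edges[1:])  # bin centres
    dmu = np.diff(mu_edges)                    # bin widths
    out = np.empty((len(ells), xi_rmu.shape[0]))
    for i, ell in enumerate(ells):
        leg = Legendre.basis(ell)(mu)          # P_ell at bin centres
        out[i] = (2 * ell + 1) * np.sum(xi_rmu * leg * dmu, axis=1)
    return out
```

The compression is linear in ξ, which is why the same operation can be written as a matrix acting on the flattened r–μ vector (and hence applied to covariances and distortion matrices, as the changes below do).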

Reviewed changes

Copilot reviewed 17 out of 17 changed files in this pull request and generated 8 comments.

File Description
vega/vega_interface.py Adds global Hartlap/Percival handling and global-cov multipole conversion.
vega/utils.py Adds Legendre bin helpers and Percival correction function.
vega/templates/parameters.ini Updates default BAO/AP parameter defaults (epsilon, aap).
vega/scale_parameters.py Adds aiso_aap parametrisation and centralises allowed parametrisations.
vega/power_spectrum.py Adds optional r–μ binning mode for binning kernel G(k).
vega/postprocess/fit_results.py Extends correlation output metadata and adjusts legacy FITS reading.
vega/pktoxi.py Adds optional μ-smoothing for ξℓ Legendre evaluation.
vega/parameters/latex_names.txt Adds latex name for aap and adjusts aiso latex.
vega/parameters/default_values.txt Adjusts epsilon range and adds aap defaults.
vega/output.py Writes multipole-aware model grids, stores extra header metadata, applies Percival to bestfit cov/errors.
vega/model.py Threads r–μ binning/multipole logic into model evaluation and post-distortion transforms.
vega/minimizer.py Stores a p-value on the minimizer for output.
vega/metals.py Refactors metals computation and adds interpolation support for r–μ binning.
vega/data.py Adds r–μ coordinate mode, multipole compression, and (optional) Hartlap scaling on covariance.
vega/correlation_item.py Plumbs multipole configuration into correlation items.
vega/coordinates.py Introduces RMuCoordinates and consolidates common masking logic in base Coordinates.
vega/analysis.py Updates global-mock generation to use model/data masks in the global-cov pathway.


else:
    G[i, i] = eye_array(self.data[name].full_data_size)
G = block_array(G, format='csr')
self.global_cov = G.dot(G.dot(self.global_cov).T).T
Copilot AI commented Mar 24, 2026

Global covariance transformation for multipoles is currently G.dot(G.dot(C).T).T, which is a non-obvious way to compute G @ C @ G.T and can be slower / easier to get wrong. Prefer the explicit and standard form self.global_cov = G @ self.global_cov @ G.T (or G.dot(self.global_cov).dot(G.T)) for clarity and to avoid unnecessary transposes.

Suggested change
self.global_cov = G.dot(G.dot(self.global_cov).T).T
# Apply standard covariance transformation: C -> G C G^T
self.global_cov = G.dot(self.global_cov).dot(G.T)

p-slash (Collaborator, Author) replied:

Not sure if scipy sparse arrays support this. Needs testing
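For what it's worth, a quick standalone check (a sketch; assumes scipy ≥ 1.8 for `csr_array`) suggests the existing transpose form is algebraically identical to G C Gᵀ for any C, while keeping the sparse operand on the left of every product, which matters because `ndarray.dot(sparse)` can misbehave:

```python
import numpy as np
from scipy.sparse import csr_array

rng = np.random.default_rng(0)
# Random sparse compression matrix and a symmetric "covariance"
G = csr_array(rng.normal(size=(6, 6)) * (rng.random((6, 6)) < 0.4))
C = rng.normal(size=(6, 6))
C = C + C.T

# Form currently in the PR: the sparse G always appears on the left
old = G.dot(G.dot(C).T).T

# Dense reference: explicit G C G^T
Gd = G.toarray()
ref = Gd @ C @ Gd.T

# (G (G C)^T)^T = (G C^T G^T)^T = G C G^T for any C, so these agree
assert np.allclose(old, ref)
```

So the two forms should give the same result; whether `dense @ sparse` in the suggested `G @ C @ G.T` behaves correctly depends on the scipy/numpy versions in use, as the reply notes.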

Comment on lines +66 to +73
def percival_correction(nsamples, nbins, nparams):
    """Percival 2014 correction of the estimated parameter covariance.

    MNRAS, Volume 439, Issue 3, p.2531-2541
    """
    a = nsamples - nbins
    denom = (a - 1) * (a - 4)
    A, B = 2.0 / denom, (a - 2.0) / denom
    return (1.0 + B * (nbins - nparams)) / (1.0 + A + B * (nparams - 1))
Copilot AI commented Mar 24, 2026

percival_correction() can divide by zero or return nonsensical values when nsamples - nbins is ≤ 4 because denom = (a-1)*(a-4). Since this is used alongside Hartlap (which only enforces nsamples > nbins + 2), consider validating nsamples > nbins + 4 here and raising a clear ValueError when the correction is undefined.
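A guarded version along these lines might look like the following sketch (hypothetical names `hartlap_factor` / `percival_correction_checked`; the Hartlap factor is the standard (Nₛ − N_b − 2)/(Nₛ − 1)):

```python
import numpy as np

def hartlap_factor(nsamples, nbins):
    """Hartlap et al. 2007 debiasing factor for an inverse sample covariance."""
    if nsamples <= nbins + 2:
        raise ValueError(
            f"Hartlap factor undefined for nsamples={nsamples}, nbins={nbins}")
    return (nsamples - nbins - 2) / (nsamples - 1)

def percival_correction_checked(nsamples, nbins, nparams):
    """Percival et al. 2014 parameter-covariance correction, with the
    validity check suggested above (requires nsamples - nbins > 4)."""
    a = nsamples - nbins
    if a <= 4:
        raise ValueError(
            f"Percival correction undefined: nsamples - nbins = {a} must exceed 4")
    denom = (a - 1) * (a - 4)
    A, B = 2.0 / denom, (a - 2.0) / denom
    return (1.0 + B * (nbins - nparams)) / (1.0 + A + B * (nparams - 1))
```

With many more mocks than bins both factors approach 1, so the checks only bite in the regime where the corrections are genuinely ill-defined.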

Comment on lines +801 to +803
mult_matrix = np.zeros((n_out, nr * nmu))
for i, x in enumerate(np.nonzero(data_mask_ell)[0]):
    mult_matrix[mell[i], self.model_mask] = M[x, self.data_mask]
Copilot AI commented Mar 24, 2026

In _convert_to_multipoles(), the distortion-matrix remapping assumes self.data_mask and self.model_mask select the same number of elements (it assigns M[x, self.data_mask] into mult_matrix[..., self.model_mask]). If these masks differ, this will raise a shape error or silently mis-map bins. Add an explicit check that the masked sizes match (and/or build an index mapping between model/data coordinates) before doing this assignment.

Suggested change
mult_matrix = np.zeros((n_out, nr * nmu))
for i, x in enumerate(np.nonzero(data_mask_ell)[0]):
    mult_matrix[mell[i], self.model_mask] = M[x, self.data_mask]
# Build explicit index mappings between data/model coordinates and validate sizes
data_mask_indices = np.flatnonzero(self.data_mask)
model_mask_indices = np.flatnonzero(self.model_mask)
if data_mask_indices.size != model_mask_indices.size:
    raise ValueError(
        "In _convert_to_multipoles, data_mask and model_mask select different "
        f"numbers of elements (data: {data_mask_indices.size}, "
        f"model: {model_mask_indices.size}). "
        "Cannot safely remap distortion matrix."
    )
data_ell_indices = np.flatnonzero(data_mask_ell)
if data_ell_indices.size != mell.size:
    raise ValueError(
        "In _convert_to_multipoles, data and model ell masks select different "
        f"numbers of elements (data_ell: {data_ell_indices.size}, "
        f"model_ell: {mell.size}). "
        "Cannot safely remap distortion matrix."
    )
mult_matrix = np.zeros((n_out, nr * nmu))
for row_idx, x in zip(mell, data_ell_indices):
    mult_matrix[row_idx, model_mask_indices] = M[x, data_mask_indices]

Comment on lines +23 to +25
ndata: int
is_multipoles: bool
nell: int
Copilot AI commented Mar 24, 2026

CorrelationOutput now has new required fields (ndata, is_multipoles, nell) but not all constructor call sites were updated. In this file, read_correlations() still instantiates CorrelationOutput(...) without these args, which will raise TypeError at runtime. Either give these new fields defaults in the dataclass, or pass them everywhere CorrelationOutput is constructed (including the non-legacy reader).

Suggested change
ndata: int
is_multipoles: bool
nell: int
ndata: int = 0
is_multipoles: bool = False
nell: int = 0

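For illustration, a minimal stand-in dataclass (hypothetical; not the real `CorrelationOutput`, whose other fields are elided here) shows why defaults keep older call sites working:

```python
from dataclasses import dataclass

@dataclass
class CorrelationOutputSketch:
    # Hypothetical minimal stand-in for CorrelationOutput.
    # Note: in a dataclass, fields with defaults must come after
    # fields without defaults.
    name: str
    ndata: int = 0              # defaults keep older call sites working
    is_multipoles: bool = False
    nell: int = 0

# An older call site that only passes the original fields still constructs:
legacy = CorrelationOutputSketch("lyalya_lyalya")

# A new call site can supply the multipole metadata explicitly:
full = CorrelationOutputSketch("lyalya_lyalya", ndata=120,
                               is_multipoles=True, nell=3)
```

Without the defaults, any constructor call that omits the new fields raises `TypeError` at runtime, which is exactly the failure mode flagged for `read_correlations()`.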
Comment on lines 201 to 224
    # TODO The corr items need to have an imposed order
-   masked_fiducial = []
-   for name, data in self._data.items():
-       mask = data.dist_model_coordinates.get_mask_to_other(data.data_coordinates)
-       if data.data_mask.size == fiducial_model[name].size:
-           masked_fiducial.append(fiducial_model[name])
-       elif mask.size == fiducial_model[name].size:
-           masked_fiducial.append(fiducial_model[name][mask])
-       else:
-           raise ValueError('Input fiducial has unknown size. '
-                            'It must match the data or the model.')
-   masked_fiducial = np.concatenate(masked_fiducial)
+   full_fiducial_model = np.concatenate([fiducial_model[name] for name in self._data])
+   masked_fiducial = full_fiducial_model[full_model_mask]
+   # Naim comment: unsure about the purpose of the below code
+   # masked_fiducial = []
+   # for name, data in self._data.items():
+   #     mask = data.dist_model_coordinates.get_mask_to_other(data.data_coordinates)
+   #     if data.data_mask.size == fiducial_model[name].size:
+   #         masked_fiducial.append(fiducial_model[name])
+   #     elif mask.size == fiducial_model[name].size:
+   #         masked_fiducial.append(fiducial_model[name][mask])
+   #     else:
+   #         raise ValueError('Input fiducial has unknown size. '
+   #                          'It must match the data or the model.')
+   # masked_fiducial = np.concatenate(masked_fiducial)

    if forecast:
-       self.current_mc_mock = masked_fiducial[full_data_mask]
+       self.current_mc_mock = masked_fiducial
+       # self.current_mc_mock = masked_fiducial[full_data_mask]
    else:
        ran_vec = np.random.randn(full_data_mask.sum())
-       self.current_mc_mock = masked_fiducial[full_data_mask] + self._cholesky_global_cov.dot(
+       # self.current_mc_mock = masked_fiducial[full_data_mask] + self._cholesky_global_cov.dot(
+       self.current_mc_mock = masked_fiducial + self._cholesky_global_cov.dot(
            ran_vec)
Copilot AI commented Mar 24, 2026

masked_fiducial is built using full_model_mask, but the global covariance Cholesky and noise vector are built using full_data_mask. If model_mask and data_mask differ (e.g., due to distortion/model coordinate differences), this leads to shape mismatches when adding the noise term. Use full_data_mask consistently for the masked mean, or explicitly map the fiducial model from model-space to data-space before applying the global covariance.

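The mismatch this comment describes can be reproduced in isolation (a toy sketch with hypothetical masks and sizes): a mean vector masked in model space cannot be added to a noise vector sized by the data mask.

```python
import numpy as np

# Toy masks: model grid has 5 bins, data grid has 3, and they select
# different numbers of elements
full_model_mask = np.array([True, True, False, True, True])  # 4 selected
full_data_mask = np.array([True, False, True])               # 2 selected

fiducial = np.arange(5, dtype=float)
masked_fiducial = fiducial[full_model_mask]   # model-space mean, length 4

noise = np.zeros(full_data_mask.sum())        # data-space noise, length 2
try:
    mock = masked_fiducial + noise            # (4,) + (2,) cannot broadcast
    shapes_compatible = True
except ValueError:
    shapes_compatible = False
```

When the masks happen to select equal counts the code runs but may silently mis-map bins, which is the harder failure mode to catch; an explicit model-to-data mapping avoids both.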
p-slash and others added 3 commits March 24, 2026 17:59
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
