fix: skip resources with empty IDs from conditional env vars in config processing #4455

Elbehery · 2026-01-06T21:23:53Z

What does this PR do?

This PR fixes config processing to skip registered resources when their ID fields resolve to empty/None from conditional environment variable syntax (e.g., ${env.VAR:+value}).

Changes

Added RESOURCE_ID_FIELDS constant listing all resource ID fields: model_id, shield_id, dataset_id, scoring_fn_id, benchmark_id, toolgroup_id
Modified replace_env_vars() to check if any resource ID field resolves to empty/None and skip the entire resource entry early during config processing
Added unit tests for the new functionality

Why is this needed?

When using conditional env var syntax like ${env.ENABLE_BENCHMARK:+my-benchmark}, if the env var is not set, the benchmark_id (or other resource ID) resolves to empty/None. Previously this would cause validation errors. Now these entries are gracefully skipped during config processing.

Fixes #4453

Elbehery · 2026-01-06T21:26:28Z

cc @leseb @saichandrapandraju

leseb · 2026-01-07T14:40:07Z

need to backport in 0.4.x branch

Elbehery · 2026-01-07T14:41:27Z

sure, shall we merge this first ?

leseb

the fix needs to be in replace_env_vars not after the processing, also we need to not only handle benchmarks but other registerable ressources. this diff worked for me

diff --git a/src/llama_stack/core/stack.py b/src/llama_stack/core/stack.py
index 3ea2e8996..d573fba64 100644
--- a/src/llama_stack/core/stack.py
+++ b/src/llama_stack/core/stack.py
@@ -110,6 +110,10 @@ REGISTRY_REFRESH_INTERVAL_SECONDS = 300
 REGISTRY_REFRESH_TASK = None
 TEST_RECORDING_CONTEXT = None

+# ID fields for registered resources that should trigger skipping
+# when they resolve to empty/None (from conditional env vars like :+)
+RESOURCE_ID_FIELDS = ["model_id", "shield_id", "dataset_id", "scoring_fn_id", "benchmark_id", "toolgroup_id"]
+

 def is_request_model(t: Any) -> bool:
     """Check if a type is a request model (Pydantic BaseModel).
@@ -346,15 +350,31 @@ def replace_env_vars(config: Any, path: str = "") -> Any:
                             logger.debug(
                                 f"Skipping config env variable expansion for disabled provider: {v.get('provider_id', '')}"
                             )
-                            # Create a copy with resolved provider_id but original config
-                            disabled_provider = v.copy()
-                            disabled_provider["provider_id"] = resolved_provider_id
                             continue
                     except EnvVarError:
                         # If we can't resolve the provider_id, continue with normal processing
                         pass

-                # Normal processing for non-disabled providers
+                # Special handling for registered resources: check if ID field resolves to empty/None
+                # from conditional env vars (e.g., ${env.VAR:+value}) and skip the entry if so
+                if isinstance(v, dict):
+                    should_skip = False
+                    for id_field in RESOURCE_ID_FIELDS:
+                        if id_field in v:
+                            try:
+                                resolved_id = replace_env_vars(v[id_field], f"{path}[{i}].{id_field}")
+                                if resolved_id is None or resolved_id == "":
+                                    logger.debug(
+                                        f"Skipping resource with empty {id_field} (conditional env var not set)"
+                                    )
+                                    should_skip = True
+                                    break
+                            except EnvVarError:
+                                pass
+                    if should_skip:
+                        continue
+
+                # Normal processing
                 result.append(replace_env_vars(v, f"{path}[{i}]"))
             except EnvVarError as e:
                 raise EnvVarError(e.var_name, e.path) from None

leseb

please revise your commit message and PR title/content

…g processing Added handling in replace_env_vars() to skip registered resources when their ID fields (model_id, shield_id, dataset_id, scoring_fn_id, benchmark_id, toolgroup_id) resolve to empty/None from conditional env var syntax like ${env.VAR:+value}. Signed-off-by: Mustafa Elbehery <melbeher@redhat.com>

Elbehery · 2026-01-08T09:41:32Z

updated

skamenan7 · 2026-01-08T13:00:36Z

src/llama_stack/core/stack.py

+                                    should_skip = True
+                                    break
+                            except EnvVarError:
+                                pass


Please log without swallowing the exception :)

skamenan7 · 2026-01-08T13:02:37Z

src/llama_stack/core/stack.py

+                            try:
+                                resolved_id = replace_env_vars(v[id_field], f"{path}[{i}].{id_field}")
+                                if resolved_id is None or resolved_id == "":
+                                    logger.debug(


this log will not be visible in default log level of info. consider info log level so users know about this skip.

Also, consider adding resource type or path information in the log to know if that id is model, shield or dataset.

skamenan7 · 2026-01-08T13:07:12Z

tests/unit/server/test_replace_env_vars.py

    data = {"port": "8080", "enabled": "true", "count": "123", "ratio": "3.14"}
    expected = {"port": "8080", "enabled": "true", "count": "123", "ratio": "3.14"}
    assert replace_env_vars(data) == expected
+


Is there a test when ID field uses ${env.VAR} syntax (without := or :+ operators) and the env var is not set. Feels like it is missing.

Elbehery requested review from ashwinb, bbrowning, cdoern, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners January 6, 2026 21:23

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 6, 2026

Elbehery force-pushed the 20260106_allow_empty_banchmarkId branch from 12753c6 to 4e19c02 Compare January 6, 2026 21:25

Elbehery mentioned this pull request Jan 6, 2026

feat: add Garak eval inline and remote providers opendatahub-io/llama-stack-distribution#184

Open

Elbehery force-pushed the 20260106_allow_empty_banchmarkId branch 2 times, most recently from da5bf74 to 8b144c9 Compare January 7, 2026 13:56

leseb requested changes Jan 7, 2026

View reviewed changes

Elbehery force-pushed the 20260106_allow_empty_banchmarkId branch 2 times, most recently from c5f2dfb to b29e3a1 Compare January 7, 2026 22:26

leseb requested changes Jan 8, 2026

View reviewed changes

Elbehery changed the title ~~fix: filter benchmarks with None benchmark_id before validation~~ fix: skip resources with empty IDs from conditional env vars in config processing Jan 8, 2026

Elbehery force-pushed the 20260106_allow_empty_banchmarkId branch from b29e3a1 to 031f4b9 Compare January 8, 2026 09:41

skamenan7 approved these changes Jan 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: skip resources with empty IDs from conditional env vars in config processing #4455

fix: skip resources with empty IDs from conditional env vars in config processing #4455

Elbehery commented Jan 6, 2026 •

edited

Loading

Uh oh!

Elbehery commented Jan 6, 2026

Uh oh!

leseb commented Jan 7, 2026

Uh oh!

Elbehery commented Jan 7, 2026

Uh oh!

leseb left a comment

Uh oh!

leseb left a comment

Uh oh!

Elbehery commented Jan 8, 2026

Uh oh!

skamenan7 Jan 8, 2026

Uh oh!

skamenan7 Jan 8, 2026

Uh oh!

skamenan7 Jan 8, 2026

Uh oh!

skamenan7 Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: skip resources with empty IDs from conditional env vars in config processing #4455

Are you sure you want to change the base?

fix: skip resources with empty IDs from conditional env vars in config processing #4455

Conversation

Elbehery commented Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Changes

Why is this needed?

Uh oh!

Elbehery commented Jan 6, 2026

Uh oh!

leseb commented Jan 7, 2026

Uh oh!

Elbehery commented Jan 7, 2026

Uh oh!

leseb left a comment

Choose a reason for hiding this comment

Uh oh!

leseb left a comment

Choose a reason for hiding this comment

Uh oh!

Elbehery commented Jan 8, 2026

Uh oh!

skamenan7 Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Elbehery commented Jan 6, 2026 •

edited

Loading