[Bug] False positive in pre-publish validator: function-call assignment to token variable is treated as leaked secret

### Summary

The pre-publish validator reports a false positive for ordinary Python code that assigns a function's return value to a variable named `token`.

### Steps To Reproduce

  ### Current behavior

  Publishing fails with an error like:

  ```text
  Pre-publish validation failed:
  scripts/f2e_mock.py line 1842 contains a value that looks like a secret or token. Replace real credentials with placeholders before publishing.
```

  A representative line that triggers the error is:

  ```python
  token = extract_group_token_value(response, group_choice.group_id)
  ```

  This line does not contain a hardcoded credential. It only assigns the return value of a function call.

  ### Minimal example

  ```python
  def maybe_generate_group_token(...):
      requested_token = secrets.token_hex(20)
      ...
      token = extract_group_token_value(response, group_choice.group_id)
      if token:
          return token
      ...
  ```

  ### Why this looks like a false positive

  The pre-publish validator currently uses a regex-based heuristic in:

  - server/skillhub-domain/src/main/java/com/iflytek/skillhub/domain/skill/validation/BasicPrePublishValidator.java

  Relevant rule:

  ```text
  (?i)(api[_-]?key|access[_-]?key|secret|password|token)\s*[:=]\s*['\"]?([A-Za-z0-9_\-]{12,})
  ```

  This regex scans text line by line and does not parse Python syntax. As a result:

  - the left-hand side matches `token`
  - the right-hand side matches the identifier `extract_group_token_value`, because the quote is optional and `[A-Za-z0-9_\-]{12,}` accepts any run of 12 or more identifier characters
  - the validator treats that identifier as a “secret-like value”

  So any function name or identifier of 12 or more characters on the right-hand side can be mistaken for a leaked secret.
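The match described above can be reproduced outside the validator. The snippet below is a minimal sketch that transcribes the quoted rule into Python's `re` module (the real validator is Java; the assumption here is that the pattern behaves the same way for this input) and prints what each capture group picks up.

```python
import re

# The rule quoted in this issue, transcribed for Python's `re` module.
# The Java string escape \" becomes a plain " here.
SECRET_PATTERN = re.compile(
    r"""(?i)(api[_-]?key|access[_-]?key|secret|password|token)\s*[:=]\s*['"]?([A-Za-z0-9_\-]{12,})"""
)

line = "token = extract_group_token_value(response, group_choice.group_id)"
match = SECRET_PATTERN.search(line)

# group(1) is the keyword on the left-hand side; group(2) is the run of
# 12+ identifier characters that the rule treats as the secret value.
print(match.group(1))  # token
print(match.group(2))  # extract_group_token_value
```

The second group stops at the opening parenthesis, so the function name alone is enough to satisfy the length check.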

  ### Related code path

  The validator is invoked during publish here:

  - server/skillhub-domain/src/main/java/com/iflytek/skillhub/domain/skill/service/SkillPublishService.java

  ### Expected behavior

  The validator should block obvious hardcoded credentials, but should not reject:

  - assignments whose right-hand side is another variable
  - function-call return values
  - ordinary identifiers that happen to contain words like `token`, `secret`, etc.

  Example that should be allowed:

  ```python
  token = extract_group_token_value(response, group_choice.group_id)
  ```
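One possible way to get this behavior is to flag a value only when it is a quoted literal. The sketch below is an assumption about a fix, not the project's actual change: `QUOTED_SECRET` and `looks_like_hardcoded_secret` are hypothetical names, and the pattern is written for Python's `re` rather than the Java validator.

```python
import re

KEYWORDS = r"(?:api[_-]?key|access[_-]?key|secret|password|token)"

# Hypothetical tightened rule: the value must start with a quote and end
# with the same quote, so bare identifiers and function calls do not match.
QUOTED_SECRET = re.compile(
    rf"""(?i){KEYWORDS}\s*[:=]\s*(['"])([A-Za-z0-9_\-]{{12,}})\1"""
)


def looks_like_hardcoded_secret(line: str) -> bool:
    """Return True when the line assigns a quoted, secret-like literal."""
    return QUOTED_SECRET.search(line) is not None


print(looks_like_hardcoded_secret(
    "token = extract_group_token_value(response, group_choice.group_id)"
))
print(looks_like_hardcoded_secret('token = "ghp_xxxxxxxxxxxxxxxxxxxx"'))
```

The trade-off is that unquoted values, such as those in `.env` or YAML files, would no longer be caught by this rule alone, so a real fix may need a separate rule per file type.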


### Expected Behavior

  ### Suggested regression test

  A test case similar to this should pass:

  ```python
  token = extract_group_token_value(response, group_choice.group_id)
  ```

  while real hardcoded secrets such as:

  ```python
  token = "ghp_xxxxxxxxxxxxxxxxxxxx"
  api_key = "sk-xxxxxxxxxxxxxxxxxxxx"
  ```

  should still fail.
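The cases above could be written down roughly as follows. This is a sketch in Python's `unittest` rather than the project's Java test suite, and `is_flagged` is a hypothetical stand-in (it assumes only quoted literals count as secrets) so that the example runs on its own; a real regression test would call the validator instead.

```python
import re
import unittest

# Stand-in for the real validator so the sketch is self-contained.
# Assumption: only quoted literals are treated as hardcoded secrets.
_QUOTED_SECRET = re.compile(
    r"""(?i)(?:api[_-]?key|access[_-]?key|secret|password|token)\s*[:=]\s*(['"])[A-Za-z0-9_\-]{12,}\1"""
)


def is_flagged(line: str) -> bool:
    """Hypothetical hook; replace with a call into the real validator."""
    return _QUOTED_SECRET.search(line) is not None


class SecretRuleRegressionTest(unittest.TestCase):
    def test_function_call_assignment_is_allowed(self):
        line = "token = extract_group_token_value(response, group_choice.group_id)"
        self.assertFalse(is_flagged(line))

    def test_hardcoded_secrets_are_still_rejected(self):
        for line in (
            'token = "ghp_xxxxxxxxxxxxxxxxxxxx"',
            'api_key = "sk-xxxxxxxxxxxxxxxxxxxx"',
        ):
            with self.subTest(line=line):
                self.assertTrue(is_flagged(line))
```

Saved as a module, the cases can be run with `python -m unittest`.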

### Environment

_No response_

### API Contract Impact

_No response_

### Logs Or Screenshots

_No response_

[Bug] False positive in pre-publish validator: function-call assignment to token variable is treated as leaked secret #234