Skip to content

br: reword 'Base64-encoded storage' for accurate machine translation#23047

Open
yahonda wants to merge 2 commits into
pingcap:release-8.5from
yahonda:fix-br-compact-log-backup
Open

br: reword 'Base64-encoded storage' for accurate machine translation#23047
yahonda wants to merge 2 commits into
pingcap:release-8.5from
yahonda:fix-br-compact-log-backup

Conversation

@yahonda

@yahonda yahonda commented Jun 12, 2026

Copy link
Copy Markdown
Member

What is changed, added or deleted? (Required)

This PR rewords ambiguous English phrases in br/br-compact-log-backup.md to produce more accurate machine translations to Japanese.

Problem:
The original English text contains the phrase "Base64-encoded storage", which machine translators (and human translators) interpret as "storage itself being Base64-encoded". This leads to unnatural Japanese translations like "Base64エンコードストレージ" (Base64-encoded storage), which is confusing because what is actually encoded is the storage URI/string, not the storage.

The root cause traces back to the Chinese source phrase "有了存储的 Base64" (the Base64 of the storage / the storage's Base64), which was translated to English as "Base64-encoded storage" — reversing the possessive relationship and changing the meaning.

Changes:

Before After Why
"matches the storage output" "should match the output" "storage output" is a confusing compound noun that MT renders literally as "storage出力"
"With the Base64-encoded storage, you can initiate the compaction..." "Once you have the Base64-encoded string from the previous step, you can start compaction..." Clarifies that the string (not the storage) was encoded; "initiate the compaction" is also overly formal
"the Base64-encoded storage string obtained earlier" "the Base64-encoded string obtained in the previous step" Same compound-noun ambiguity as above

These changes ensure machine translation produces natural Japanese: e.g., "Base64エンコードされた文字列" instead of the nonsensical "Base64エンコードストレージ".

Which TiDB version(s) do your changes apply to? (Required)

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)

What is the related PR or file link(s)?

  • This PR is translated from:
  • Other reference link(s):

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@ti-chi-bot

ti-chi-bot Bot commented Jun 12, 2026

Copy link
Copy Markdown

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign kissmydb for approval. For more information see the Code Review Process.
Please ensure that each of them provides their approval before proceeding.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot Bot added missing-translation-status This PR does not have translation status info. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Jun 12, 2026

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the documentation for log backup compaction in br-compact-log-backup.md and adds the "Row-based standard storage" metric to the built-in monitoring documentation in built-in-monitoring.md. The review feedback suggests style improvements to align with the documentation guidelines, specifically by removing unnecessary filler words to make instructions more direct and converting passive voice to active voice for better readability.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Comment thread br/br-compact-log-backup.md Outdated
Comment thread tidb-cloud/built-in-monitoring.md Outdated
Three changes to br/br-compact-log-backup.md to improve machine
translation quality to Japanese and avoid ambiguity:

1. 'matches the storage output' → 'should match the output'
   - 'storage output' is a confusing compound that MT renders literally
   - The actual output is the storage value/URI

2. 'With the Base64-encoded storage' → 'Once you have the Base64-encoded string from the previous step'
   - 'Base64-encoded storage' falsely implies the storage itself is encoded
   - What was encoded is the storage URI/string, not the storage
   - This mistranslation from Chinese '有了存储的 Base64' causes 'Base64エンコードストレージ' in Japanese

3. 'Base64-encoded storage string' → 'Base64-encoded string obtained in the previous step'
   - Same compound-noun ambiguity as above

These changes ensure MT produces natural Japanese: ストレージ文字列
vs ストレージのBase64文字列, and avoids the nonsensical 'Base64エンコードストレージ'.
@yahonda yahonda force-pushed the fix-br-compact-log-backup branch from 931cd30 to 15545a5 Compare June 12, 2026 02:56
@ti-chi-bot ti-chi-bot Bot added the needs-1-more-lgtm Indicates a PR needs 1 more LGTM. label Jun 12, 2026
@ti-chi-bot

ti-chi-bot Bot commented Jun 12, 2026

Copy link
Copy Markdown

[LGTM Timeline notifier]

Timeline:

  • 2026-06-12 05:59:44.731061407 +0000 UTC m=+1112485.801378798: ☑️ agreed by qiancai.

@qiancai qiancai added type/enhancement The issue or PR belongs to an enhancement. needs-cherry-pick-master Should cherry pick this PR to master branch. and removed needs-1-more-lgtm Indicates a PR needs 1 more LGTM. missing-translation-status This PR does not have translation status info. labels Jun 12, 2026
@qiancai qiancai self-assigned this Jun 12, 2026
github-actions Bot added a commit to qiancai/docs-cn that referenced this pull request Jun 12, 2026
Synced from: pingcap/docs#23047
Target PR: pingcap#21714
AI Provider: azure

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

needs-cherry-pick-master Should cherry pick this PR to master branch. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. type/enhancement The issue or PR belongs to an enhancement.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants