Skip to content

lake: incremental updates 0607#23019

Merged
ti-chi-bot[bot] merged 5 commits into
pingcap:feature/preview-cloud-lakefrom
lilin90:update-0607
Jun 8, 2026
Merged

lake: incremental updates 0607#23019
ti-chi-bot[bot] merged 5 commits into
pingcap:feature/preview-cloud-lakefrom
lilin90:update-0607

Conversation

@lilin90

@lilin90 lilin90 commented Jun 8, 2026

Copy link
Copy Markdown
Member

What is changed, added or deleted? (Required)

Incremental docs updates for TiDB Cloud Lake till June 7, 2026

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)

What is the related PR or file link(s)?

  • This PR is translated from:
  • Other reference link(s):

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@lilin90 lilin90 self-assigned this Jun 8, 2026
@lilin90 lilin90 added translation/no-need No need to translate this PR. area/tidb-cloud This PR relates to the area of TiDB Cloud. labels Jun 8, 2026
@ti-chi-bot ti-chi-bot Bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Jun 8, 2026
Add a (Beta) label to the page front matter title and the main heading in tidb-cloud-lake/guides/integrate-with-amazon-sqs-s3.md to indicate the integration task is in beta.

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the TiDB Cloud Lake documentation to mark the Amazon SQS (S3) integration task and IAM role data source as Beta features. It also documents the new NDJSON support for schema evolution, detailing the workflow, privilege requirements, sampling options, and inference rules. Additionally, the system settings reference table has been completely updated with more comprehensive settings and descriptions. The review feedback focuses on enforcing sentence case for headings across several files in accordance with the style guide, and correcting grammatical errors in the system settings descriptions.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Confirm that `QueueArn` points to the target SQS queue, `Events` includes `s3:ObjectCreated:*`, and `FilterRules` matches the `Object Key Prefix` / `Object Key Suffix` configured in the {{{ .lake }}} data source.

## Step 4: Create an IAM Role for Platform to Assume
## Step 4: Create an IAM Role for {{{ .lake }}} to Assume

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
## Step 4: Create an IAM Role for {{{ .lake }}} to Assume
## Step 4: Create an IAM role for {{{ .lake }}} to assume
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)


Query-based COPY is not affected. For example, `COPY INTO <table> FROM (SELECT ... FROM @stage)` keeps the existing privilege requirements.

## Parquet Example

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
## Parquet Example
## Parquet example
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)


Row 3 has `currency = NULL` because its source file did not contain that column.

## NDJSON Example

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
## NDJSON Example
## NDJSON example
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)


{{{ .lake }}} loads NDJSON files with `TYPE = ndjson`. NDJSON files do not have an embedded columnar schema like Parquet files, so {{{ .lake }}} samples file content, infers fields that are missing from the target table, and appends them as nullable columns.

### Step 1: Create a Table and Stage

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
### Step 1: Create a Table and Stage
### Step 1: Create a table and stage
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)

CREATE OR REPLACE STAGE events_stage;
```

### Step 2: Generate NDJSON Files with Different Fields

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
### Step 2: Generate NDJSON Files with Different Fields
### Step 2: Generate NDJSON files with different fields
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)

| COLUMN_MATCH_MODE | For Parquet: column name matching mode | `case-insensitive` |
| SCHEMA_EVOLUTION | For NDJSON: sampling options used to infer columns that are missing from the target table. Requires `ENABLE_SCHEMA_EVOLUTION = true` and the `ALTER` privilege on the target table. | `AUTO` sampling |

### SCHEMA_EVOLUTION Options

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

low

According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change
### SCHEMA_EVOLUTION Options
### SCHEMA_EVOLUTION options
References
  1. Use sentence case for headings (e.g., ## Configure the cluster). (link)

Comment thread tidb-cloud-lake/sql/system-settings.md Outdated
Comment thread tidb-cloud-lake/sql/system-settings.md Outdated
Comment thread tidb-cloud-lake/sql/system-settings.md Outdated
Comment thread tidb-cloud-lake/sql/system-settings.md Outdated
lilin90 and others added 3 commits June 8, 2026 18:24
Adjust the heading level for 'NDJSON Inference Rules' from ### to #### to improve document structure and nesting consistency. No content changes were made beyond the header level.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
@lilin90 lilin90 added the lgtm label Jun 8, 2026
@lilin90

lilin90 commented Jun 8, 2026

Copy link
Copy Markdown
Member Author

/approve

@ti-chi-bot

ti-chi-bot Bot commented Jun 8, 2026

Copy link
Copy Markdown

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lilin90

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot Bot added the approved label Jun 8, 2026
@ti-chi-bot ti-chi-bot Bot merged commit b7fa8dd into pingcap:feature/preview-cloud-lake Jun 8, 2026
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved area/tidb-cloud This PR relates to the area of TiDB Cloud. lgtm size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. translation/no-need No need to translate this PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant