lake: incremental updates 0607 by lilin90 · Pull Request #23019 · pingcap/docs

lilin90 · 2026-06-08T10:16:28Z

What is changed, added or deleted? (Required)

Incremental docs updates for TiDB Cloud Lake till June 7, 2026

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

What is the related PR or file link(s)?

This PR is translated from:
Other reference link(s):

Do your changes match any of the following descriptions?

Delete files
Change aliases
Need modification after applied to another branch
Might cause conflicts after applied to another branch

Add a (Beta) label to the page front matter title and the main heading in tidb-cloud-lake/guides/integrate-with-amazon-sqs-s3.md to indicate the integration task is in beta.

gemini-code-assist

Code Review

This pull request updates the TiDB Cloud Lake documentation to mark the Amazon SQS (S3) integration task and IAM role data source as Beta features. It also documents the new NDJSON support for schema evolution, detailing the workflow, privilege requirements, sampling options, and inference rules. Additionally, the system settings reference table has been completely updated with more comprehensive settings and descriptions. The review feedback focuses on enforcing sentence case for headings across several files in accordance with the style guide, and correcting grammatical errors in the system settings descriptions.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-08T10:19:06Z

 Confirm that `QueueArn` points to the target SQS queue, `Events` includes `s3:ObjectCreated:*`, and `FilterRules` matches the `Object Key Prefix` / `Object Key Suffix` configured in the {{{ .lake }}} data source.

-## Step 4: Create an IAM Role for Platform to Assume
+## Step 4: Create an IAM Role for {{{ .lake }}} to Assume


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

## Step 4: Create an IAM Role for {{{ .lake }}} to Assume

## Step 4: Create an IAM role for {{{ .lake }}} to assume

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

gemini-code-assist · 2026-06-08T10:19:07Z

+
+Query-based COPY is not affected. For example, `COPY INTO <table> FROM (SELECT ... FROM @stage)` keeps the existing privilege requirements.
+
+## Parquet Example


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

## Parquet Example

## Parquet example

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

gemini-code-assist · 2026-06-08T10:19:07Z


 Row 3 has `currency = NULL` because its source file did not contain that column.

+## NDJSON Example


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

## NDJSON Example

## NDJSON example

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

gemini-code-assist · 2026-06-08T10:19:07Z

+
+{{{ .lake }}} loads NDJSON files with `TYPE = ndjson`. NDJSON files do not have an embedded columnar schema like Parquet files, so {{{ .lake }}} samples file content, infers fields that are missing from the target table, and appends them as nullable columns.
+
+### Step 1: Create a Table and Stage


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

### Step 1: Create a Table and Stage

### Step 1: Create a table and stage

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

gemini-code-assist · 2026-06-08T10:19:07Z

+CREATE OR REPLACE STAGE events_stage;
+```
+
+### Step 2: Generate NDJSON Files with Different Fields


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

### Step 2: Generate NDJSON Files with Different Fields

### Step 2: Generate NDJSON files with different fields

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

gemini-code-assist · 2026-06-08T10:19:07Z

 | COLUMN_MATCH_MODE | For Parquet: column name matching mode | `case-insensitive` |
+| SCHEMA_EVOLUTION | For NDJSON: sampling options used to infer columns that are missing from the target table. Requires `ENABLE_SCHEMA_EVOLUTION = true` and the `ALTER` privilege on the target table. | `AUTO` sampling |
+
+### SCHEMA_EVOLUTION Options


According to the style guide, headings should use sentence case. Please update this heading to use sentence case.

Suggested change

### SCHEMA_EVOLUTION Options

### SCHEMA_EVOLUTION options

References

Use sentence case for headings (e.g., ## Configure the cluster). ^(link)

Adjust the heading level for 'NDJSON Inference Rules' from ### to #### to improve document structure and nesting consistency. No content changes were made beyond the header level.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

lilin90 · 2026-06-08T10:47:37Z

/approve

ti-chi-bot · 2026-06-08T10:47:46Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lilin90

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [lilin90]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

lake: incremental updates 0607

9d4a5b9

lilin90 self-assigned this Jun 8, 2026

lilin90 added translation/no-need No need to translate this PR. area/tidb-cloud This PR relates to the area of TiDB Cloud. labels Jun 8, 2026

ti-chi-bot Bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Jun 8, 2026

Mark SQS (S3) integration task as Beta

56f5cc1

Add a (Beta) label to the page front matter title and the main heading in tidb-cloud-lake/guides/integrate-with-amazon-sqs-s3.md to indicate the integration task is in beta.

gemini-code-assist Bot reviewed Jun 8, 2026

View reviewed changes

lilin90 and others added 3 commits June 8, 2026 18:24

Change NDJSON heading from h3 to h4

330a1fb

Adjust the heading level for 'NDJSON Inference Rules' from ### to #### to improve document structure and nesting consistency. No content changes were made beyond the header level.

Update tidb-cloud-lake/sql/system-settings.md

f85fb8a

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Apply suggestions from code review

32e7988

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

lilin90 added the lgtm label Jun 8, 2026

ti-chi-bot Bot added the approved label Jun 8, 2026

ti-chi-bot Bot merged commit b7fa8dd into pingcap:feature/preview-cloud-lake Jun 8, 2026
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lake: incremental updates 0607#23019

lake: incremental updates 0607#23019
ti-chi-bot[bot] merged 5 commits into
pingcap:feature/preview-cloud-lakefrom
lilin90:update-0607

lilin90 commented Jun 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lilin90 commented Jun 8, 2026

Uh oh!

ti-chi-bot Bot commented Jun 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	## Step 4: Create an IAM Role for {{{ .lake }}} to Assume
	## Step 4: Create an IAM role for {{{ .lake }}} to assume


		Query-based COPY is not affected. For example, `COPY INTO <table> FROM (SELECT ... FROM @stage)` keeps the existing privilege requirements.

		## Parquet Example


		Row 3 has `currency = NULL` because its source file did not contain that column.

		## NDJSON Example


		{{{ .lake }}} loads NDJSON files with `TYPE = ndjson`. NDJSON files do not have an embedded columnar schema like Parquet files, so {{{ .lake }}} samples file content, infers fields that are missing from the target table, and appends them as nullable columns.

		### Step 1: Create a Table and Stage

	### Step 2: Generate NDJSON Files with Different Fields
	### Step 2: Generate NDJSON files with different fields

Conversation

lilin90 commented Jun 8, 2026

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

What is the related PR or file link(s)?

Do your changes match any of the following descriptions?

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lilin90 commented Jun 8, 2026

Uh oh!

ti-chi-bot Bot commented Jun 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant