glossary: add some terms to glossary #14298

ran-huang · 2023-07-20T07:05:05Z

What is changed, added or deleted? (Required)

glossary: add some terms to glossary

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions (in Chinese).

What is the related PR or file link(s)?

This PR is translated from: glossary: add some terms to glossary docs-cn#11227
Other reference link(s):

Do your changes match any of the following descriptions?

Delete files
Change aliases
Need modification after applied to another branch
Might cause conflicts after applied to another branch

Signed-off-by: Ran <[email protected]>

lilin90 · 2023-12-06T07:44:21Z

It seems that not all terms in the Chinese version pingcap/docs-cn#11227 are covered in this PR. Would you add the other terms to this PR later?

ran-huang · 2023-12-06T07:51:43Z

> It seems that not all terms in the Chinese version pingcap/docs-cn#11227 are covered in this PR. Would you add the other terms to this PR later?

@lilin90 sure

ran-huang · 2023-12-08T06:53:55Z

/rebase

dveeden · 2023-12-08T07:21:00Z

glossary.md

+
+[TiCDC](/ticdc/ticdc-overview.md) is a tool for incrementally replicating data in TiDB. It pulls the data change logs from the upstream TiKV and parses them into ordered row-level change data, which it then outputs to the downstream. For more information about the concepts and terms of TiCDC, see [TiCDC Glossary](/ticdc/ticdc-glossary.md).
+
+### TiDB Data Migration (DM)


I think people might be looking for DM under "D" instead of under "T".

Suggested change

-### TiDB Data Migration (DM)
+### Data Migration (DM)
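
Renaming the heading also implies moving the entry from the "T" section to the "D" section of glossary.md. A minimal sketch of a check that would flag an entry left under the wrong letter is shown below. It assumes the glossary uses `## X` for letter sections and `### Term` for entries, as in the diff context above; the function name and the sample text are hypothetical and are not part of this PR or the docs tooling.

```python
def misfiled_terms(markdown: str) -> list[tuple[str, str]]:
    """Return (section_letter, term) pairs for entries filed under the wrong letter."""
    current = None  # letter of the enclosing "## X" section, if any
    problems = []
    for raw in markdown.splitlines():
        line = raw.strip()
        if line.startswith("### "):
            term = line[4:].strip()
            # Compare case-insensitively so entries like "pending/down" pass under "P".
            if current and not term.upper().startswith(current):
                problems.append((current, term))
        elif line.startswith("## "):
            current = line[3:].strip().upper()
    return problems


# Hypothetical sample: the renamed entry was left in the "T" section.
sample = "\n".join([
    "## D",
    "",
    "### Dumpling",
    "",
    "## T",
    "",
    "### TiCDC",
    "",
    "### Data Migration (DM)",
])

flagged = misfiled_terms(sample)
print(flagged)
```

With the sample above, only the renamed entry is reported, because it still sits under "T".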

ti-chi-bot · 2023-12-28T09:57:48Z

[LGTM Timeline notifier]

Timeline:

2023-09-27 12:40:51.835538137 +0000 UTC m=+19249.422648283: ☑️ agreed by hfxsd.
2023-12-08 07:24:51.901644684 +0000 UTC m=+1771520.566870890: ✖️🔁 reset by dveeden.
2023-12-28 09:57:47.58059558 +0000 UTC m=+1732558.617822497: ☑️ agreed by dveeden.

ti-chi-bot · 2024-01-26T10:13:22Z

@ran-huang: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| pull-verify | `068374f` | link | true | `/test pull-verify` |

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

ti-chi-bot · 2025-01-23T09:01:49Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from qiancai, ensuring that each of them provides their approval before proceeding. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

benmeadowcroft · 2025-01-23T21:25:02Z

glossary.md

@@ -42,18 +42,22 @@ Baseline Capturing captures queries that meet capturing conditions and create bi

 ### Batch Create Table

-Batch Create Table is a feature introduced in TiDB v6.0.0. This feature is enabled by default. When restoring data with a large number of tables (nearly 50000) using BR (Backup & Restore), the feature can greatly speed up the restore process by creating tables in batches. For details, see [Batch Create Table](/br/br-batch-create-table.md).
+Batch Create Table is a feature introduced in TiDB v6.0.0. This feature is enabled by default. When restoring data with a large number of tables (nearly 50000) using BR (Backup & Restore), the feature can greatly speed up the restore process by creating tables in batches. For more information, see [Batch Create Table](/br/br-batch-create-table.md).


I find that introducing a feature with a version number doesn't add much value. We should be more upfront about defining the feature itself. Version numbers are also used inconsistently across the glossary, so I would suggest removing this one.

My suggestion for this entry would be something like:

The Batch Create Table feature greatly speeds up the creation of many tables by creating them in batches. For example, when restoring thousands of tables using the BR (Backup & Restore) tool, this can help shorten the overall recovery time. For more information, see Batch Create Table.

benmeadowcroft · 2025-01-23T21:41:49Z

glossary.md


 ## C

 ### Cached Table

 With the cached table feature, TiDB loads the data of an entire table into the memory of the TiDB server, and TiDB directly gets the table data from the memory without accessing TiKV, which improves the read performance.

+### Cluster
+
+A cluster is a group of nodes that work together to provide services. It typically consists of different types of nodes. For example, a TiDB cluster usually consists of TiDB nodes, TiKV nodes, and PD nodes, and a DM cluster usually consists of DM Master nodes and DM Worker nodes.


In the generic sense of the term "cluster", the phrase "It typically consists of different types of nodes" is confusing. TiDB actually consists of multiple layers of clusters: the TiDB nodes, TiKV nodes, and PD nodes each belong to their own cluster. See the first diagram in https://docs.pingcap.com/tidb/stable/tidb-architecture.

I think it would be worthwhile to expand this a bit and talk about the scalability and availability benefits of clustering instead.

My suggestion for this entry would be:

A cluster is a group of nodes that work together to provide services. For example, a cluster of TiKV nodes provides the storage services for TiDB. In a distributed system, using clusters of nodes can deliver higher availability and greater scalability than a single node can provide. As a distributed system, TiDB uses clusters of nodes to deliver highly available and scalable services: a cluster of TiDB servers provides a scalable SQL layer to clients; a cluster of PD nodes provides a resilient metadata layer for TiDB; and a cluster of TiKV servers, using the Raft consensus protocol, provides a highly available, scalable, and resilient storage service for TiDB. See the TiDB Architecture doc for more information.


benmeadowcroft · 2025-01-23T21:45:26Z

glossary.md


 ### Continuous Profiling

-Introduced in TiDB 5.3.0, Continuous Profiling is a way to observe resource overhead at the system call level. With the support of Continuous Profiling, TiDB provides performance insight as clear as directly looking into the database source code, and helps R&D and operation and maintenance personnel to locate the root cause of performance problems using a flame graph. For details, see [TiDB Dashboard Instance Profiling - Continuous Profiling](/dashboard/continuous-profiling.md).
+Introduced in TiDB 5.3.0, Continuous Profiling is a way to observe resource overhead at the system call level. With the support of Continuous Profiling, TiDB provides performance insight as clear as directly looking into the database source code, and helps R&D and operation and maintenance personnel to locate the root cause of performance problems using a flame graph. For more information, see [TiDB Dashboard Instance Profiling - Continuous Profiling](/dashboard/continuous-profiling.md).


I would remove the "Introduced in TiDB 5.3.0," at the start of the sentence. That version is now well out of its maintenance support (to 2023-11-30) and extended support (to 2024-11-30) periods, as per https://www.pingcap.com/tidb-release-support-policy/

There are also too many "ands" in the second sentence. I would suggest rewriting it as:

With the support of Continuous Profiling, TiDB provides fine-grained insights into performance problems, helping operations teams locate the root cause using a flame graph.

benmeadowcroft · 2025-01-23T21:48:10Z

glossary.md

+
+PD Control (pd-ctl) is a command-line tool to interface with the placement driver (PD) of the cluster. You can use it to obtain cluster status information and modify the cluster. For more information, see [PD Control User Guide](/pd-control.md).
+
+### pending/down


Should these be capitalized ("Pending/Down"), as in the previous version? Is there a reason they were changed to lowercase?

benmeadowcroft · 2025-01-23T21:51:00Z

glossary.md


 "Pending" and "down" are two special states of a peer. Pending indicates that the Raft log of followers or learners is vastly different from that of leader. Followers in pending cannot be elected as leader. "Down" refers to a state that a peer ceases to respond to leader for a long time, which usually means the corresponding node is down or isolated from the network.

 ### Placement Driver (PD)

 Placement Driver (PD) is a core component in the [TiDB Architecture](/tidb-architecture.md#placement-driver-pd-server) responsible for storing metadata, assigning [Timestamp Oracle (TSO)](/tso.md) for transaction timestamps, orchestrating data placement on TiKV, and running [TiDB Dashboard](/dashboard/dashboard-overview.md). For more information, see [TiDB Scheduling](/tidb-scheduling.md).

+### Placement Rules
+
+Placement rules are used to configure the placement of data in a TiKV cluster through the SQL interface. With this feature, you can specify the deployment of tables and partitions to different regions, data centers, cabinets, and hosts. Use cases include optimizing data availability strategies at low cost, ensuring that local data replicas are available for local stale reads, and complying with local data compliance requirements.


Placement rules are not only configured using the SQL interface (though that is preferred); they can also be configured using pd-ctl, see https://docs.pingcap.com/tidb/stable/configure-placement-rules#set-rules-using-pd-ctl. The link guiding users to the SQL interface is probably sufficient.

My suggestion for this paragraph is:

Placement rules are used to configure the placement of data in a TiKV cluster. With this feature, you can specify the deployment of tables and partitions to different regions, data centers, cabinets, and hosts. Use cases include optimizing data availability strategies at low cost, ensuring that local data replicas are available for local stale reads, and complying with local data compliance requirements.
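
For readers who want to see what the preferred SQL route looks like, the sketch below composes the two statements involved: one that defines a placement policy and one that attaches it to a table. The `CREATE PLACEMENT POLICY` and `ALTER TABLE ... PLACEMENT POLICY` forms follow TiDB's Placement Rules in SQL syntax; the helper name, policy name, table name, and region labels are hypothetical, and the helper only builds strings without connecting to a cluster.

```python
def placement_statements(policy: str, table: str,
                         primary_region: str, regions: list[str]) -> list[str]:
    """Build SQL that defines a placement policy and attaches it to a table."""
    if primary_region not in regions:
        # Sanity check for this sketch: the primary region should be one of the listed regions.
        raise ValueError("primary_region must be listed in regions")
    region_list = ",".join(regions)
    return [
        f'CREATE PLACEMENT POLICY {policy} '
        f'PRIMARY_REGION="{primary_region}" REGIONS="{region_list}";',
        f"ALTER TABLE {table} PLACEMENT POLICY={policy};",
    ]


# Hypothetical names, for illustration only.
stmts = placement_statements("eastandwest", "t1", "us-east-1", ["us-east-1", "us-west-1"])
for stmt in stmts:
    print(stmt)
```

The pd-ctl route mentioned above works with rule definitions rather than SQL and is not covered by this sketch.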

benmeadowcroft · 2025-01-23T21:59:15Z

glossary.md

@@ -286,13 +351,37 @@ A store refers to the storage node in the TiKV cluster (an instance of `tikv-ser

 ## T

+### Temporary table
+
+Temporary tables solve the issue of temporarily storing the intermediate results of an application, which frees you from frequently creating and dropping tables. You can store the intermediate calculation data in temporary tables. When the intermediate data is no longer needed, TiDB automatically cleans up and recycles the temporary tables. This avoids user applications being too complicated, reduces table management overhead, and improves performance.


Should that start with: "Temporary tables solve the issue of temporarily storing the intermediate results of an application's calculations,"?
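
To make "intermediate results of an application's calculations" concrete, here is a small, generic illustration of the pattern. It deliberately uses SQLite from the Python standard library rather than TiDB so that it runs without a cluster; TiDB's temporary table syntax and cleanup semantics differ in detail, and the table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 10), ("alice", 5), ("bob", 7)],
)

# Store an intermediate aggregation in a temporary table instead of a
# regular table that the application would have to create and drop itself.
conn.execute(
    "CREATE TEMPORARY TABLE order_totals AS "
    "SELECT customer, SUM(amount) AS total FROM orders GROUP BY customer"
)

# Later steps of the calculation read from the intermediate result.
top = conn.execute(
    "SELECT customer, total FROM order_totals ORDER BY total DESC LIMIT 1"
).fetchone()
print(top)

# In SQLite, the temporary table disappears when the connection closes.
conn.close()
```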

benmeadowcroft · 2025-01-23T22:04:36Z

glossary.md

+
+### TiCDC
+
+[TiCDC](/ticdc/ticdc-overview.md) is a tool for incrementally replicating data in TiDB. It pulls the data change logs from the upstream TiKV and parses them into ordered row-level change data, and then outputs the data to the downstream. For more information about the concepts and terms of TiCDC, see [TiCDC Glossary](/ticdc/ticdc-glossary.md).


I would suggest expanding this a bit, something like:

TiCDC is a tool for incrementally replicating data from TiDB to other downstream targets. These downstream targets may include other TiDB instances, MySQL-compatible databases, object storage locations, and streaming processors (like Kafka and Pulsar). TiCDC pulls the data change logs from the upstream TiKV, parses them into ordered row-level change data, and then outputs the data to the downstream. For more information about the concepts and terms of TiCDC, see TiCDC Glossary.

ti-chi-bot · 2025-01-23T22:08:55Z

@benmeadowcroft: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

qiancai · 2025-01-24T06:04:58Z

glossary.md


 ## C

 ### Cached Table

 With the cached table feature, TiDB loads the data of an entire table into the memory of the TiDB server, and TiDB directly gets the table data from the memory without accessing TiKV, which improves the read performance.

+### Cluster
+
+A cluster is a group of nodes that work together to provide services. It typically consists of different types of nodes. For example, a TiDB cluster usually consists of TiDB nodes, TiKV nodes, and PD nodes, and a DM cluster usually consists of DM Master nodes and DM Worker nodes.


I’m not sure if using ‘a cluster of TiKV’ and ‘a cluster of PD nodes’ might confuse users, as we typically refer to a TiDB cluster as consisting of TiDB nodes, TiKV nodes, and PD nodes. To avoid confusion, we could simply mention them as TiDB nodes, TiKV nodes, and PD nodes here.

ti-chi-bot · 2025-01-24T06:49:00Z

@benmeadowcroft: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Add temp.md

f6d5e0b

ran-huang added the needs-rebase (indicates a PR cannot be merged because it has merge conflicts with HEAD), translation/from-docs-cn (this PR is translated from a PR in pingcap/docs-cn), and type/enhancement (the issue or PR belongs to an enhancement) labels Jul 20, 2023

Delete temp.md

28ef780

ti-chi-bot bot added the size/XS label (denotes a PR that changes 0-9 lines, ignoring generated files) and removed the needs-rebase label (indicates a PR cannot be merged because it has merge conflicts with HEAD) Jul 20, 2023

update glossary

c2943f5

Signed-off-by: Ran <[email protected]>

hfxsd assigned ran-huang Sep 27, 2023

hfxsd approved these changes Sep 27, 2023

ti-chi-bot bot added the needs-1-more-lgtm label (indicates a PR needs 1 more LGTM) Sep 27, 2023

add several entries

9efc1dd

ti-chi-bot bot added the size/M label (denotes a PR that changes 30-99 lines, ignoring generated files) and removed the size/XS label (denotes a PR that changes 0-9 lines, ignoring generated files) Dec 7, 2023

ran-huang added 2 commits December 7, 2023 17:38

Merge branch 'master' into add-terms-to-glossary-11227

c3659af

add more entries

b538df1

ti-chi-bot bot added the size/L label (denotes a PR that changes 100-499 lines, ignoring generated files) and removed the size/M label (denotes a PR that changes 30-99 lines, ignoring generated files) Dec 8, 2023

sort

9646189

ran-huang mentioned this pull request Dec 8, 2023

glossary: add some terms to glossary pingcap/docs-cn#11227

Open

12 tasks

dveeden requested changes Dec 8, 2023

ti-chi-bot bot removed the needs-1-more-lgtm label (indicates a PR needs 1 more LGTM) Dec 8, 2023

ran-huang and others added 2 commits December 11, 2023 16:19

Apply suggestions from code review

d261a0d

Co-authored-by: Daniël van Eeden <[email protected]>

Apply suggestions from code review

f6b40af

Co-authored-by: Daniël van Eeden <[email protected]>

ran-huang commented Dec 14, 2023

ran-huang and others added 2 commits December 14, 2023 15:40

Apply suggestions from code review

0f9ef7e

Co-authored-by: Daniël van Eeden <[email protected]>

Apply suggestions from code review

e0f5485

Co-authored-by: Daniël van Eeden <[email protected]>

dveeden approved these changes Dec 28, 2023

ti-chi-bot bot added the needs-1-more-lgtm label (indicates a PR needs 1 more LGTM) Dec 28, 2023

ran-huang commented Dec 28, 2023

Update glossary.md

068374f

qiancai assigned qiancai and unassigned ran-huang Jan 23, 2025

Merge remote-tracking branch 'upstream/master' into pr/14298

ed4a4d0

update wording

08a8af2

qiancai requested a review from benmeadowcroft January 23, 2025 09:47

benmeadowcroft requested changes Jan 23, 2025

qiancai reviewed Jan 24, 2025

Apply suggestions from code review

91346ca

qiancai reviewed Jan 24, 2025

qiancai reviewed Jan 24, 2025

update links

6240a7b

benmeadowcroft approved these changes Jan 24, 2025

qiancai reviewed Jan 24, 2025

qiancai reviewed Jan 24, 2025

qiancai added 2 commits January 24, 2025 17:19

Update glossary.md

1a2ba4a

Apply suggestions from code review

6fd3090

qiancai reviewed Jan 24, 2025

Update glossary.md

9dff453
