[docs] Bump the data size and tablets limits
Some of the changes that landed in 1.4.0, namely Todd's memory consumption
and log segment improvements, plus the beginning of Adar's thread consolidation
effort, make it easier for Kudu to store more data per node.

Some notes (mostly coming from Adar):
 - Memory consumption now seems to be around 1.5GB / TB of data on disk after
   startup for a TPC-H lineitem table.
 - File descriptor consumption is about 2 per log segment plus 1 per log index.
   Tablets with some replication lag will use more segments. On top of that
   comes the fd cache, which defaults to 40% of the configured max fds.
 - Thread usage is about 5 for hot replicas, then 2 when they become cold (new
   1.4.0 concept that Todd added).

Based on the above, doubling our current limits of 4TB spread over 1000
tablets to 8TB spread over 2000 means that:
 - 8TB requires at least 12GB of memory, then some more for the MRS, block cache,
   and scanners (around 256KB per column per scan).
 - 6000 fds are required to spin up 2000 tablets, plus what the fd cache uses.
 - 10k threads are required just to start Kudu.
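
The arithmetic behind these three figures can be reproduced with a short sketch. It is only a back-of-the-envelope estimate built from the per-unit numbers in the notes above. The function name and the `segments_per_tablet` parameter are hypothetical, and it assumes one log segment and one log index per tablet at startup, which is what the 6000 fd figure implies. It ignores the fd cache, MRS, block cache, and scanners.

```python
# Rough startup estimates derived from the per-unit figures in the notes above.
GB_PER_TB = 1.5               # memory after startup per TB on disk (TPC-H lineitem)
FDS_PER_LOG_SEGMENT = 2       # fds per log segment
FDS_PER_LOG_INDEX = 1         # fds per log index
THREADS_PER_HOT_REPLICA = 5   # threads while a replica is hot
THREADS_PER_COLD_REPLICA = 2  # threads once a replica becomes cold


def estimate_startup_resources(data_tb, tablets, segments_per_tablet=1):
    """Estimate baseline memory, fds, and threads for one tablet server.

    Assumes (hypothetically) one log index per tablet and
    `segments_per_tablet` log segments per tablet.
    """
    fds_per_tablet = FDS_PER_LOG_SEGMENT * segments_per_tablet + FDS_PER_LOG_INDEX
    return {
        "memory_gb": GB_PER_TB * data_tb,
        "fds": tablets * fds_per_tablet,
        "threads_hot": tablets * THREADS_PER_HOT_REPLICA,
        "threads_cold": tablets * THREADS_PER_COLD_REPLICA,
    }


if __name__ == "__main__":
    # 8TB over 2000 tablets, the new recommended limits.
    print(estimate_startup_resources(8, 2000))
```

Passing a larger `segments_per_tablet` gives a feel for how replication lag, which keeps more segments around, raises the fd count.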

Change-Id: Ie60d2c3548c402c6a08db9bb724bc6367db989ca
Reviewed-on: http://gerrit.cloudera.org:8080/7503
Reviewed-by: Todd Lipcon <[email protected]>
Tested-by: Todd Lipcon <[email protected]>
jdcryans authored and toddlipcon committed Jul 28, 2017
1 parent ef8e46b commit 0e41d4c
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions docs/known_issues.adoc
@@ -111,7 +111,7 @@
== Server management

* Production deployments should configure a least 4GB of memory for tablet servers,
-and ideally more than 10GB.
+and ideally more than 16GB when approaching the data and tablet <<Scale>> limits.

* Write ahead logs (WAL) can only be stored on one disk.

@@ -140,9 +140,9 @@
* Recommended maximum number of masters is 3.

* Recommended maximum amount of stored data, post-replication and post-compression,
-per tablet server is 4TB.
+per tablet server is 8TB.

-* Recommended maximum number of tablets per tablet server is 1000, post-replication.
+* Recommended maximum number of tablets per tablet server is 2000, post-replication.

* Maximum number of tablets per table for each tablet server is 60, post-replication,
at table-creation time.