Managing Shards in an InfluxDB Cluster¶
The InfluxDB storage engine organizes data into shards, which are logical partitions of the database that contain a subset of the data. Sasquatch uses the InfluxDB default configuration in which the shard duration is 7 days, starting at 00:00:00.000 UTC on Sunday and ending at 23:59:59.999 UTC on the following Saturday.
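To confirm the shard group duration for a database, you can query its retention policies. The sketch below assumes the influx CLI is available in the data pod image and uses the efd database from the examples later in this guide; the shardGroupDuration column should report 168h0m0s (7 days).
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-data-0 -- influx -execute 'SHOW RETENTION POLICIES ON "efd"'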
An InfluxDB cluster enables horizontal scaling and high availability by distributing shards across multiple data nodes, while the cluster metadata is managed by the meta nodes.
This guide outlines the use of the influxd-ctl and influx_inspect tools to manage shards in an InfluxDB cluster.
Listing Shards¶
The influxd-ctl tool is available on the InfluxDB meta nodes.
To view all shards and their metadata, use the influxd-ctl show-shards command:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl show-shards
This command displays information about all shards in the cluster, including the shard ID, Database, Retention Policy, Replication status, Start timestamp, End timestamp, and Owners.
For example, the following command lists the shard IDs for the EFD database and the corresponding Start and End timestamps:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl show-shards | awk '$2 == "efd" { print $1, $2, $6, $7 }' | sort -k1,1n
In Sasquatch deployments, the InfluxDB cluster has a replication factor of two, meaning that each shard is replicated on two data nodes. The data nodes where a shard is stored are reported under the Owners column.
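For example, to see which shards a particular data node owns, the show-shards output can be filtered by the pod name (here sasquatch-influxdb-enterprise-data-0):
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl show-shards | grep sasquatch-influxdb-enterprise-data-0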
Detailed information about the shards, such as State (hot or cold), Last Modified Time, and Size, can be obtained by running the influxd-ctl show-shards -v command.
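For example, against the same meta pod:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl show-shards -v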
The filesystem layout in a data node is as follows:
<data-dir>/<database>/<retention-policy>/<shard-id>
<data-dir>: The base directory for data storage, defined by the data configuration setting in the influxdb.conf file (/var/lib/influxdb/data by default).
<database>: Each database has its own subdirectory under the data directory.
<retention-policy>: Within each database directory, there is a subdirectory for each retention policy.
<shard-id>: Shards are stored in directories named by their unique numeric shard ID.
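For example, shard 786 of the efd database under the autogen retention policy (the shard used in the restore example later in this guide) can be inspected on a data node with:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-data-0 -- ls -lh /var/lib/influxdb/data/efd/autogen/786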
Checking data integrity¶
To verify the integrity of a shard across the cluster, use the influx_inspect tool on a data node, e.g. sasquatch-influxdb-enterprise-data-0:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-data-0 -- influx_inspect verify -dir /var/lib/influxdb/data/<database>/<retention-policy>/<shard-id>
This command checks the integrity of the Time-Structured Merge Tree (TSM) files.
The same shard can differ in size across the cluster due to compaction and other operations.
The influx_inspect verify command checks the integrity of the shard data, not the size.
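As a quick check, the on-disk size of the same shard can be compared on its two owners; a sketch using shard 786 from the example later in this guide:
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-data-0 -- du -sh /var/lib/influxdb/data/efd/autogen/786
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-data-1 -- du -sh /var/lib/influxdb/data/efd/autogen/786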
Read more about shard compaction operations in the InfluxDB storage engine documentation.
The Anti-Entropy service in InfluxDB Enterprise ensures that data is consistent across the cluster by comparing the data in the shards on different data nodes. However, this service consumes too many resources, and InfluxData recommended turning it off in Sasquatch.
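The service is controlled by the anti-entropy section of the data node configuration; a minimal sketch of the setting that keeps it disabled (assuming the default influxdb.conf layout) is:
[anti-entropy]
  enabled = false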
Backup and restore¶
The influxd-ctl tool provides commands to back up and restore shards.
A meta node doesn’t have enough space to keep the backup files.
To perform backup and restore operations, download the influxd-ctl tool and connect it to a meta node using the -bind option.
Download the tool from the InfluxData website and extract it from the RPM package:
wget https://dl.influxdata.com/enterprise/releases/influxdb-meta-1.11.3_c1.11.3-1.x86_64.rpm
rpm2cpio influxdb-meta-1.11.3_c1.11.3-1.x86_64.rpm | cpio -idmv
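After extraction, the influxd-ctl binary should be under ./usr/bin (assuming the standard RPM layout). The meta pod IP address used with the -bind option in the commands below can be obtained with kubectl, for example:
kubectl -n sasquatch get pod sasquatch-influxdb-enterprise-meta-0 -o jsonpath='{.status.podIP}'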
To back up a shard, use the influxd-ctl backup command:
influxd-ctl -bind <meta pod IP address>:8091 backup -db efd -shard <shard ID> /backup-dir
To restore a shard, use the influxd-ctl restore command:
influxd-ctl -bind <meta pod IP address>:8091 restore -db efd -shard <shard ID> -newshard <new shard ID> -newrf 2 /backup-dir
Where <shard ID> identifies the shard to be restored from the backup and <new shard ID> identifies the shard in the destination database to restore to. The -newrf 2 option specifies the replication factor for the restored shard, ensuring that it is restored to two data nodes.
Note
If you are restoring a shard from the same database, <new shard ID> is the same as <shard ID>.
If you are restoring a shard from a different database (e.g. restoring data from the Summit EFD database to the USDF EFD database), the shard IDs do not align, so <new shard ID> should be the ID of the shard in the destination database that has the same start time as the shard in the source database.
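To find the shard in the destination database whose start time matches the source shard, the show-shards output can be filtered on the Start column, reusing the awk column positions from the listing example above (<start timestamp> is a placeholder for the source shard's start time):
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl show-shards | awk '$2 == "efd" && $6 == "<start timestamp>" { print $1 }'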
Hot shards can be truncated using the influxd-ctl truncate-shards command before backup and restore operations.
After truncating a shard, another shard is created and new writes are directed to the new shard.
Truncated shards are marked as cold.
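For example (note that truncate-shards acts on all hot shards in the cluster, not on a single shard):
kubectl -n sasquatch exec -it sasquatch-influxdb-enterprise-meta-0 -- influxd-ctl truncate-shards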
For cold shards, it is possible to manually copy the shard TSM files to one of the destination data nodes under the appropriate directory, and then use the influxd-ctl copy-shard command to copy the shard to the other data node.
This procedure was applied to restore shard 786 in the USDF EFD database, after InfluxData ran an offline compaction of that shard to fix a slow query issue. In this case, the shard restore is as follows:
# List owners of shard 786
kubectl exec -it sasquatch-influxdb-enterprise-meta-0 -n sasquatch -- influxd-ctl show-shards | grep 786
# Manually remove the TSM and index files from shard 786 in data-0:
kubectl exec -it sasquatch-influxdb-enterprise-data-0 -n sasquatch -- /bin/bash
cd /var/lib/influxdb/data/efd/autogen/
rm -r 786
# Manually copy the fully compacted TSM and index files for shard 786 to data-0
kubectl -n sasquatch cp efd/autogen/786/ sasquatch-influxdb-enterprise-data-0:/var/lib/influxdb/data/efd/autogen/
# Remove shard 786 data and metadata from data-1 using the influxd-ctl remove-shard command
kubectl exec -it sasquatch-influxdb-enterprise-meta-0 -n sasquatch -- influxd-ctl remove-shard sasquatch-influxdb-enterprise-data-1.sasquatch-influxdb-enterprise-data.sasquatch.svc.cluster.local:8088 786
# Copy shard 786 from data-0 to data-1
kubectl exec -it sasquatch-influxdb-enterprise-meta-0 -n sasquatch -- influxd-ctl copy-shard sasquatch-influxdb-enterprise-data-0.sasquatch-influxdb-enterprise-data.sasquatch.svc.cluster.local:8088 sasquatch-influxdb-enterprise-data-1.sasquatch-influxdb-enterprise-data.sasquatch.svc.cluster.local:8088 786
Finally, restart the InfluxDB data StatefulSet to reload the shard data and rebuild the in-memory TSM indexes.
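For example, assuming the data StatefulSet is named sasquatch-influxdb-enterprise-data (matching the pod names above):
kubectl -n sasquatch rollout restart statefulset sasquatch-influxdb-enterprise-data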
Note
The difference between removing the shard files manually and using the influxd-ctl remove-shard command is that the remove-shard command removes the shard from both the meta node and the data node, while manually removing the shard TSM and index files only removes the shard data (the data node is still listed as an owner of that shard).