Create KB article (Best Practices for Optimizing Longhorn Disk Performance) #51

jillian-maroket · 2023-12-27T10:20:22Z

This KB article addresses a customer request for best practice recommendations regarding Harvester configuration with disk performance in mind.

Jira: SURE-6729
GitHub issue: harvester/harvester#3356

David Ko and the Longhorn engineering team provided most of the content. I added supporting details from the Longhorn documentation.

netlify · 2023-12-27T10:20:42Z

✅ Deploy Preview for harvester-home-preview ready!

Name	Link
🔨 Latest commit	`1f04d45`
🔍 Latest deploy log	https://app.netlify.com/sites/harvester-home-preview/deploys/659e4121c10fa5000825bfeb
😎 Deploy Preview	https://deploy-preview-51--harvester-home-preview.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

bk201

Since it's in Harvester KB, I suggest adding Harvester doc links in the KB if possible (we can always add Longhorn doc as a reference too).

kb/2023-12-27/best_practices_disk_performance.md

bk201 · 2023-12-28T01:26:12Z

kb/2023-12-27/best_practices_disk_performance.md

+
+  For data-intensive applications, you can use pod scheduling functions such as node selector or taint toleration. These functions allow you to schedule the workload to a specific storage-tagged node together with one replica.  
+
+- **Revision counter**: You can disable the [revision counter](https://longhorn.io/docs/1.5.3/advanced-resources/deploy/revision_counter/) to improve IO performance, especially for write-intensive applications. When the revision counter is disabled, Longhorn does not track write operations for replicas and the Longhorn Engine does not check the `revision.counter` file after restarting.


@innobead Do we want to recommend this for production environments?

@innobead Can you comment on this?

Let's remove this from this practice, as the focus here should be on actions that are 'recommended' be taken.

However, this depends on a very strict requirement already mentioned in the 'important information' below, and it could be challenging to achieve in a prod setup. Additionally, it's not a recommendation but rather an optional method to enhance performance.

I have removed this item.

kb/2023-12-27/best_practices_disk_performance.md

bk201 · 2023-12-28T01:28:41Z

kb/2023-12-27/best_practices_disk_performance.md

+
+- **Recurring snapshots**: Periodically clean up system-generated snapshots and retain only the number of snapshots that makes sense for your implementation. For applications with replication capability, periodically [delete all types of snapshots](https://longhorn.io/docs/1.5.3/concepts/#243-deleting-snapshots).
+
+- **Recurring filesystem trim**: Periodically [trim the filesystem](https://longhorn.io/docs/1.5.3/volumes-and-nodes/trim-filesystem/) inside volumes to reclaim disk space.


@innobead I suggest we remove this. Most harvester VMs use block mode volume, I'm not sure if the trim operation works here or not. (Probably doing a trim inside VM?)

@innobead Can you comment on this?

Agreed, let's remove this as it only works for the fs mount point (volume mode: fs).

Longhorn volume supports unmap, which should continue functioning if users perform trim on the VM fs. However, this should not be recommended, as there is a known issue at harvester/harvester#4739. Additionally, the user experience is not good.

I have removed this item.

LucasSaintarbor

LGTM 👍

innobead

In general, LGTM. After resolve the comments, we can merge this.

However, we continue improving this doc over time, since there will be some new practices to be introduced later. For example, snapshot space management, default priority class, etc.

@jillian-maroket We should have one in Longhorn doc as well, but it should be in a dedicated in https://longhorn.io/docs/1.6.0/best-practices/. Please help create a ticket for that and plan it.

Add folder and best practices kb article

b1a5a4f

jillian-maroket requested review from innobead and bk201 December 27, 2023 10:20

jillian-maroket requested a review from LucasSaintarbor December 28, 2023 00:10

bk201 reviewed Dec 28, 2023

View reviewed changes

Use doc links provided by Kiefer

b50c27c

LucasSaintarbor approved these changes Dec 28, 2023

View reviewed changes

innobead reviewed Jan 10, 2024

View reviewed changes

Remove 2 items per David's feedback

1f04d45

jillian-maroket merged commit cb6902f into harvester:main Jan 10, 2024
5 checks passed

jillian-maroket mentioned this pull request Jan 12, 2024

Add disk performance optimization measures to best-practices.md longhorn/website#840

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create KB article (Best Practices for Optimizing Longhorn Disk Performance) #51

Create KB article (Best Practices for Optimizing Longhorn Disk Performance) #51

jillian-maroket commented Dec 27, 2023

netlify bot commented Dec 27, 2023 •

edited

Loading

bk201 left a comment

bk201 Dec 28, 2023

jillian-maroket Jan 2, 2024

innobead Jan 10, 2024

jillian-maroket Jan 10, 2024

bk201 Dec 28, 2023

jillian-maroket Jan 2, 2024

innobead Jan 10, 2024

jillian-maroket Jan 10, 2024

LucasSaintarbor left a comment

innobead left a comment •

edited

Loading


		For data-intensive applications, you can use pod scheduling functions such as node selector or taint toleration. These functions allow you to schedule the workload to a specific storage-tagged node together with one replica.

		- Revision counter: You can disable the [revision counter](https://longhorn.io/docs/1.5.3/advanced-resources/deploy/revision_counter/) to improve IO performance, especially for write-intensive applications. When the revision counter is disabled, Longhorn does not track write operations for replicas and the Longhorn Engine does not check the `revision.counter` file after restarting.


		- Recurring snapshots: Periodically clean up system-generated snapshots and retain only the number of snapshots that makes sense for your implementation. For applications with replication capability, periodically [delete all types of snapshots](https://longhorn.io/docs/1.5.3/concepts/#243-deleting-snapshots).

		- Recurring filesystem trim: Periodically [trim the filesystem](https://longhorn.io/docs/1.5.3/volumes-and-nodes/trim-filesystem/) inside volumes to reclaim disk space.

Create KB article (Best Practices for Optimizing Longhorn Disk Performance) #51

Create KB article (Best Practices for Optimizing Longhorn Disk Performance) #51

Conversation

jillian-maroket commented Dec 27, 2023

netlify bot commented Dec 27, 2023 • edited Loading

✅ Deploy Preview for harvester-home-preview ready!

bk201 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

LucasSaintarbor left a comment

Choose a reason for hiding this comment

innobead left a comment • edited Loading

Choose a reason for hiding this comment

netlify bot commented Dec 27, 2023 •

edited

Loading

innobead left a comment •

edited

Loading