Optimize log recovery time #2028

@LiebingYu


Search before asking

  • I searched in the issues and found nothing similar.

Description

In #1749 we introduced log recovery after an unclean shutdown. However, on production clusters we have observed that this recovery takes excessively long. When Fluss runs in Kubernetes, the TabletServer can be killed by health checks that time out during recovery, which leads to repeated restarts. The recovery process therefore needs to be optimized to reduce recovery time.

Willingness to contribute

  • I'm willing to submit a PR!
