Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tracking Issue for Managing Low Disk Space Conditions #1700

Open
1 of 13 tasks
smklein opened this issue Sep 11, 2022 · 2 comments
Open
1 of 13 tasks

Tracking Issue for Managing Low Disk Space Conditions #1700

smklein opened this issue Sep 11, 2022 · 2 comments
Labels
mvp nexus Related to nexus Sled Agent Related to the Per-Sled Configuration and Management storage Related to storage.
Milestone

Comments

@smklein
Copy link
Collaborator

smklein commented Sep 11, 2022

Spawned from RFD 312

  • Tracking the right info in the DB
    • Store the "maximum allowed size" of a dataset
    • Add constraints where appropriate to limit this size (e.g., datasets < zpool size, regions < dataset size)
  • Quotas & Reservations
    • (Nexus) Determine what Quotas & Reservations should be for all known datasets
    • (Nexus) Request that quotas/reservations are set in a request to the Sled Agent
    • (Sled Agent) Actually apply Quotas & Reservations
    • (Nexus) Request a "ballast" for reserved capacity explicitly
  • Scaling
    • (Sled Agent) Add metric to periodically monitor space usage
    • (Nexus) Periodically query space usage of datasets, and take actions at certain thresholds...
      • ... When expected, perform horizontal scaling
      • ... Identify low-space conditions (long-term, we should be providing an alert to operators if we cannot remedy the situation ourselves).
@smklein smklein added storage Related to storage. Sled Agent Related to the Per-Sled Configuration and Management nexus Related to nexus labels Sep 11, 2022
@smklein smklein added this to the MVP milestone Jan 23, 2023
@smklein smklein added the mvp label Jan 23, 2023
@askfongjojo
Copy link

When considering caps and reservations, we probably need to consider DRAM and CPU as well so there is room for VM migration. If the sleds are all nearly full, it will be impossible to evacuate an entire sled.

@smklein
Copy link
Collaborator Author

smklein commented Feb 24, 2023

When considering caps and reservations, we probably need to consider DRAM and CPU as well so there is room for VM migration. If the sleds are all nearly full, it will be impossible to evacuate an entire sled.

Agreed, #2299 is my first attempt to capture this information

@smklein smklein changed the title Tracking Issue for Managing Low-Space Conditions Tracking Issue for Managing Low Disk Space Conditions Mar 23, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
mvp nexus Related to nexus Sled Agent Related to the Per-Sled Configuration and Management storage Related to storage.
Projects
None yet
Development

No branches or pull requests

2 participants