🚀 AgentPool has separate levers for `cooldownPeriodSeconds` and `scalingPeriodSeconds` #341

starlightromero · 2024-02-14T16:26:55Z

Description

Currently cooldownPeriodSeconds affects the time to wait between scaling events. It would be useful if the time to wait between scaling events could be detached from the time the agents stick around after a run.

I propose scalingPeriodSeconds is the time to wait between scaling events. And cooldownPeriodSeconds is the time to wait after a run before starting scalingPeriodSeconds.

Potential YAML Configuration

apiVersion: app.terraform.io/v1alpha2
kind: AgentPool
metadata:
  name: this
  namespace: default
spec:
  organization: kubernetes-operator
  token:
    secretKeyRef:
      name: tfc-operator
      key: token
  name: agent-pool-demo
  agentTokens:
    - name: white
    - name: blue
    - name: red
  agentDeployment:
    replicas: 3
    spec:
      containers:
        - name: tfc-agent
          image: "hashicorp/tfc-agent:latest"
  autoscaling:
    minReplicas: 2
    maxReplicas: 4
    cooldownPeriodSeconds: 300
    scalingPeriodSeconds: 30

References

N/A

Community Note

Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request.
Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request.
If you are interested in working on this issue or have submitted a pull request, please leave a comment.

The text was updated successfully, but these errors were encountered:

sheneska · 2024-02-21T17:21:24Z

Hi @starlightromero, could you please provide more context on what exactly you are asking re cooldownPeriodSeconds?

briantist · 2024-02-21T21:46:15Z

@sheneska another way to put it might be to have separate cooldowns for scale-out vs scale-in.

It's of particular concern when scaling to zero, because no agents will be launched until the cooldown period expires, no matter how big the queue is.

Right now, we work around it by having a very short cooldown period (like 1 minute), so that if there are no agents it takes at most a minute to launch one.

The downside of this is that agents disappear very quickly after a run, and having to relaunch one takes a bit of time, so it adds delay to the next run.

Ideally, after being launched, an agent sticks around for a bit, maybe 30 minutes or whatever, so that subsequent runs have an available agent to use. But if we set cooldown to 30 minutes, and it scales to zero, and then 2 minutes later we have another run, that run will wait for 28 minutes before another agent is launched.

So the ability to have asymmetrical cooldown times would be especially helpful: we want to be able to quickly scale-out in response to load, and scale-in more slowly to reduce latency for runs that start closely in time.

marianopeterson · 2024-03-18T17:20:21Z

the ability to have asymmetrical cooldown times would be especially helpful: we want to be able to quickly scale-out in response to load, and scale-in more slowly to reduce latency for runs that start closely in time.

It's important to me to be able to manage time to scale up independently from time to scale down, so that I can better manage the tradeoff between cost control and user experience.

alexsomesan · 2024-05-07T16:56:09Z

Thanks for the additional context. It's really helping us understand the impact of this potential change. We've included it as a candidate for our next round of planning.

iBrandyJackson added enhancement New feature or request acknowledged labels Mar 28, 2024

jrhouston mentioned this issue Jul 19, 2024

Add seperate fields for configuring autoscaler scale-up and scale-down #441

Merged

arybolovlev closed this as completed in #441 Jul 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚀 AgentPool has separate levers for `cooldownPeriodSeconds` and `scalingPeriodSeconds` #341

🚀 AgentPool has separate levers for `cooldownPeriodSeconds` and `scalingPeriodSeconds` #341

starlightromero commented Feb 14, 2024

sheneska commented Feb 21, 2024

briantist commented Feb 21, 2024 •

edited

Loading

marianopeterson commented Mar 18, 2024

alexsomesan commented May 7, 2024

🚀 AgentPool has separate levers for cooldownPeriodSeconds and scalingPeriodSeconds #341

🚀 AgentPool has separate levers for cooldownPeriodSeconds and scalingPeriodSeconds #341

Comments

starlightromero commented Feb 14, 2024

Description

Potential YAML Configuration

References

Community Note

sheneska commented Feb 21, 2024

briantist commented Feb 21, 2024 • edited Loading

marianopeterson commented Mar 18, 2024

alexsomesan commented May 7, 2024

🚀 AgentPool has separate levers for `cooldownPeriodSeconds` and `scalingPeriodSeconds` #341

🚀 AgentPool has separate levers for `cooldownPeriodSeconds` and `scalingPeriodSeconds` #341

briantist commented Feb 21, 2024 •

edited

Loading