Autoscaling
This section goes over the various scaling options that are available on Porter. Note that these scaling options are available on both Web and Worker services.
Horizontal Scaling
You can set the number of replicas you’d like to run for each of your application service from the Resources tab.
The load on your application service will be automatically be balanced across all the available replicas.
Autoscaling
You can configure your services to scale automatically based on resource usage. Autoscaling on your services will be triggered based on the CPU or RAM usage threshold you configure in percentage value.
For example, if you set autoscaling threshold to be 70% CPU and 70% RAM, your application service will be scaled up and down to ensure that there is always 30% excess capacity for CPU and RAM. Note that autoscaling will be triggered when either one of the CPU or RAM threshold is hit (i.e. it is not required that both thresholds are hit).
Custom Autoscaling
Porter supports autoscaling based on custom metrics (e.g. queue latency, queue length, etc.). Your cluster comes with Prometheus installed by default, which collects various metrics available on your cluster.
For detailed instructions on setting up custom metrics and autoscaling, see our Custom Metrics and Autoscaling guide.