Autoscaling automatically increases or decreases computing resources based on usage, so that you use only the resources you need. The following tutorials show how to implement autoscaling for your services.
Tutorial - Autoscaling services using CPU and memory
Autoscaling Marathon services using CPU and memory…
Tutorial - Autoscaling using requests per second
Setting up microscaling based on requests per second…
Tutorial - Microscaling
Understanding microscaling based on queue length…
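As a rough illustration of the idea behind these tutorials, the sketch below shows how a simple threshold-based autoscaler might decide on an instance count from a usage figure. It is not taken from any of the tutorials: the function name `desired_instances`, the thresholds, and the instance limits are all hypothetical values chosen for the example, and a real setup would read metrics from the cluster and apply the result through the scheduler.

```python
def desired_instances(current, usage, scale_up_at=0.8, scale_down_at=0.3,
                      min_instances=1, max_instances=10):
    """Return the instance count a simple threshold autoscaler would request.

    `usage` is the average utilization across instances as a fraction
    (0.0 to 1.0). All default values here are illustrative assumptions.
    """
    if usage > scale_up_at:
        # Usage is high: request one more instance.
        target = current + 1
    elif usage < scale_down_at:
        # Usage is low: release one instance.
        target = current - 1
    else:
        # Usage is within the acceptable band: keep the current count.
        target = current
    # Never go below the minimum or above the maximum.
    return max(min_instances, min(max_instances, target))


if __name__ == "__main__":
    for current, usage in [(2, 0.9), (2, 0.1), (4, 0.5)]:
        print(current, usage, desired_instances(current, usage))
```

Scaling by one instance at a time and clamping to a fixed range keeps the example easy to follow; the same structure would apply if the usage figure were CPU, memory, requests per second, or queue length.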