Scaling

Scaling (increasing the number of Floom instances) is needed when handling a large amount of traffic or running heavy processing such as video, audio, or speech generation.

Because Floom runs as a container, you can scale it horizontally by running more container instances, up to the capacity of your infrastructure.

In Kubernetes, for example, you can add a HorizontalPodAutoscaler (HPA) to scale Floom automatically:

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: floom-core-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: floom-core
  minReplicas: 1
  maxReplicas: 10 # Adjust max replicas according to your needs
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 80 # Scale out when average CPU usage exceeds 80% of the requested CPU
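
A Utilization target is measured as a percentage of each container's CPU request, so the target Deployment needs a CPU request for the HPA to have something to compare against, and the cluster needs a metrics source such as metrics-server. The manifest below is a minimal sketch of what that Deployment might look like. The `floom-core` name comes from the HPA above; the label, container name, image reference, and the `500m` request are assumptions for illustration, not values taken from Floom's own manifests.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: floom-core              # must match scaleTargetRef.name in the HPA
spec:
  # replicas is left unset so the HPA controls the replica count
  selector:
    matchLabels:
      app: floom-core           # hypothetical label
  template:
    metadata:
      labels:
        app: floom-core         # hypothetical label, must match the selector
    spec:
      containers:
        - name: floom-core      # hypothetical container name
          image: your-floom-image:tag   # placeholder, replace with your Floom image
          resources:
            requests:
              cpu: 500m         # illustrative; 80% utilization then means 400m average per pod
```

With these assumed values, the HPA would add replicas once average CPU usage across the pods rises above roughly 400m per pod, and remove them again as usage falls, always staying between minReplicas and maxReplicas.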