A predictor that estimates how many instances to scale up based on historical invocation traces.
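The actual prediction logic lives in this repo's code; purely as an illustration of the idea (the function and parameter names below are made up for the example, not taken from the repo), a trace-based predictor can be as simple as scaling a recent average invocation rate by an assumed per-instance capacity:

```python
import math

def predict_instances(invocation_trace, per_instance_capacity=100, window=10):
    """Illustrative sketch only: estimate instances from recent invocation counts.

    invocation_trace: invocation counts per time bucket, most recent last.
    per_instance_capacity: hypothetical number of invocations one instance
    can absorb per bucket (not a parameter defined by this project).
    """
    if not invocation_trace:
        return 0
    recent = invocation_trace[-window:]
    avg_rate = sum(recent) / len(recent)
    return math.ceil(avg_rate / per_instance_capacity)

# e.g. predict_instances([80, 120, 150, 200]) -> 2 with the default capacity
```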
For local testing and development, run:

```
pip install -r requirements.txt
python3 main.py
```

The gRPC server starts on `localhost:50051`.
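To sanity-check that the server came up, you can probe the channel with grpcio. This only verifies connectivity; the service and method names are defined by the project's .proto file and are not assumed here:

```python
# Connectivity check for the locally running predictor.
import grpc

channel = grpc.insecure_channel("localhost:50051")
try:
    grpc.channel_ready_future(channel).result(timeout=5)
    print("scale-predictor gRPC server is reachable on localhost:50051")
except grpc.FutureTimeoutError:
    print("could not reach the gRPC server on localhost:50051")
```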
You can pull and run scale-predictor directly from Docker Hub:

```
docker pull zhaidea/scale-predictor:latest
docker run -p 50051:50051 zhaidea/scale-predictor
```
Alternatively, build your own image from the Dockerfile:

```
docker build -t scale-predictor .
docker run -p 50051:50051 scale-predictor
```
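Either way, you can confirm the container is running and inspect its output with standard Docker commands:

```
docker ps                     # the scale-predictor container should show port 50051 published
docker logs <container_id>    # use the ID or name reported by docker ps
```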
Upload your image:

```
docker login -u <your_username> -p <your_password>
docker build -t <username>/scale-predictor:latest .
docker push <username>/scale-predictor:latest
```
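If you prefer not to put the password on the command line, `docker login` also accepts it on standard input:

```
echo "<your_password>" | docker login -u <your_username> --password-stdin
```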
Then change `zhaidea` in `config/predictor.yaml` to your Docker Hub username so the deployment pulls your image.
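For orientation only, the image reference in a typical Kubernetes Deployment looks like the excerpt below; the actual layout of `config/predictor.yaml` may differ, so treat everything except the image string as an assumption:

```yaml
# Hypothetical excerpt -- the real config/predictor.yaml may be structured differently.
# The relevant part is the image field: replace the zhaidea prefix with your username.
spec:
  template:
    spec:
      containers:
        - name: scale-predictor
          image: <username>/scale-predictor:latest   # was zhaidea/scale-predictor:latest
          ports:
            - containerPort: 50051
```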
On a Kubernetes cluster:

```
kubectl apply -f config/predictor.yaml -n knative-serving
kubectl apply -f config/predictor-service.yaml -n knative-serving
```
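You can then check that the predictor pod and service are up in the namespace:

```
kubectl get pods -n knative-serving
kubectl get svc -n knative-serving
```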