Helm Chart for Deploying funcX stack
This application includes:
- FuncX Web-Service
- FuncX Websocket Service
- FuncX Forwarder
- Kuberentes endpoint
- Postgres database
- Redis Shared Data Structure
- RabbitMQ broker
There are two modes in which funcx-endpoints could be deployed:
- funcx-endpoint deployed outside k8s, connecting to hosted services in k8s
- funcx-endpoint deployed inside k8s
NOTE
This only works on Linux systems.
Here are the steps to install, preferably into your active conda environment:
git clone https://github.com/funcx-faas/funcX.git
cd funcX
git checkout main
pip install funcx_sdk
pip install funcx_endpoint
Next create an endpoint configuration:
funcx-endpoint
Update the endpoint's configuration file to point the endpoint to locally deployed services, which we will setup in the next sections. If using default values, the funcx_service_address should be set to http://localhost:5000/v2.
~/.funcx/default/config.py
config = Config(
executors=[HighThroughputExecutor(
provider=LocalProvider(
init_blocks=1,
min_blocks=0,
max_blocks=1,
),
)],
funcx_service_address="http://127.0.0.1:5000/api/v1", # <--- UPDATE THIS LINE
)
We can deploy the kubernetes endpoint as a pod as part of the chart. It
needs to have a valid copy of the funcx's funcx_sdk_tokens.json
which can
be created by running on your local workstation and running
funcx-endpoint start
You will be prompted to follow the authorization link and paste the resulting
token into the console. Once you do that, funcx-endpoint will create a
~/.funcx
directory and provide you with a token file.
The Kubernetes endpoint expects this file to be available as a Kubernetes
secret named funcx-sdk-tokens
.
You can install this secret with:
kubectl create secret generic funcx-sdk-tokens \
--from-file ~/.funcx/credentials/funcx_sdk_tokens.json
-
Update cloudformation stack if necessary
-
Make a clone of this repository
-
Download subcharts:
helm dependency update funcx
-
Create your own
values.yaml
inside the Git ignored directorydeployed_values/
-
Obtain Globus Client ID and Secret. These secrets need to exist in the correct Globus Auth app. Ask for access to the credentials by contacting https://github.com/BenGalewsky or sending a message to the
dev
funcx Slack channel. Once you have your credentials, paste them into yourvalues.yaml
:webService: globusClient: <<your app client>> globusKey: <<your app secret>>
-
Install the helm chart:
helm install -f deployed_values/values.yaml funcx funcx
-
You can access your web service through the ingress or via a port forward to the web service pod. Instructions are provided in the displayed notes.
-
You should be able to see the endpoint registering with the web service in their respective logs, along with the forwarder log. Check the endpoint's logs for its ID.
⚠️ Only for debugging: You can set the forwarder curve server key manually by creating a public/secret curve pair, registering them with kubernetes as a secret and then specifyingforwarder.server_cert = true
. By default the forwarder auto generates keys, and distributes it to the endpoint during registration.
To manually setup the keys for debugging, here are the steps:
-
Create your curve server public/secret keypair with the create_certs.py script:
# CD into the funcx-forwarder repo python3 test/create_certs.py -d .curve
-
Similar to the
funcx-sdk-tokens
add the public/secret pair as kubernetes secret:funcx-forwarder-secrets
for the forwarder service.
# Make sure the server.key_secret file is in your $PWD/.curve dir
kubectl create secret generic funcx-forwarder-secrets --from-file=.curve/server.key --from-file=.curve/server.key_secret
- Once the endpoint is registered to the newly deployed
funcx-forwarder
, make sure to check the~/.funcx/<ENDPOINT_NAME>/certificates/server.key
file to confirm that the manually added key has been returned to the endpoint.
Default configuration values come from funcx/values.yaml
. These can be
overridden per-deployment by placing replacements in the non-version-controlled
deployed_values/values.yaml
- for example, the globusClient/globusKey values
earlier in the install instructions.
This is a recommended initial set of values to override:
webService:
globusClient: <GLOBUS_CLIENT_ID_STRING>
globusKey: <GLOBUS_CLIENT_KEY_STRING>
# Note that we install numpy into the worker so that we can run tests against the local
# deployment
# Note that the workerImage needs the same python version as is used in the funcx_endpoint
# image.
funcx_endpoint:
image:
pullPolicy: Always
tag: main
maxBlocks: 2
initBlocks: 0
minBlocks: 2
workerInit: pip install funcx-endpoint==0.3.2 numpy
workerImage: python:3.7-buster
rabbitmq:
auth:
erlangCookie: c00kie
pullPolicy: Always
Here are some values that can be overriden:
Value | Desciption | Default |
---|---|---|
secrets |
Name of a secret deployed into the cluster. Must follow example_secrets.yaml | - |
webService.image |
Docker image name for the web service | funcx/web-service |
webService.tag |
Docker image tag for the web service | 213_helm_chart |
webService.pullPolicy |
Kubernetes pull policy for the web service container | IfNotPresent |
webService.loglevel |
Setting for the App logging level | DEBUG |
webService.advertisedRedisPort |
Redis port that the forwarder (outside of cluster) can reach | 6379 |
webService.advertisedRedisHost |
Redis host that the forwarder (outside of cluster) can reach | localhost |
webService.globusClient |
Client ID for globus app. Obtained from http://developers.globus.org | |
webService.globusKey |
Secret for globus app. Obtained from http://developers.globus.org | |
webService.replicas |
Number of replica web services to deploy | 1 |
endpoint.enabled |
Deploy an internal kubernetes endpoint? | true |
endpoint.replicas |
Number of replica endpoint pods to deploy | 1 |
endpoint.image |
Docker image name for the endpoint | funcx/kube-endpoint |
endpoint.tag |
Docker image tag for the endpoint | 213_helm_chart |
endpoint.pullPolicy |
Kubernetes pull policy for the endpoint container | IfNotPresent |
forwarder.enabled |
Deploy an internal kubernetes forwarder? | true |
forwarder.minInterchangePort |
The minimum port to assign interchanges. This will be the first port opened int he pod | 54000 |
forwarder.maxInterchangePort |
The maximum port to assign interchanges. Only the first three ports are opened in the pod | 54002 |
forwarder.image |
Docker image name for the forwarder | funcx/funcx/forwarder |
forwarder.tag |
Docker image tag for the forwarder | dev |
forwarder.pullPolicy |
Kubernetes pull policy for the forwarder container | IfNotPresent |
ingress.enabled |
Deploy an ingres to route traffic to web app? | false |
ingress.host |
Host name for the ingress. You will be able to reach your web service via a url that starts with the helm release name and ends with this host | uc.ssl-hep.org |
services.postgres.enabled |
Deploy postgres along with service? | true |
services.postgres.externalURI |
If postgres is deployed externally, what URI connects to it? | sqlite:////sqlite/app.db |
services.redis.enabled |
Deploy redis along with service? | true |
services.redis.externalHost |
If redis is deployed externally, what is the host name? | |
services.redis.externalPort |
If redis is deployed externally, what is the port? | 6379 |
storage.awsSecrets |
Name of Kubernetes secret that holds the AWS credentials | |
storage.s3Bucket |
S3 bucket where storage will write results and payloads | funcx |
storage.redisThreshold |
Threshold to switch large results to S3 from redis (in bytes) | 20000 |
storage.s3ServiceAccount |
Kubernetes Service Account to use for accessing AWS access tokens (blank to skip service account) |
The chart can take advantage of Bitnami's sealed secrets controller to encrypt sensitive config data so it can safely be checked into the GitHub repo.
Create a secrets.yaml file with these values (the values need to be base64
encoded (use echo "secret value" | base64
)). Use the file example_secrets.yaml to see
how the file should be formatted and the names of the secrets.
With kubectl configured to talk to the target cluster, encode the secrets file with the command:
cat local-dev-secrets.yaml | \
kubeseal --controller-namespace kube-system \
--controller-name sealed-secrets \
--format yaml > local-dev-sealed-secrets.yaml
The globus accounts need a role change before you get the keys, before your default account usually doesn't come with many privileges. The script below fetches your keys and creates yaml file that can be installed into your cluster:
Example yaml for references:
apiVersion: v1
kind: Secret
metadata:
name: mysecret
type: Opaque
data:
AWS_ACCESS_KEY_ID: <<base64 encoded access key>>
AWS_SECRET_ACCESS_KEY: <<base64 encoded secret>>
AWS_SESSION_TOKEN: |-
<<base64
encoded
multiline
session_token>>
# First get the target role ARN
# assuming that you are using assume-role configuration in ~/.aws/config , you
# can get this from your current profile's role_arn value
# if you are not using assume-role to access the account where you intend to
# access an S3, these steps may vary
role_arn=$(aws configure get 'role_arn')
echo "Role Arn: $role_arn"
# parse sts assume-role output into a shell array (works in bash, zsh)
# you can `echo "${credarr[1]}"` to see the access key, for example
credarr=("$(aws sts assume-role \
--role-arn "$role_arn" --role-session-name test1 \
--query 'Credentials.[AccessKeyId,SecretAccessKey,SessionToken]' \
--output text | tr '\t' '\n')")
# base64 encode these credentials
access_key_id=$(echo -n "${credarr[1]}" | base64)
secret_key=$(echo -n "${credarr[2]}" | base64)
session_token=$(echo -n "${credarr[3]}" | base64)
cat <<EOF > secret.token.yaml
apiVersion: v1
kind: Secret
metadata:
name: mysecret
type: Opaque
data:
AWS_ACCESS_KEY_ID: $access_key_id
AWS_SECRET_ACCESS_KEY: $secret_key
AWS_SESSION_TOKEN: |-
$session_token
EOF
This chart uses two subcharts to supply dependent services. You can update settings for these by referenceing the subchart name and values from their READMEs.
For example
postgresql.postgresqlUsername: funcx
Subchart | Link to Documentation |
---|---|
postgresql | https://github.com/bitnami/charts/tree/master/bitnami/postgresql |
redis | https://github.com/bitnami/charts/tree/master/bitnami/redis |
It's common for the database to have limited external access. In this case it's easier to create a busy-box inside the cluster and access plsql internally.
In the scripts directory there is psql-busybox.yaml
. Create the pod with
$ kubectl create -f scripts/psql-busybox.yaml
You can then create a shell with kubectl exec -it psql bash
Inside that shell there is a fun pg sql client which can be invoked with the
same Postgres URL found in the web app's config file (/opt/funcx/app.conf
)
pgcli postgresql://funcx:XXXXXXXXXXXX@funcx-production-db.XXXXXX.rds.amazonaws.com:5432/funcx
The following is an incomplete guide to deploying a new release onto our development or production clusters.
Here are the components that need updating as part of a release, in the order they should be updated due to dependencies. Note that only components that have changes for release need to updated and the rest can safely be skipped:
-
funcx-forwarder
- Update version number
- merge above changes to main in a PR
- Create a branch off of main with the version number, for, eg: 'v0.3.3'.
For dev releases, do alpha releases
v0.3.3a0
- Ensure that the branch has the CI tests passing and the publish step working
-
funcx-web-service
- Same steps as funcx-forwarder
-
funcx-websocket-service
- Same steps as funcx-websocket-service
-
Update helm-charts
- Update the smoke-tests in the helm-charts to use the new version numbers in
conftest.py
- Update the smoke-tests in the helm-charts to use the new version numbers in
-
Prepare to deploy to cluster.
- Confirm that all the bits to be deployed should be available on dockerhub.
- Run
kubectl config current-context
which should return something like:
arn:aws:eks:us-east-1:512084481048:cluster/funcx-dev
- Make sure the right cluster is pointed to by kubectl, and use this terminal for all following steps.
-
Download the current values deployed to the target cluster as a backup. Note: you can use this as a base values.yaml.
helm get values funcx > enviornment.yaml
-
Update the values to use the release branchnames as the new tags
-
Deploy with:
helm upgrade -f prod-values.yaml funcx funcx
⚠️ It is preferable to upgrade rather than blow away the current deployment and redeploy because, wiping the current deployment loses state that ties the Route53 entries to point at the ALB, and any configuration on the ALB itself could be lost.
⚠️ If the deployment was wiped here are the steps: * Go to Route53 on AWS Console and select the hosted zone:dev.funcx.org
. Select the appropriate A record for the deployment you are updating and edit the record to update the value to something likedualstack.k8s-default-funcxfun-dd14845f35-608065658.us-east-1.elb.amazonaws.com.
* Add the ALB to the existing WAF Rules here:https://console.aws.amazon.com/wafv2/homev2/web-acl/funcx-prod-web-acl/d82023f9-2cd8-4aed-b8e3-460dd399f4b0/overview?region=us-east-1#
-
While a new forwarder will be launched on upgrade, the new one will not go online since it requires the ports that are in use by the older one. So you must manually delete the older funcx-forwarder pod.
kubectl get pods # Find the older funcx-forwarder pod
kubctl delete pods <NAME_OF_THE_OLDER_FUNCX_FORWARDER>
It is occasionally useful to deploy a full FuncX stack in the dev cluster under
a different namespace. This is useful when two developers are both working on
or debugging a feature as well as to verify a feature works as expected before
potentially deploying to the main dev environment deployment. These
instructions will get a second FuncX deployment (with k8s based redis,
postgres, and rabbitmq) running at a specified host under *.api.dev.funcx.org
.
- To avoid forwarder port conflicts, ensure at least as many nodes are running
in EKS as there will be forwarder deployments since forwarders rely on host
ports to be addressable. To scale the node group you can use
eksctl scale nodegroup --cluster=funcx-dev --name=funcx-dev-node-group --nodes=2 --nodes-max=2
wherenodes-max
andnodes
are set to as many as are needed. - Create a new namespace for your deployment: e.g.
kubectl create namespace josh-funcx
- Create a
values.yaml
that includes information about the host name to use in the ingress definition. E.g.: ingress: enabled: true host: josh-test.dev.funcx.org name: dev-lb subnets: subnet-0c0d6b32bb57c39b2, subnet-0906da1c44cbe3b8d use_alb: true - Install the helm chart as described above, but specifying the new
values.yaml
file and the namespace. E.g.:helm install -f deployed_values/values.yaml josh-funcx funcx --namespace
- Create a new route53 record for the given host (josh-test.dev.funcx.org).
We won't have to do this after external dns has been enabled.