Mount Volumes
This page shows how to add volumes to your Spark applications.
Standard configuration
By default, Spark uses temporary storage to spill data to disk during shuffles and other operations when no volume is present. With Spark on Kubernetes, pods are initialized with an emptyDir volume for each directory listed in spark.local.dir. When the pod is deleted, the ephemeral storage will be cleared and the data removed.
For certain Spark applications with larger shuffle workfloads or a need to persist data beyond the life of the application, it may be desirable to mount an external volume. Depending on your cloud provider, there are several volume options you can choose from to make your Spark workspaces more dynamic and scalable.
By default, if you select a node type that includes local SSDs on AWS, such as d instances (e.g., m5d, r5d, c5d), Ocean Spark will help you leverage the performance of local SSDs. Read the next section to learn more.
Mounting local SSDs on AWS
For certain Spark applications with larger shuffle workfloads, the speed of the disk is the main bottleneck to achieve good Spark performance. This can be remediated by using faster disk types like NVMe-based SSDs.
Instances like d instances (e.g., m5d, r5d, c5d) or i3 instances all come with local SSDs. However, they must be mounted into the file system of the instance.
Ocean Spark provides a user data script that you can use to mount SSDs automatically. This script must be added in the User Data section of the Virtual Node Group (VNG) configuration tab in Ocean.
Be sure to fill in the variable EKS_CLUSTER_NAME with the name of your EKS cluster.
We recommend adding this user data script by default to all VNGs to be used for Spark applications.
Most likely, the Spark-dedicated VNGs in your EKS cluster already use this script (Ocean Spark adds it by default). You can verify this by browsing to the Virtual Node Group tab in the Ocean section of the Spot console.
Once the user data script is installed in the VNGs, Ocean Spark will automatically and transparently configure Spark to use the local SSDs for shuffle data. Two examples are provided below.
In this sample piece of configuration, m5d instances are used as executors:
      "executor": {
           "instanceSelector": "m5d"
        }
Ocean Spark will use the local SSDs for shuffle data. This will greatly enhance the performance of the Spark application if it uses a lot of shuffle.
In this sample piece of configuration, m5 instances are used as executors:
      "executor": {
           "instanceSelector": "m5"
        }
Ocean Spark will understand that there are no local SSDs and will use a non-SSD location on the host file system for shuffle data. If the CPU profile of the Spark application in the Spot console indicates a high amount of shuffle or IO, consider switching to an instance type with SSDs, like m5d.
The use of local SSDs for shuffle files is only supported in Spark 3 and above.
Mount secrets as files in volumes
Instead of setting environment variables from Kubernetes secrets, you can also mount secrets directly as files into a volume.
First, create a kubernetes secret in your cluster, and make sure to use the namespace spark-apps so your application can access it. Here is an example basic-auth secret below:
apiVersion: v1
kind: Secret
metadata:
  namespace: spark-apps
  name: basic-auth
data:
  password: cGFzc3dvcmQK # password
  user: dXNlcgo= # user
type: Opaque
In your application json config, add a volume object that references the secret name, data key you would like to include in the volume, and a name for the volume reference. In your executor or driver config, add a volumeMounts object that includes the volume name referenced in the previous step and path where you would like the file to be mounted. You can find an example application json config below that creates a volume named volume-secret that references the user key in the basic-auth secret and mounts that secret file to /opt/spark/work-dir/secrets on the executor pod(s).
{
  "executor": {
    "cores": 1,
    "instances": 3,
    "instanceType": "r5.xlarge",
    "volumeMounts": [
      {
        "mountPath": "/opt/spark/work-dir/secrets",
        "name": "volume-secret"
      }
    ]
  },
  "volumes": [
    {
      "secret": {
        "items": [
          {
            "key": "user",
            "path": "secret.yaml"
          }
        ],
        "secretName": "basic-auth"
      },
      "name": "volume-secret"
    }
  ]
}
If you kubectl exec into one of the executor pods, you will find the secrets.yaml file located at /opt/spark/work-dir/secrets.
Mount persistent cloud volumes to your container
For larger volume needs, sharing volumes across applications, or persisting data beyond an application's lifecycle, you can mount a cloud provider volume to your container. Instead of passing in a mount path, find the appropriate key for the cloud volume of your choice, and add the necessary data to the object. You can find the full list of supported volume options in the API reference for submitting applications.
To enable volume mounting, you must first create the volume instance in your respective cloud provider. Then, you must make sure your cluster role has read/write access to the volume.
On AWS, you can create an AWS Elastic Block Storage in the AWS console, or by running the following command:
aws ec2 create-volume --availability-zone <your availability zone> <name of EBS>
You can see more configuration options in the EBS documentation.
To mount an EBS volume to your spark application, add the following json to your application submission:
{
  "volumes": [
    {
      "name": "spark-aws-dir",
      "awsElasticBlockStore": {
        "fsType": "type of file system",
        "readOnly": false,
        "volumeID": "id of ebs"
      }
    }
  ]
}
Dynamically provision volumes using Persistent Volume Claims
As of Spark 3.1, you can dynamically provision and mount volumes to both your executor and driver pods using persistent volume claims. Persistent volume claims are kubernetes resources that allow you to allocate and mount resources of an elastic disk to your kubernetes pods. When your application is complete, kubernetes will remove the pod and delete the volume from your cloud provider. You can read more about the different configuration options in Running Spark on Kubernetes.
To enable PVC provisioning, you must add a few configuration settings to your sparkConf:
{
  "sparkConf": {
    "spark.kubernetes.executor.volumes.persistentVolumeClaim.data.options.claimName": "OnDemand",
    "spark.kubernetes.executor.volumes.persistentVolumeClaim.data.options.storageClass": "standard",
    "spark.kubernetes.executor.volumes.persistentVolumeClaim.data.options.sizeLimit": "500Gi",
    "spark.kubernetes.executor.volumes.persistentVolumeClaim.data.mount.path": "/var/data",
    "spark.kubernetes.executor.volumes.persistentVolumeClaim.data.mount.readOnly": "false"
  }
}
In the sample above, the claimName must be set to OnDemand. You can set the mount path to anywhere you like on the container, just make sure spark is aware of the drive and actually using it. You may need to alter the setting spark.local.dir to the location of your mount point.
In addition, you must also attach a cluster role of edit to the spark-driver service account. You can run the following command to attach the cluster role:
kubectl create clusterrolebinding <cluster-role-binding-name> --clusterrole=edit --serviceaccount=spark-apps:spark-driver --namespace=default
Dynamic PVC Reuse
With the release of Spark 3.2, you can now dynamically reuse persistent volume claims within the same application. This becomes particularly beneficial when using spot instances. In a normal Spark application leveraging spot instances for executors, if you lose a node to a spot kill, you will lose any shuffle or output data that was stored on that node, and Spark will be forced to recompute the results. With PVC reuse enabled, the Spark driver will maintain the PVC(s) of the lost spot instance, add new executors to the cluster, and attach the PVC(s) to the new executors, maintaining the work and progress of the previous node. This feature can significantly reduce application runtime and cost, offsetting one of the major drawbacks of using spot instances. To enable dynamic PVC reuse, add the following two lines, in addition to the configuration above, to your Spark conf:
{
    "sparkConf": {
       "spark.kubernetes.driver.reusePersistentVolumeClaim":"true",
       "spark.kubernetes.driver.ownPersistentVolumeClaim":"true"
    }
}
Related Topics
Take the Product Tour and learn about everything you can do in the Ocean Spark console.