Introducing Arlon! Your Kubernetes Cluster Management Assembly Line • Platform9

We are launching Arlon – a Kubernetes native tool for complete end-to-end cluster configuration and lifecycle management, security policy and application, using a declarative approach.

In this post, I will discuss why declarative management of infrastructure is important for your Kubernetes clusters, challenges around Kubernetes cluster lifecycle management and how the Cluster API (CAPI) project addresses it. Learn the approach Arlon takes towards solving these problems.

“It Works On My Cluster”

As Kubernetes deployments are starting to scale, enterprises are starting to deal with a chaotic environment of cluster and workload sprawl.

The problem looks something like this – distributed dev and ops teams within a large company start deploying multiple Kubernetes clusters – across their local workstations, data centers, public clouds, edge sites, and at times, on-premises at end-customer sites.

Clusters deployed by different teams are created using a combination of homegrown automation, hand-crafted clusters, or using tools like kubeadm. This divergence in how the clusters are deployed means that the desired configurations are not fully replicated across different clusters. So, different clusters may not meet the same configuration for node size, autoscaling, networking, and RBAC policies that are critical for governance and security.

The result? Bugs that appear only on production sites and that are not replicated on dev or staging. The issue is not with the app, but the configuration of infrastructure on which the app is running.

The Benefit of the Cluster API

The CAPI project helps address some of the challenges mentioned above. The project uses Kubernetes-style declarative APIs and constructs to fully automate cluster lifecycle management, across a broad range of infrastructure and public cloud options. This means operators can define their clusters using the familiar yaml format, then use the CAPI tooling to deploy the clusters.

Today, CAPI is a key Kubernetes project with many users and contributors.

The Cluster Creation Assembly Line Problem

While the CAPI project makes a significant contribution towards simplifying the cluster creation problem, it does not fully address the problem of creating what we call “complete clusters” in a repeatable and predictable manner. Let me explain why.

In a typical large organization, the creation of a production Kubernetes cluster that is considered “ready” to deploy production applications is a multi-step process. The cluster infrastructure configuration is only a part of this process. The cluster must go through several steps of processing before it’s ready for prime time.

For example, let’s say that you are using AWS for all your k8s deployments. Once the basic cluster is created with the right infrastructure configuration,

The DevOps team needs to ensure that the right observability tools are deployed on it to ensure correct monitoring for the cluster.
The Security team must ensure that the cluster has proper access control rules configured, that network security policies are set up properly, etc.
The App Developers must ensure that the prerequisite middleware applications that the application relies on are installed.
Finally, the application itself gets deployed on the cluster and is launched.

Kubernetes Cluster Creation: Traditional Model

The greater the number of steps, the higher the likelihood of things going wrong in the middle if not properly automated, especially at scale.

Existing CD tools like ArgoCD or FluxCD focus primarily on solving the problem of deploying applications on a cluster. They do not get into the process of creating the cluster itself. As a result, operators build homegrown solutions to address this problem.

Kubernetes Cluster Creation: ArgoCD and ClusterAPI

Introducing Arlon – Your Assembly Line To Manage Complete Clusters

We created Arlon to solve the above problem, using standard open source components for cluster lifecycle automation and continuous delivery.

Arlon consists of three key components:

Declarative Cluster Lifecycle – This is handled using Cluster API
Continuous Delivery – This is handled using ArgoCD
Continuous Reconciliation – This is handled by Arlon

Essentially, Arlon lets you leverage current open source standards for declarative cluster lifecycle management and continuous delivery, and in turn, gives you a complete cluster creation assembly line.

What Arlon Does

Arlon introduces the following concepts:

A Cluster spec: A Cluster spec is a description of the cluster infrastructure and external settings of a cluster. Kubernetes version, cloud provider, cluster type, and node instance type are some examples of properties that are captured in a Cluster spec.
A Profile: A Profile is a grouping of configuration bundles that will be installed into the cluster
A Configuration bundle: A Configuration bundle is a unit of configuration that contains, or references, one or more Kubernetes manifests. A bundle can encapsulate anything that can be deployed onto a cluster: an RBAC ruleset, an infrastructure add-on such as Prometheus, an application, etc.

How Arlon Works

Arlon is composed of a controller, a library, and a CLI that exposes the library’s functions as commands. In the future, we have plans to build an API server that uses the Arlon library behind the scenes.

Arlon requires a “management cluster” that will have all the required tooling, along with Arlon custom resources installed.

Sample Arlon profile:

apiVersion: core.arlon.io/v1
kind: Profile
metadata:
  creationTimestamp: "2022-05-13T17:23:05Z"
  generation: 16
  name: test-profile-4
  namespace: arlon
  resourceVersion: "279075871"
  uid: 187361ac-d1a6-4da2-b993-581db8459f09
spec:
  bundles:
  - calico
  - capi-cluster-autoscaler
  - monitoring
  - guestbook-dyn
  description: sample profile number 4
  overrides:
  - bundle: monitoring
    key: foo
    value: bar
  - bundle: monitoring
    key: retentionTime
    value: 3d
  - bundle: monitoring
    key: grafanaUiSvcType
    value: ClusterIP
  repoPath: profiles/test-profile-4
  repoRevision: main
  repoUrl: https://github.com/bcle/fleet-infra.git
  tags:
  - foo
  - bar
  - goo

Next Steps. Get Involved!

This recorded video provides a detailed overview of how Arlon works, with examples:

The Arlon github repository is the best place to ask questions, share comments, and collaborate! The complete documentation is available on the github repo and here:

https://github.com/arlonproj/arlon/tree/main/docs

https://arlon.readthedocs.io/en/latest

We would love to hear your feedback on Arlon, what features you’d like to see added, and what use cases you’d like to see supported.

Author

Madhura Maskasky

Before co-founding Platform9, Madhura spent 7 years at VMware Engineering, where she grew to be technical lead for several key products including vCloud Director, Update Manager and Config Control. Madhura helped spearhead vSphere’s transformation into a policy driven product suite, working broadly across groups to pull together the vision. Madhura holds a M.S. in Computer Science from Stanford University, specializing in databases and distributed systems.
View all posts