Deploy MinIO cloud storage to Mesosphere DC/OS

Nitish Tiwari Nitish Tiwari on Docker |

Container orchestration is gaining traction as the default way to deploy applications. Developers are architecting their modern applications from the ground-up to run in containers, which enables faster deployment and more resilience. Even legacy applications are adopting containers in every way they can to access these advantages.

Of the many characteristics that make an application container ready, the way it handles unstructured data is one of the most important. Back in the day, the default way to handle unstructured data was to dump all of it onto the server’s file system, but using the host filesystem doesn’t make any sense for containerized apps. This is because in an orchestrated environment, a container can be scheduled — or rescheduled — on any of the hosts in a cluster, but data written to a previous host can not be rescheduled with that container.

The best solution is to use a cloud storage system with an easy to use, widely accepted API to handle storage. AWS S3 is an option, but what if you’d rather be in control of your application data? What if you want to run unstructured data storage in the cloud with your infrastructure, or run a cost effective solution on premises and still use S3 as a protocol for data transfer? There is a gap between no cloud storage and completely managed object storage, that MinIO cloud storage server strives to fill.

MinIO is a cloud native storage server that provides an open source alternative to AWS S3.

What is cloud native?

Cloud native applications are designed to take advantage of the fluid nature of resources in a cluster. A cloud native application doesn’t need resource management that will eventually compete with a cluster’s orchestration layer; it should rely on the orchestration layer to run applications wherever resources are allocated.

As a true cloud native application, MinIO focuses on storage and does that very well. It leaves out the resource management responsibility to orchestration platforms like Mesosphere DC/OS (datacenter operating system). This allows MinIO to scale very well as compared to applications with their own resource management mechanisms.

DC/OS allows containerized applications to scale in a sustainable manner by running several isolated instances of the application. Take for example, a HTTP server, which can be easily containerized due to its stateless nature. With Docker containers and DC/OS you can scale your HTTP serving capacity by adding as many instances as required to handle extra load.

In a cloud native environment, scalability is not a function of the application but the orchestration.

MinIO is designed to scale in a similar manner. Each of your DC/OS cluster tenants can have their own isolated MinIO server instance backed by the storage required for that tenant. This way, you can accommodate new tenants and storage requirements, by adding a new MinIO instance for a new tenant.

Not only scale, this design helps keep the failure domain limited.

The complexity of the first MinIO instance is no different than the millionth MinIO instance.

Remember, an application doesn’t automatically become cloud native when running in a container or on an orchestration platform. Design makes an application cloud native!

Deploy MinIO on Mesosphere DC/OS

Deploying an application on DC/OS is simple; you can use a Universe package, or create a customized config file. We at MinIO recently released an official universe package to enable single click MinIO deployment on a DC/OS cluster.

In the rest of this post, I explain the process of deploying a MinIO stand alone server on DC/OS with our new universe package and discuss how to scale this setup for a multi-tenant environment.

Prerequisites

To get started, you’ll need a cluster with DC/OS 1.8 or later running. You’ll also need Marathon-LB installed. Note the IP address of the public agent(s) where Marathon-LB is running; you will need it later to locate the load balancer. Alternately, you could configure a hostname to point to the public agent(s) where Marathon-LB is running.

You can use either the DC/OS UI or the command line interface to install the MinIO package.

MinIO package via DC/OS GUI

Visit the DC/OS admin page, and click on “Universe” on the left menu bar. Then click on the “Packages” tab and search for MinIO. Once you see the package, click the “Install” button on the right hand side.

Next, you’ll need to enter configuration values like the storage and service type you’d like to use with your MinIO instance. Finally enter the public Marathon-LB IP address under “networking >> public-agent”, and click “Review and Install”.

This completes the install process. You’ll now need to get the access key and secret key from the MinIO container logs. Click on “Services” and select MinIO service in DC/OS admin page. Then go to the “logs” tab and copy the accesskey and secretkey.

You can connect with the MinIO instance via either the web browser or MinIO mc.

MinIO package via DC/OS CLI

To install MinIO package via CLI, type:

$ dcos package install minio

Rest of the process remains largely same as the above GUI based install process.

The DC/OS CLI also provides options to install customized packages via the dcos install command. Refer to the CLI reference doc for more details.

MinIO Server modes

MinIO supports different modes, other than the default mode which we deployed above. These can come in handy based on your requirements. You can easily create deployments based on these MinIO modes via a custom config script.

  • MinIO erasure coded mode: MinIO server, when launched with at least four drives, automatically goes to the erasure coded mode. This protects data against hardware failures and silent data corruption using erasure code and checksums. In this mode you could lose half of your drives and still be able to recover your data.
  • MinIO distributed mode: Distributed mode allows you to run several (min4 and max 16) nodes as one single storage server.
  • MinIO shared backend mode: MinIO shared-backend mode provides an option to run multiple MinIO instances, supported by the same storage backend like NAS, with a load balancer like Marathon-LB running in front to distribute the load evenly. The writes to the backend are synchronized.

While you’re at it, help us understand your use case and how we can help you better! Fill out our best of MinIO deployment form (takes less than a minute), and get a chance to be featured on the MinIO website and showcase your MinIO private cloud design to MinIO community.