This post first appeared in The New Stack.
Developers gravitate to technologies that are software defined, open source, cloud native and simple. That essentially defines object storage.
Introduction
Choosing the best storage for all phases of a machine learning (ML) project is critical. Research engineers need to create multiple versions of datasets and experiment with different model architectures. When a
Read more
Most developers, engineers, architects and DevOps folks know MinIO. Not all of them know that the only thing we do is software-defined object storage. We don’t do file or block, and we don’t offer a hosted service; MinIO is self-hosted.
Our focus is singular.
The result is that our object store is objectively, based on adoption, awards and customer feedback, the best
Read more
Apache Kafka and Apache Spark are two leading technologies used to build the streaming data pipelines that feed data lakes and lake houses. At a high level, Kafka streams messages to Spark, where they are transformed into a format that applications can read and then saved to storage.
Read more
Build data pipelines with S3 to MinIO and MinIO to MinIO batch replication.
Read more
Encryption is an important part of the MinIO architecture. MinIO applies encryption to ensure objects are secure at rest and are compliant with regulations.
Read more
Engineers like to play and learn locally. It does not matter which tool is under investigation: a high-end storage solution, a workflow orchestration engine, or the latest thing in distributed computing. The best way to learn a new technology is to find a way to cram it all on a single machine so that you can put your hands on
Read more
InfluxDB is built on the same ethos as MinIO. It is a single Go binary, cloud agnostic and lightweight, yet feature-packed with capabilities like replication and encryption, and it provides integrations with various applications.
Read more
88 GB/s writes in a 2U form factor for on-prem, colo and edge object storage.
Read more
Kubeflow Pipelines (KFP) is the most popular feature of Kubeflow. A Python engineer can turn a function written in plain old Python into a component that runs in Kubernetes using the KFP decorators. If you used KFP v1, be warned: the programming model in KFP v2 is very different, but it is a big improvement. Transforming plain old
Read more
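To make the decorator idea concrete, here is a toy Python sketch of the pattern KFP's decorators follow: wrap a plain function with metadata so a pipeline engine can treat it as a step. The `component` decorator below is a hypothetical stand-in for illustration only, not the real `kfp.dsl.component` API.

```python
# Conceptual mimic of a pipeline-component decorator: tag a plain Python
# function with metadata that a pipeline runner could inspect.
# `component` is a toy stand-in, NOT the real kfp.dsl API.
import functools

def component(func):
    """Mark a plain function as a pipeline component (toy version)."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        return func(*args, **kwargs)
    wrapper.is_component = True  # metadata a runner could look for
    return wrapper

@component
def normalize(values: list[float]) -> list[float]:
    """A plain old Python function, now usable as a pipeline step."""
    total = sum(values)
    return [v / total for v in values]

assert normalize.is_component
assert normalize([1.0, 1.0, 2.0]) == [0.25, 0.25, 0.5]
```

In real KFP v2, the decorated function additionally gets packaged with its dependencies so it can run inside a container on Kubernetes; the calling convention above is only the surface of that.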
Kafka and Spark Structured Streaming are used together to build data lakes and lake houses fed by streaming data, providing real-time business insights.
Read more
Make your Kafka topics performant and efficient with the Kafka Schema Registry.
Read more
HDD failure rates create big complications for RAID arrays. Find out why erasure coding is a better option for data durability.
Read more
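The core idea behind erasure coding can be shown with a toy single-parity scheme in pure Python: XOR the data shards into a parity shard, and any one lost shard can be rebuilt from the survivors. This is an illustrative sketch only; MinIO itself uses Reed-Solomon erasure coding, which tolerates multiple simultaneous drive failures.

```python
# Toy single-parity erasure code: any ONE lost shard can be rebuilt by
# XOR-ing the survivors. Real systems like MinIO use Reed-Solomon codes,
# which survive several losses at once; this only demonstrates the idea.

def encode(shards: list[bytes]) -> bytes:
    """Compute a parity shard as the byte-wise XOR of all shards."""
    parity = bytearray(len(shards[0]))
    for shard in shards:
        for i, b in enumerate(shard):
            parity[i] ^= b
    return bytes(parity)

def reconstruct(survivors: list[bytes]) -> bytes:
    """Rebuild the single missing shard from remaining shards + parity."""
    return encode(survivors)  # XOR is its own inverse

data = [b"0123", b"4567", b"89ab"]  # three equal-size data shards
parity = encode(data)

# Simulate losing the middle shard, then recover it from the rest.
recovered = reconstruct([data[0], data[2], parity])
assert recovered == data[1]
```

Unlike RAID mirroring, the storage overhead here is one parity shard per stripe rather than a full copy, which is why erasure coding scales better for durability per terabyte.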
Managing users, groups, and policies for security and functionality with MinIO.
Read more
MinIO licensees gain access to SUBNET features like long-term support and security policy reviews.
Read more
MinIO has added support for FTP and SFTP into the MinIO Server.
Read more
What is ArgoCD? In short, it's a GitOps continuous deployment tool that stores the desired state of the infrastructure in a Git repository and automates deployment by tracking the differences between the existing and desired configurations.
Read more
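The comparison at the heart of GitOps can be sketched in a few lines of Python: diff the live state against the desired state stored in Git, then report what must change. The function and field names below are hypothetical for illustration; they are not ArgoCD's actual API.

```python
# Illustrative GitOps-style reconciliation: compare live state against the
# desired state and produce a plan. Names here are invented for the sketch,
# not taken from ArgoCD.

def diff_state(live: dict, desired: dict) -> dict:
    """Return the changes needed to move `live` toward `desired`."""
    return {
        "create": sorted(k for k in desired if k not in live),
        "delete": sorted(k for k in live if k not in desired),
        "update": sorted(k for k in desired
                         if k in live and live[k] != desired[k]),
    }

live = {"web": {"replicas": 2}, "worker": {"replicas": 1}}
desired = {"web": {"replicas": 3}, "cache": {"replicas": 1}}

plan = diff_state(live, desired)
assert plan == {"create": ["cache"], "delete": ["worker"], "update": ["web"]}
```

A tool like ArgoCD runs this kind of comparison continuously, treating the Git repository as the source of truth and applying the resulting plan to the cluster.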
I wanted to share my thoughts on the semi-annual confab that is Kubecon, this one the European edition. These are fairly candid takes; I can be critical or complimentary, but given how important this space is to us, it is worthy of analysis.
Let’s get one thing out of the way. This was a superb Kubecon. The location was
Read more
Apache Kafka is an open-source distributed event streaming platform that is used for building real-time data pipelines and streaming applications. It was originally developed by LinkedIn and is now maintained by the Apache Software Foundation. Kafka is designed to handle high volume, high throughput, and low latency data streams, making it a popular choice for building scalable and reliable data
Read more