MinIO Blog

Data Science and AI with a SQL Server 2022 Data Lakehouse

Matt Sarrel (@msarrel) on SQL

Microsoft SQL Server 2022 is one of the most commonly implemented enterprise relational databases. Many of the world's most successful companies, regardless of vertical, have significant SQL Server deployments. Thousands of companies have relied on SQL Server for decades. Microsoft has made great strides over the past decade in embracing open-source and standards-compliant technologies. The result is that

Scaling up MinIO Internal Connectivity

Klaus Post on Programming

A MinIO cluster operates as a uniform cluster, which means that any server must be able to handle any request seamlessly. As a consequence, servers need to coordinate among themselves. So far this has been handled with traditional HTTP RPC requests, and this has served us well. Whenever server A would like to call server B an HTTP request would

Airgapped MinIO Deployments

AJ on DevOps

In this post we’ll talk about what an airgapped network is, what to consider when deploying MinIO in such an environment, and how to replicate and scale it thereafter with other airgapped sites.

Two Things Can Be True at the Same Time

There is an interesting report out from McKinsey on the impending impact of AI on an enterprise’s cloud investments. Early in the piece, McKinsey states: “While the possible impact varies by sector, adopting cloud represents an opportunity for the average company to increase profitability by 20 to 30 percent.” To many, this would

Distributed Data Processing with Ray Data and MinIO

Distributed data processing is a key component of an efficient end-to-end distributed machine-learning training pipeline. This is true if you are building a basic neural network for statistical predictions where distributed training could mean each experiment runs in 10 minutes vs. an hour. It is also true if you are training or fine-tuning a Large Language Model (LLM) where

AI/ML Reproducibility with lakeFS and MinIO

MinIO on AI/ML

This post was written in collaboration with Amit Kesarwani from lakeFS. The reality of running multiple machine learning experiments is that managing them can become unpredictable and complicated - especially in a team environment. What often happens is that during the research process, teams constantly change configuration and data between experiments. For example, try several training sets and several hyperparameter

Event Notifications vs Object Lambda

AJ on Object Lambdas

As we were writing the blogs on Event Notifications and Object Lambda, we found ourselves asking why there are two different features that do almost the same thing. Or do they? What is the difference between the Greek Lambda and the Lightning Bolt?

How to Back Up with Restic and MinIO

Every system needs to be backed up because there are countless ways to lose local filesystem data and configurations. That loss can be devastating – potentially resulting in revenue loss, dissatisfied customers and even costly litigation. The statistics are pretty bleak – sixty percent of businesses that suffer a data loss event close within six months and ninety-three percent of companies that
