Percona Streaming Backup

Percona Streaming Backup

What is streaming mode? Essentially it allows you to backup with Percona xtraBackup without touching disk. When used alongside MinIO Jumbo, it is designed to upload and retrieve large objects from the MinIO cluster.

Read more...

Creating an ML Scenario in SAP Data Intelligence Cloud to Read and Model Data in MinIO

Creating an ML Scenario in SAP Data Intelligence Cloud to Read and Model Data in MinIO

Enterprise customers use MinIO to build data lakehouses to store a wide variety of structured and unstructured data, and work with it using ML and analytics. Data flows into MinIO from across the enterprise and the S3 API allows applications, such as analytics and AI/ML to work with it.   I previously blogged about building data pipelines with SAP Data

Read more...

Myths about Deduplication and Compression

Myths about Deduplication and Compression

A lot of vendors offer compression and deduplication as a feature of their storage solutions. We critically evaluate these claims by looking at the two methods used to reduce the raw disk usage. Compression Overview Compression in overall terms, gets its benefit from reducing redundancy in the data you upload. This means that repeated or otherwise predictable data can be

Read more...

OpenSearchCon 2023 Talk

OpenSearchCon 2023 Talk

Apparently, the OpenSearch folks love MinIO as much as we love OpenSearch, the talk got accepted and we were invited to present it in Seattle. It was very cold in the Northwest but we were very excited nonetheless.

Read more...

Object Detection Made Simple with MinIO and YOLO

Object Detection Made Simple with MinIO and YOLO

Tl;dr: In this post, we will create a custom image dataset and then train a You-Only-Look-Once (YOLO) model for the ubiquitous task of object detection. We will then implement a system using MinIO Bucket Notifications that can automatically perform inference on a new image. Introduction: Computer vision remains an extremely compelling application of artificial intelligence. Whether it’s recognizing

Read more...

The Disruptive Nature of Data Lakehouses

The Disruptive Nature of Data Lakehouses

Introduction In 1997, Clayton Christensen, in his book The Innovator’s Dilemma, identified a pattern of innovation that tracked the capabilities, cost, and adoption by market segment between an incumbent and a new entrant. He labeled this pattern “Disruptive Innovation.” Not every successful product is disruptive - even if it causes well-established businesses to lose market share or even fail

Read more...