MinIO Blog (Page 15)

AI/ML Best Practices During a Gold Rush

Keith Pijanowski on AI/ML | 31 July 2023

Introduction The California Gold Rush started in 1848 and lasted until 1855. It is estimated that approximately 300,000 people migrated to California from other parts of the United States and abroad. Economic estimates suggest that, on average, only half made a modest profit. The other half either lost money or broke even. Very few gold seekers made a significant

Migrate from AWS S3 to MinIO on Equinix Metal

AJ on Hybrid Cloud | 26 July 2023

Between the public cloud and your data center lies a middle ground where you can have full control over infrastructure hardware without the high up-front investment.

How to Repatriate From AWS S3 to MinIO

Matt Sarrel @msarrel on Best Practices | 26 July 2023

If S3 costs are burning a hole in your pocket, then it's time to start thinking about running MinIO on-premises for your private cloud.

Solving the Hybrid-Cloud Challenge - UCE Systems and MinIO

Kris Inapurapu on Hybrid Cloud | 26 July 2023

MinIO has partners across the ecosystem - from our cloud partnerships with AWS, GCP, Azure and IBM to more solution-focused partnerships like Snowflake and Dremio. We are pleased to add UCE Systems to our roster of solutions-based partnerships. UCE is a leading consulting firm focused on modern data platforms (like the aforementioned Dremio). UCE has brought dozens of enterprises out

Parallel ML Experimentation leveraging MinIO & lakeFS

MinIO on AI/ML | 25 July 2023

Introduction This post was written in collaboration with Iddo Avneri from lakeFS. Managing the growing complexity of ML models and the ever-increasing volume of data has become a daunting challenge for ML practitioners. Efficient data management and data version control are now critical aspects of successful ML workflows. In this blog post, we delve into the power of parallel ML

Get Started with MinIO on Red Hat OpenShift for a PoC

AJ, Cesar Celis Hernandez on Red Hat OpenShift | 21 July 2023

When we announced the availability of MinIO on Red Hat OpenShift, we didn’t anticipate that demand would be so great that we would someday write a series of blog posts about this powerful combination. This combination is being rapidly adopted due to the ubiquitous nature of on-prem cloud and the need of large organizations wanting to bring their data

Setting up a Development Machine with MLflow and MinIO

Keith Pijanowski on AI/ML | 21 July 2023

About MLflow MLflow is an open-source platform designed to manage the complete machine learning lifecycle. Databricks created it as an internal project to address challenges faced in their own machine learning development and deployment processes. MLflow was later released as an open-source project in June 2018. As a tool for managing the complete lifecycle, MLflow contains the following components. * MLflow

Accelerating Database Backup and Restore with MinIO Jumbo

Aditya Manthramurthy, Matt Sarrel @msarrel on Benchmarks | 20 July 2023

MinIO Jumbo improved backup performance by 15x using parallel uploads.

Migrating from Hadoop to a Cloud-Ready Architecture for Data Analytics

Raghav Karnam on Dremio | 19 July 2023

This post was a collaboration between Kevin Lambrecht of UCE Systems and Raghav Karnam. The cloud operating model and specifically Kubernetes have become the standard for large-scale infrastructure today. More importantly, they are evolving at an exceptional pace with material impacts to data science, data analytics and AI/ML. This transition has a significant impact on the Hadoop ecosystem.

MinIO Fan-Out Feature for Time Shift Buffering

Satish Ramakrishnan on New MinIO Features | 19 July 2023

MinIO has developed into a core building block for the media and entertainment industry. With a customer roster that includes the leading cable company, the biggest streaming company and dozens of companies up and down the stack, we have added a number of different features in recent quarters. One of those is called the fan-out feature and it is

Optimizing AI Model Serving with MinIO and PyTorch Serve

Sidharth Rajaram @sidharrrrrth on AI/ML | 18 July 2023

Making the serving of your AI models more lightweight by leveraging the simplicity of MinIO’s object store. tl;dr MinIO object storage can be used as a ‘single source of truth’ for your machine learning models and, in turn, make serving with PyTorch Serve more efficient when managing changes to Large Language Models (LLMs). As always, sample code is

Leveraging Object Storage for Enterprise Legacy Data

AJ on DevOps | 14 July 2023

MinIO is built with speed and resiliency at the forefront, regardless of the type of environment you choose to run it on. Whether it's multicloud, bare metal, cloud instances or even on-premises, MinIO is designed to run on AWS, GCP, Azure, colocated bare metal servers and Kubernetes distributions such as Red Hat OpenShift. MinIO runs just as

Low Level Performance Testing for Object Storage

Matt Sarrel @msarrel on Operator's Guide | 11 July 2023

Learn how to troubleshoot object storage performance with low-level system component testing.

The New Math on Backup and Replication

Ugur Tigli on BC/DR | 11 July 2023

Backup has entered a brave new world where traditional solutions still have utility, but where the scale, speed of change and application landscape require radically different approaches. This post seeks to lay out the challenges of this new world, where the line of demarcation exists and how to think about architecting a data protection framework that

Enhance Large Language Models Leveraging RAG and MinIO on cnvrg.io

MinIO on AI/ML | 5 July 2023

This post was written in collaboration with Harinder Mashiana from cnvrg.io. Large language models (LLMs) have revolutionized the world of technology, offering powerful capabilities for text analysis, language translation, and chatbot interactions. The revolution will heavily impact businesses: according to OpenAI, approximately 80% of the U.S. workforce could have at least 10% of their work tasks affected by

Object Management for AI/ML

Keith Pijanowski on AI/ML | 29 June 2023

Introduction In a few previous posts on AI/ML, I mentioned that one of the benefits of MinIO is that you have tools for Versioning, Lifecycle Management, Object Locking, Object Retention and Legal Holds. These capabilities have a variety of uses. You may need a simple way to keep track of training experiments. You could also use these features to

Putting a Filesystem on Top of an Object Store is a Bad Idea. Here is why.

Dileeshvar Radhakrishnan, AJ on Object Storage | 27 June 2023

When purchasing storage, the emphasis is usually on media, but it may be even more important to consider access methods too. You will need to take storage protocols into account when designing and procuring infrastructure, especially when you leave legacy storage behind in order to migrate to cloud-native object storage. However, object storage relies on the S3 API for communications,

Building an ML Training Pipeline with MinIO and Kubeflow v2.0

Keith Pijanowski on AI/ML | 20 June 2023

Introduction In a previous post, I covered Building an ML Data Pipeline with MinIO and Kubeflow v2.0. The data pipeline I created downloaded US Census data to a dedicated instance of MinIO. This is different from the MinIO instance Kubeflow Pipelines (KFP) uses internally. We could have tried to use KFP’s instance of MinIO - however, this is

YouTube Summaries: Kubernetes and the MinIO Operator

Sasha Wodtke on YouTube Summaries | 19 June 2023

Our latest YouTube training series is all about the MinIO Operator, which brings native support for deploying and managing MinIO deployments (“MinIO Tenants”) on a Kubernetes cluster. MinIO’s Mike Johnson (aka MJ) brings us through the 10-part video series to set the foundation of understanding Kubernetes before focusing on installing and configuring the MinIO Operator for Kubernetes, which will

Fast and Efficient Search with OpenSearch and MinIO

AJ on DevOps | 16 June 2023

In this post we look at how search, and specifically OpenSearch, can help us identify patterns or see trends in our ever-growing data.