AI/ML - MinIO Blog (Page 2)

Repatriating AI Workloads: An On-Prem Answer to Soaring Cloud Costs

Brenna Buuck on AI/ML | 8 November 2024

As AI workloads drive cloud costs through the roof, many companies are rethinking their approach. Moving select AI tasks back on-prem offers a path to predictable costs, improved performance, and stronger data control.

The Architect’s Guide to Interoperability in the AI Data Stack

Brenna Buuck on AI/ML | 7 November 2024

Interoperability is the key to building a flexible, future-ready AI data stack. As proprietary systems lock down innovation and drive up costs, open tools like S3-compatible storage and multi-format table systems offer the freedom to scale and adapt.

A Sneak Peek: The MinIO Object Storage and AI Survey

Jonathan Symonds on Research | 7 November 2024

MinIO recently surveyed 656 IT leaders as part of a primary research initiative with User Evidence. The results were very interesting and underscore the massive sea change we are seeing in the enterprise, both around the movement to object storage and the interest in using object storage as the primary building block for an organization’s AI initiatives. We will

AI/ML workflows with AIStor and Metaflow

AJ on AI/ML | 6 November 2024

AIStor and Metaflow can be deployed anywhere, on any type of infrastructure. In fact, more often than not, folks deploy AIStor and Metaflow alongside each other as part of the deployment pipeline.

Map-Style Datasets using Amazon’s S3 Connector for PyTorch and MinIO

Keith Pijanowski on AI/ML | 31 October 2024

Before diving into Amazon’s S3 Connector for PyTorch, it is worthwhile to introduce the problem it is intended to solve. Many AI models need to be trained on data that cannot fit into memory. Furthermore, many really interesting models being built for computer vision and generative AI use data that cannot even fit on the disk drive that comes

An Easier Path to Scalable AI: Intel Tiber Developer Cloud + MinIO Object Store

Keith Pijanowski on AI/ML | 24 October 2024

One of the biggest challenges facing organizations today for AI and data management is access to reliable infrastructure and compute resources. The Intel Tiber Developer Cloud is purpose-built for engineers who need an environment for proof-of-concepts, experimentation, model training, and service deployments. Unlike other clouds, which can be unapproachable and complex, the Intel Tiber Developer Cloud is simple and easy

AI Data Workflows with Kafka and MinIO

AJ on Apache Kafka | 23 October 2024

AIStor is a foundational component for creating and executing complex data workflows. At the core of this event-driven functionality is MinIO bucket notifications using Kafka.

Tame the AI beast with Monitoring and Alerting

AJ on AI/ML | 9 October 2024

In this post, we’ll show you how to visualize cluster metrics in a web browser. We’ll also set up alerting so that we are notified when, for example, a drive needs to be replaced or runs out of space.

Hiring for AI Success: Why Your First Hire Should Be a Data Engineer

Brenna Buuck on AI/ML | 4 October 2024

To ensure AI success, start by hiring a data engineer, not an AI/ML expert. Learn from our experience and find out why a strong data foundation—focused on object storage, data lakehouses, and optimized pipelines—is critical for scalable, efficient AI/ML workloads.

Git-like versioning for your AI Data

AJ on AI/ML | 2 October 2024

You’ve surely version controlled code in the past. But have you version controlled your data? Did you ever want to collaborate on large sets of data with various teams without committing a large chunk?

Open Source or Closed? The AI Dilemma

Keith Pijanowski on AI/ML | 26 August 2024

This post first appeared on The New Stack on July 29th, 2024. Artificial Intelligence is in the middle of a perfect storm in the software industry, and now Mark Zuckerberg is calling for open-sourced AI. Three powerful perspectives are colliding on how to control AI: 1. All AI should be open-source for sharing and transparency. 2. Keep AI closed-source and

Spelunk through your AI data infrastructure with Splunk

AJ on AI/ML | 7 August 2024

In this post we explain how to use Splunk's advanced log analytics to help understand the performance of AIStor and the data under management.

The MinIO DataPod: A Reference Architecture for Exascale

Rakshith Venkatesh on AI/ML | 1 August 2024

The modern enterprise defines itself by its data. This requires a data infrastructure for AI/ML as well as a data infrastructure that is the foundation for a Modern Datalake capable of supporting business intelligence, data analytics, and data science. This is true whether an enterprise is behind, getting started, or using AI for advanced insights. For the foreseeable future, this

Breaking down Insight Partners State of Enterprise Tech 2024 Report

Jonathan Symonds on Cloud Operating Model | 31 July 2024

The team at Insight Partners just released their State of Enterprise Tech report for 2024. There is a lot to consume in the 60+ slides, but we cherry-picked the things that should be interesting to our audience, and frankly there is a lot of interesting stuff. I will leave the survey methodology for you to consume, but

Build a Distributed Embedding Subsystem with MinIO, Langchain, and Ray Data

Keith Pijanowski on AI/ML | 29 July 2024

An embedding subsystem is one of four subsystems needed to implement Retrieval Augmented Generation. It turns your custom corpus into a database of vectors that can be searched for semantic meaning. The other subsystems are the data pipeline for creating your custom corpus, the retriever for querying the vector database to add more context to a user query, and finally,

Bringing ARM into the AI Data Infrastructure Fold at MinIO Using SVE

Frank Wessels on AI/ML | 22 July 2024

One of the reasons that MinIO is so performant is that we do the granular work that others will not or cannot. From SIMD acceleration to AVX-512 optimizations, we have done the hard stuff. Recent developments for the ARM CPU architecture, in particular Scalable Vector Extensions (SVE), presented us with the opportunity to deliver significant performance and efficiency gains

Data-Centric AI with Snorkel and MinIO

Keith Pijanowski on AI/ML | 10 July 2024

With all the talk in the industry today regarding large language models with their encoders, decoders, multi-headed attention layers, and billions (soon trillions) of parameters, it is tempting to believe that good AI is the result of model design only. Unfortunately, this is not the case. Good AI requires more than a well-designed model. It also requires properly constructed training

The Architect's Guide to Machine Learning Operations (MLOps)

Keith Pijanowski on AI/ML | 28 June 2024

MLOps, short for Machine Learning Operations, is a set of practices and tools aimed at addressing the specific needs of engineers building models and moving them into production. Some organizations start off with a few homegrown tools that version datasets after each experiment and checkpoint models after every epoch of training. On the other hand, many organizations have chosen to

Migrate to AI-Ready infrastructure: Hitachi Content Platform to MinIO

Brenna Buuck on AI/ML | 26 June 2024

Migrate from Hitachi Content Platform (HCP) to MinIO using the HCP-to-MinIO tool. Migration is a no-brainer given how MinIO offers modern, scalable, high-performance storage optimized for AI.

Earn your RAG-ing rights with MinIO

Dileeshvar Radhakrishnan, AJ on AI/ML | 26 June 2024

In this blog post, we will demonstrate how to use MinIO to build a Retrieval Augmented Generation (RAG)-based chat application on commodity hardware.