AI/ML - MinIO Blog

OpenAI Open Models: A Gamechanger for Enterprise AI

Garima Kapoor Garima Kapoor , Anand Babu Periasamy Anand Babu Periasamy on AI/ML | 7 August 2025

OpenAI’s move this week to release two new open-weight AI models (gpt-oss-120b and gpt-oss-20b) just changed Enterprise Data Infrastructure forever. The news has rightfully made ripples across the tech ecosystem. Why? * Because these models are released under the Apache 2.0 license, users can for the first time in 5 years, run OpenAI models right on their own devices.

Turbocharged Storage: MinIO, KIOXIA, and AMD team up to take on AI

Michael Williams Michael Williams on Partners | 16 July 2025

MinIO, the leader in high-performance AI storage, has once again raised the bar in the AI infrastructure industry with its groundbreaking MinIO AIStor platform. Leveraging next-generation AMD hardware, KIOXIA NVMe™ SSDs, and cutting-edge software optimizations, MinIO AIStor delivers unmatched performance, scalability, and efficiency for AI-driven and other data intensive workloads. Today, we are excited to share benchmark results that demonstrate

Model Context Protocol (MCP) Server for AIStor: How it works

Pavel Anni Pavel Anni on AI Agents | 30 April 2025

In the previous blog posts of this series, we discussed the user-level and admin-level functions of the Model Context Protocol (MCP) server for MinIO AIStor. In the first blog, we learned how to review the bucket’s contents, analyze objects, and tag them for future processing. In the second blog, we also learned how to use admin commands and get

Model Context Protocol (MCP) Server for AIStor: administration functions

Pavel Anni Pavel Anni on AI/ML | 9 April 2025

In the previous blog of this series, we discussed the basic user-level functions of the Model Context Protocol (MCP) server for MinIO AIStor. We learned how to review a bucket’s contents, analyze objects, and tag them for future processing using human-language commands and simply chatting with the cluster via an LLM such as Anthropic Claude. In this blog, we’

Introducing Model Context Protocol (MCP) Server for MinIO AIStor

Pavel Anni Pavel Anni on AI/ML | 28 March 2025

GenAI is entering the agentic phase, with software agents collaborating with humans and other agents to reason and achieve complex goals. Agents are already demonstrating incredible intelligence and are very helpful with question answering, but as with humans, they need the ability to discover and access software applications and other services to actually perform useful work. The creators of such

Deepseek-style Reinforcement Learning Against Object Store

Sidharth Rajaram

Sidharth Rajaram @sidharrrrrth on AI/ML | 20 March 2025

Deepseek-style Reinforcement Learning Against Object Store

Tl;dr: We train a small LLM to become good at reasoning with reinforcement learning (similar to the process that led to Deepseek R1) all against AIStor AIHub, an on-premises model repository. Based on the great GRPO demo by will brown. Motivation: A growing requirement for teams is the need for an organized, secure, "single source of truth"

AIStor Integration with NVIDIA NIM™

Dileeshvar Radhakrishnan

Dileeshvar Radhakrishnan on NVIDIA | 17 March 2025

Building upon AIStor's robust AI capabilities, MinIO's PromptObject has been enabling users to interact with their data through natural language queries as described here. PromptObject transforms how users interact with stored objects by allowing them to ask questions about their data's content and extract information using natural language—eliminating the need to write complex

Enterprise AI Infrastructure Made Easy with AIStor and NVIDIA GPUs

Dileeshvar Radhakrishnan

Dileeshvar Radhakrishnan on NVIDIA | 17 March 2025

Enterprise AI Infrastructure Made Easy with AIStor and NVIDIA GPUs

Modern enterprises seeking to leverage AI capabilities often face a significant hurdle: the complex deployment and management of GPU infrastructure in their Kubernetes environments. MinIO's AIStor addresses this challenge head-on by integrating the NVIDIA GPU Operator, revolutionizing how organizations deploy and manage GPU resources for AI workloads. Through automated GPU setup, driver management, and resource optimization, this integration

MLflow Model Registry and MinIO

Keith Pijanowski Keith Pijanowski on AI/ML | 14 March 2025

MLflow Model Registry allows you to manage models that are destined for a production environment. This post picks up where my last post on MLflow Tracking left off. In my Tracking post I showed how to log parameters, metrics, artifacts, and models. If you have not read it, then give it a read when you get a chance. In this

Deploying Models to Kubernetes with AIStor, MLflow and KServe

Keith Pijanowski Keith Pijanowski on AI/ML | 28 February 2025

In several previous posts on MLOps tooling, I showed how many popular MLOps tools track metrics associated with model training experiments. I also showed how they use MinIO to store the unstructured data that is a part of the model training pipeline. However, a good MLOps tool should do more than manage your experiments, datasets, and models. It should be

Unlocking AI/ML Performance with AMD + MinIO

Michael Williams Michael Williams on AI/ML | 21 February 2025

In the rapidly-evolving world of artificial intelligence (AI) and machine learning (ML), speed and scalability are paramount. The ability to process massive amounts of data in real-time is a critical requirement for organizations looking to leverage AI/ML for competitive advantage. Whether it's training large machine learning models, running complex inference tasks, or scaling data pipelines, the performance

The Architect’s Guide to Understanding Agentic AI

Keith Pijanowski Keith Pijanowski on AI/ML | 3 February 2025

This post first appeared on The New Stack on January 16th, 2025. Often, while accessing the legitimacy of a new technology receiving a lot of hype, studying existing core capabilities and history is helpful. If the new technology in question is not based on existing or imminent capabilities, we can label it as “hype” and move on. Another litmus test

Mitigating Geopolitical Concerns with a Sovereign Private Cloud

Jelte Eshuis Jelte Eshuis , Keith Pijanowski Keith Pijanowski on Private Cloud | 31 January 2025

2025 has inherited a slew of geopolitical concerns that started years ago. U.S. Foreign policy, U.S. - China Relations, China’s geopolitical maneuvers, Conflicts in the Middle East, Russian Ukraine war, and cybersecurity threats. Additionally, new leadership in the United States adds to the uncertainty created by these concerns. And, as if all this were not enough, the

Are We All DataOps Engineers Now? If So, How Can We Become Great at It?

Brenna Buuck

Brenna Buuck on AI/ML | 24 January 2025

Are We All DataOps Engineers Now? If So, How Can We Become Great at It?

The evolution of data roles never stops—first, we were all data scientists, then data engineers, and now, DataOps engineers. But is DataOps really new, or just a fresh take on the same mission: delivering business value through data?

Model Checkpointing using Amazon’s S3 Connector for PyTorch and MinIO

Keith Pijanowski Keith Pijanowski on AI/ML | 17 January 2025

In November of 2023, Amazon announced the S3 Connector for PyTorch. The Amazon S3 Connector for PyTorch provides implementations of PyTorch's dataset primitives (Datasets and DataLoaders) that are purpose-built for S3 object storage. It supports map-style datasets for random data access patterns and iterable-style datasets for streaming sequential data access patterns. The S3 Connector for PyTorch also includes

The Innovations from AWS re:Invent

Keith Pijanowski Keith Pijanowski on AI/ML | 31 December 2024

Earlier this month, Amazon held their re:Invent conference in Las Vegas, Nevada, from December 1st to 5th - a 5-day event. If you have never been to a re:Invent conference, then the word that describes it best is “huge” - not just in terms of the number of attendees (60,000) but also the breadth of topics covered.

Iterable-Style Datasets using Amazon’s S3 Connector for PyTorch and MinIO

Keith Pijanowski Keith Pijanowski on AI/ML | 23 December 2024

In November of 2023 Amazon announced the S3 Connector for PyTorch. The Amazon S3 Connector for PyTorch provides implementations of PyTorch's dataset primitives (Datasets and DataLoaders) that are purpose-built for S3 object storage. It supports map-style datasets for random data access patterns and iterable-style datasets for streaming sequential data access patterns. In a previous post, I introduced the

Exness: Managing petabytes of trading data with MinIO

Brenna Buuck

Brenna Buuck on AI/ML | 6 December 2024

Exness: Managing petabytes of trading data with MinIO

How does Exness handle massive data volumes and demanding AI/ML workloads? By moving to an on-prem infrastructure powered by MinIO. From scaling their data lake to managing traffic peaks of 200 Gbps, MinIO supports their AI workflows, disaster recovery, and more.

AI/ML’s Sous-Chef: Why your Second Hire should be a DevOps Engineer

AJ AJ on AI/ML | 4 December 2024

Your DevOps Engineer’s customer should be your AI/ML Engineering Team. The DevOps Engineer is there to ease the friction points in infrastructure so AI/ML folks can focus on the task at hand. Any issues that come with the infrastructure should be the responsibility of the DevOps Engineer.

GPU Trends and What It Means to Your AI Infrastructure

Keith Pijanowski Keith Pijanowski on AI/ML | 27 November 2024

Almost a year ago (actually 11 months ago), I wrote about the “Starving GPU Problem” and how the horsepower of Nvidia’s Graphic Processing Units (GPUs) could be so powerful that your network and your storage solution may not be able to keep up - preventing your expensive GPUs from being fully utilized. Well, in those short 11 months, a