MinIO Blog (Page 5)

The most powerful S3 API ever? Introducing the Prompt API.

Dileeshvar Radhakrishnan , Jonathan Symonds Jonathan Symonds on AIStor | 13 November 2024

The most powerful S3 API ever? Introducing the Prompt API.

The object storage world to date has been defined by the S3 API concepts of PUT and GET. The world in which we live now, however, requires more. Given that MinIO has more S3 deployments than even Amazon, it fell to us to come up with the next great S3 API. That new API is the Prompt API and it

MinIO’s S3 over RDMA Initiative: Setting New Standards in Object Storage for High-Speed AI Data Infrastructure

Rakshith Venkatesh Rakshith Venkatesh on AIStor | 13 November 2024

As the demands of AI and machine learning continue to accelerate, data center networking is evolving rapidly to keep pace. For many enterprises, 400GbE and even 800GbE are becoming standard choices, driven by the need for high-speed, low-latency data transfer for AI workloads that are both data-intensive and time-sensitive. AI models for tasks like large language processing, real-time analytics, and

AIHub: A Private Cloud Repository for Models and Datasets

Keith Pijanowski Keith Pijanowski on AIStor | 13 November 2024

One of the newest features of the AIStor is a private cloud version of the highly popular, open-source project, Hugging Face. This post details how AIStor’s AIHub effectively creates an API compatible, private cloud version of Hugging Face that is fully under the enterprise's control. Before we get started, it makes sense to introduce Hugging Face. Hugging

A Redesigned Global Console for AI-Centric Workloads

AJ AJ on AIStor | 13 November 2024

The MinIO Console has been an evolving product for several years now. Every time we learn, we think about how to improve this incredibly important part of our interaction framework. First came the Console, which saw massive adoption within a year of its introduction. More than 10K organizations to be more specific.Next came the enterprise Console. That moved from

Repatriating AI Workloads: An On-Prem Answer to Soaring Cloud Costs

Brenna Buuck

Brenna Buuck on AI/ML | 8 November 2024

Repatriating AI Workloads: An On-Prem Answer to Soaring Cloud Costs

As AI workloads drive cloud costs through the roof, many companies are rethinking their approach. Moving select AI tasks back on-prem offers a path to predictable costs, improved performance, and stronger data control.

The Architect’s Guide to Interoperability in the AI Data Stack

Brenna Buuck

Brenna Buuck on AI/ML | 7 November 2024

The Architect’s Guide to Interoperability in the AI Data Stack

Interoperability is the key to building a flexible, future-ready AI data stack. As proprietary systems lock down innovation and drive up costs, open tools like S3-compatible storage and multi-format table systems offer the freedom to scale and adapt.

A Sneak Peak: The MinIO Object Storage and AI Survey

Jonathan Symonds Jonathan Symonds on Research | 7 November 2024

MinIO recently surveyed 656 IT leaders as part of a primary research initiative with User Evidence. The results were very interesting and underscore the massive sea change we are seeing in the enterprise, both around the movement to object storage and the interest in using object storage as the primary building block for an organization’s AI initiatives. We will

AI/ML workflows with AIStor and Metaflow

AJ AJ on AI/ML | 6 November 2024

AIStor and Metaflow can be deployed anywhere on any type of infrastructure. In fact more often than not we have folks deploying AIStor and metaflow alongside each other as part of the deployment pipeline.

Map-Style Datasets using Amazon’s S3 Connector for PyTorch and MinIO

Keith Pijanowski Keith Pijanowski on AI/ML | 31 October 2024

Before diving into Amazon’s S3 Connector for PyTorch, it is worthwhile to introduce the problem it is intended to solve. Many AI models need to be trained on data that cannot fit into memory. Furthermore, many really interesting models being built for computer vision and generative AI use data that cannot even fit on the disk drive that comes

Managing AI workloads with Tagging and Policies

AJ AJ on Modern Data Lakes | 30 October 2024

Object tags give you greater power. You now have the ability to categorize by up to ten dimensions. If you want to add the diagram to a project, then all you have to do is tag it appropriately.

An Easier Path to Scalable AI: Intel Tiber Developer Cloud + MinIO Object Store

Keith Pijanowski Keith Pijanowski on AI/ML | 24 October 2024

One of the biggest challenges facing organizations today for AI and data management is access to reliable infrastructure and compute resources. The Intel Tiber Developer Cloud is purpose-built for engineers who need an environment for proof-of-concepts, experimentation, model training, and service deployments. Unlike other clouds, which can be unapproachable and complex, the Intel Tiber Developer Cloud is simple and easy

Replication, Data Consolidation, and Data Migration

Keith Pijanowski Keith Pijanowski on Case Study | 24 October 2024

Parsec Labs is a company of engineers. Most have designed storage systems, been responsible for backups and replication, or worked in networking building switches. Founded in 2013, their Unified Data Mobility and Protection Appliance provides the most straightforward tools for migrating, replicating, and backing up data at scale. A Common Request As a one-time pre-sales engineer, Mark Clark, CEO of

AI Data Workflows with Kafka and MinIO

AJ AJ on Apache Kafka | 23 October 2024

AIStor is a foundational component for creating and executing complex data workflows. At the core of this event-driven functionality is MinIO bucket notifications using Kafka.

Comparing Software Defined Storage with Appliances

Jonathan Symonds Jonathan Symonds on Software Defined Storage | 21 October 2024

Everybody claims to be a software company these days. From the nearly decade old pronouncement by Marc Andressen that “Software Is Eating the World” to the push from Wall Street to produce recurring software revenue; the pressure is on to claim - at least - that you are a software company. This is obviously problematic for appliance vendors. Try as

Working with Small Objects in AI/ML workloads

AJ AJ on Operator's Guide | 16 October 2024

Cloud-native AI/ML workloads push storage to the limit with many small files. AIStor combines metadata and data to optimize small file operations.

Architect’s Guide to Migrating from Hadoop to a Data Lakehouse

Brenna Buuck

Brenna Buuck on Data Lakehouses | 15 October 2024

Architect’s Guide to Migrating from Hadoop to a Data Lakehouse

Unlock real-time analytics, scalability, and future-proof your data with a lakehouse. Hadoop can't keep up with AI, but a lakehouse blends the best of data lakes and warehouses. Get actionable tips to smoothly phase your migration with minimal disruption.

Cloud as an Operating Model - Not a Physical Location

Jonathan Symonds Jonathan Symonds on Cloud Operating Model | 10 October 2024

We have said it before, but it bears repeating. The cloud is an operating model - not a physical location. That is why you will find MinIO everywhere on the public cloud, on the private cloud, at the edge. We don’t differentiate and because we are cloud native we are cloud (location) agnostic. The public cloud has mindshare and

Tame the AI beast with Monitoring and Alerting

AJ AJ on AI/ML | 9 October 2024

In this post we’ll show you how you visualize the cluster metrics in a web browser and also we’ll set up alerting so that when something like a drive needs to be replaced or drive runs out of space we can get alerted for it.

Hiring for AI Success: Why Your First Hire Should Be a Data Engineer

Brenna Buuck

Brenna Buuck on AI/ML | 4 October 2024

Hiring for AI Success: Why Your First Hire Should Be a Data Engineer

To ensure AI success, start by hiring a data engineer, not an AI/ML expert. Learn from our experience and find out why a strong data foundation—focused on object storage, data lakehouses, and optimized pipelines—is critical for scalable, efficient AI/ML workloads.

Open Source Begets Open Source: How Tesla is Accelerating AI

Brenna Buuck

Brenna Buuck on Open Source | 2 October 2024

Open Source Begets Open Source: How Tesla is Accelerating AI

Tesla just open-sourced its Tesla Transport Protocol over Ethernet (TTPoE). Like Tesla’s earlier move to share EV patents, this opens the door to faster innovation through open-source contributions, proving once again that transparency drives progress.