MinIO Blog

Apache Arrow and the Future of Data: Open Standards Propel AI

on Apache Arrow 24 April 2024

Apache Arrow is an open-source columnar memory format that is vital for modern datalakes because it makes data processing fast and seamless across systems. Arrow propels AI and analytics by improving interoperability and computational efficiency.

Improve RAG Performance with Open-Parse Intelligent Chunking

Keith Pijanowski on AI/ML 24 April 2024

If you are implementing a generative AI solution using Large Language Models (LLMs), you should consider a strategy that uses Retrieval-Augmented Generation (RAG) to build contextually aware prompts for your LLM. An important process that occurs in the preproduction pipeline of a RAG-enabled LLM is the chunking of document text so that only the most relevant sections of a document

Navigating the Waters: Building Production-Grade RAG Applications with Data Lakes

Sam Cooper on AI/ML 11 April 2024

In mid-2024, creating an AI demo that impresses and excites can be easy. Take a strong developer, some clever prompt experimentation, and a few API calls to a powerful foundation model, and you can often build a bespoke AI bot in an afternoon. Add in a library like LangChain or LlamaIndex to augment your LLM with a bit of custom

Control Cloud Data Costs with MinIO on Equinix

Michael Williams on Cloud Operating Model 11 April 2024

The public cloud changed the way companies build, deploy and manage their applications - mostly for the better. As you’re getting started, the public cloud supplies the infrastructure, services, enablement and maintenance to get you up and running quickly. It scales up and down almost without limit to provide you with the necessary resources no matter the

The Bank of the North - A Quick Case Study for HDFS Modernization

Jonathan Symonds on Case Study 9 April 2024

Stories matter and customer stories are the best. The ones where they delivered jaw-dropping stats or overcame massive obstacles are the ones that garner the best headlines. They are also the ones that are the hardest to get published. We know, because we are going to share a few with you that we are tirelessly working to get published -

Building Next-Gen Data Solutions: SingleStore, MinIO, and the Modern Datalake Stack

Brenna Buuck on Modern Data Lakes 9 April 2024

Explore the integration of SingleStore, a high-performance cloud-native database, with MinIO in the Modern Datalake Stack. This tutorial provides hands-on experience in data storage, processing, and querying, fostering experimentation and innovation in data management, analytics, and AI workloads.

Building and Deploying a MinIO-Powered LangChain Agent API with LangServe

David Cannan on AI/ML 9 April 2024

Explore the exciting possibilities of leveraging MinIO and LangChain to create a robust and efficient agent capable of handling complex data processing tasks.

The Architect’s Guide: A Modern Datalake Reference Architecture

Keith Pijanowski on Modern Data Lakes 5 April 2024

An abbreviated version of this post appeared on The New Stack on March 26th, 2024. Businesses aiming to maximize their data assets are adopting scalable, flexible, and unified data storage and analytics approaches. This trend is driven by enterprise architects tasked with crafting infrastructures that align with evolving business demands. A Modern Datalake architecture addresses this need by integrating the

Towards Exascale AI Data Infrastructure

Rakshith Venkatesh on Cloud Native 2 April 2024

It's been just over a week for me here at MinIO. The big takeaway from immersing myself in whiteboarding sessions, architecture reviews and customer calls is that the simplicity of the product is both its distinguishing feature and one of its most defining value drivers. This is particularly true at scale. The explosive growth in computing power due

The Full Stack AI Engineer: A Modern-Day Polymath

Keith Pijanowski on AI/ML 2 April 2024

Anyone who has worked in a team environment knows that every successful team has one go-to person—that special individual who can help you regardless of the nature of your problem. On a traditional software development team, this individual is an expert programmer and is also an expert in one other technology, which could be a database technology like Snowflake

MinIO Networking with Overlay Networks

David Cannan on DevOps 29 March 2024

Overlay networks enable seamless multi-host deployments for MinIO’s cloud-native S3-compatible storage solutions. Emphasizing security, scalability, and robust container networking, these technologies streamline complex cloud architectures.

Une dépêche de Kubecon Paris

Jonathan Symonds on Kubernetes 27 March 2024

Time for the annual KubeconEU review - it is unfiltered and occasionally unwelcome by the CNCF - but spoiler alert, Paris was a smashing success. We always love the people, we don’t always love the venue or show management, but Paris was a win and more importantly, Kubernetes is on a winning streak. First off, it just felt big.

Architect’s Guide to a Reference Architecture for an AI/ML Datalake

Keith Pijanowski on Architect's Guide 26 March 2024

An abbreviated version of this post appeared on The New Stack on March 19th, 2024. In enterprise artificial intelligence, there are two main types of models: discriminative and generative. Discriminative models are used to classify or predict data, while generative models are used to create new data. Even though Generative AI has dominated the news of late, organizations are still

Disaster Proof MinIO with GitOps

David Cannan on DevOps 19 March 2024

When disaster strikes, the power of GitOps shines, transforming potential chaos into a choreographed comeback. Learn how strategic automation, redundancy, and Docker and GitHub integration ensure swift recovery, turning system wipes into minor setbacks.

Unbundling the Data Stack: the Disaggregation of Storage and Compute 2.0

Brenna Buuck on Modern Data Lakes 15 March 2024

Discover the latest trend in databases: Disaggregation 2.0. Tomasz Tunguz's insightful post on LinkedIn explores how databases are evolving into high-speed query engines, shedding traditional storage constraints. Embrace flexible, performance-driven architectures.

Modern Datalakes with Hudi, MinIO, and HMS

Brenna Buuck on Open Source 14 March 2024

Unlock the power of modern datalakes with Hudi, MinIO, and HMS. Seamlessly integrate these technologies for enhanced data governance. Set up your own cloud-native datalake and explore it with Spark.

Powering AI/ML Innovation: Building Feature Stores with MinIO’s High-Performance Object Storage

David Cannan on AI/ML 12 March 2024

MinIO’s high-performance object storage is key for AI innovation, offering scalability and integration for feature stores. Its capabilities enable seamless ML workflows, enhancing data management for AI development and deployment, impacting sectors like e-commerce and healthcare.

The Enterprise Object Storage Feature Set

Jonathan Symonds on Enterprise Object Store 12 March 2024

With so much product goodness coming out at once today we thought it would make sense to craft a quick summary post of all of the changes we have made and all of the features we have introduced. Let’s start from the top. MinIO now has two product binaries, the MinIO Enterprise Object Store and the MinIO Object Store.

A Single Pane of Glass - The Enterprise Global Console

Jonathan Symonds and Daniel Valdivia (@dvaldivia) on Enterprise Object Store 12 March 2024

The world changed for MinIO when we introduced the Console to our customers and community nearly three years ago. It was a massive leap forward in accessibility. The trusty CLI and MC commands quickly gave way to the speed and intuitive usability of our new browser-based GUI. It was a game changer for developers and enterprise IT admins. With just

MinIO Enterprise Cache: A Distributed DRAM Cache for Ultra-Performance

Keith Pijanowski on Enterprise Object Store 12 March 2024

As the computing world has evolved and the price of DRAM has plummeted, we find that server configurations often come with 500GB or more of DRAM. When you are dealing with larger deployments, even those with ultra-dense NVMe drives, the number of servers multiplied by the DRAM on those servers can quickly add up – often to several TBs. That DRAM
