Introduction
Generative AI represents the latest technique an enterprise can employ to unlock the data trapped within its boundaries. The easiest way to conceptualize what is possible with Generative AI is to imagine a customized Large Language Model - similar to the one powering ChatGPT - running inside your firewall. Now, this custom LLM is not the same as the
Read more
Unlock the secrets of modern datalakes migration to the private clouds. Embrace S3 compatibility, data control, and the ever-evolving landscape for cost-effective data management. Don't miss the journey to enhanced flexibility, efficiency, and the future-proofing of your data ecosystem
Read more
Today we’ll talk how we use our local lab to test some of the key features and functionality to not only show you but also hopefully inspire you to elevate the technology and processes in your lab too that can make debugging any application a piece of cake.
Read more
A lot of ink has been spilled on the significance of the AI/ML technology wave (here are our posts). What doesn’t get attention, but probably should, is how AI/ML is remaking the technology power structure inside the enterprise. As companies reorganize around a data-centric orientation, they are also reorganizing who makes and executes the technology architecture. While
Read more
This is your symphony for data excellence. Explore the components of this modern data stack, including storage, data integration, transformation, data observability, data discovery, data visualization, data analytics, and machine learning.
Read more
Unlock the true potential of your cloud migration journey! Learn how embracing the cloud as an operating model, rather than a location, can revolutionize your technology approach. Find out why portability, the right tools, and open standards are your keys to success.
Read more
Build a streaming Change Data Capture (CDC) pipeline with Redpanda and MinIO into Snowflake. This solution simplifies data migration and analytics, with Redpanda offering scalability, MinIO as efficient storage, and Snowflake as a cloud-native analytics engine.
Read more
Confluent, Intel and MinIO conducted benchmarking and certification testing for MinIO Tiered Object Storage for Kafka storage. This blog post describes the observations and results of testing MinIO object storage as a backend for the tiered storage feature of Confluent Platform 7.1.0 on servers equipped with third generation Intel Xeon Scalable processors. The scope of these tests was
Read more
Hugging Face's DatasetDict class is a part of the Datasets library and is designed to make working with datasets destined for any model found on the Hugging Face Hub efficient. As the name implies, the DatasetDict class is a dictionary of datasets. The best way to understand objects created from this class is to look at a quick
Read more
To perform miscellaneous tasks, instead of modifying the main application or the container it's running in, you can run it in a separate container next to the main application as a sidecar.
Read more
MinIO provides a flexible Identity and Access Management system that can be integrated with popular external identity providers. MinIO IAM is built with AWS IAM compatibility at its core - access is controlled by policies mirroring AWS' IAM policies. While AWS supports myriad ways to control access, including ACLs, Bucket Policies, etc, in the interest of simplicity, MinIO'
Read more
Enterprises rely on data to make decisions. Effective decision-making hinges on the accuracy, timeliness, availability, and security of data. Data consistency, an important factor that cannot be ignored when purchasing storage, involves ensuring that all relevant parties can immediately access the results of a database transaction once it has been finalized, either through commitment or rollback. This guarantees that everyone
Read more
Some of the smartest minds in philanthropy are backing the concept of a simple yet powerful national ID system. The Bill and Melinda Gates Foundation, the Tata Trusts, the Omidyar Network and the Pratiksha Trust have all gotten involved with this movement because of its foundational capabilities for enabling a wide range of social programmes. They have put their resources
Read more
What is streaming mode? Essentially it allows you to backup with Percona xtraBackup without touching disk. When used alongside MinIO Jumbo, it is designed to upload and retrieve large objects from the MinIO cluster.
Read more
Enterprise customers use MinIO to build data lakehouses to store a wide variety of structured and unstructured data, and work with it using ML and analytics. Data flows into MinIO from across the enterprise and the S3 API allows applications, such as analytics and AI/ML to work with it.
I previously blogged about building data pipelines with SAP Data
Read more
We’ll show you how you can use your MinIO cluster (on-prem, in the cloud, or on Kubernetes) to be used as a Helm Chart repository.
Read more
MinIO’s training team recently completed a multi-part series on MinIO event monitoring, in which Will Dinyes brings us through five in-depth videos. By the end of the video series you should be able to register an event destination, subscribe to an event notification, and deploy a custom Python-based webhook.
Will kicks off the MinIO event monitoring series with an
Read more
With MinIO, enterprises are not forced to make a choice. They can literally use FTP and SFTP to move that data into an S3-like data store. It is the principle of AND not OR.
Read more
Explore the next generation of data streaming with WarpStream and MinIO! While Apache Kafka has been the standard for streaming data, it may be time to consider a simpler, more cost-effective, and cloud-native solution.
Read more