Git-like versioning for your AI Data

You’ve surely version controlled code in the past. But have you version controlled your data? Did you ever want to collaborate on large sets of data with various teams without committing a large chunk?
Enhance your AI workflows by combining MinIO’s scalable AIStor with Polars, a lightning-fast DataFrame library. Learn how this powerful duo accelerates data pipelines, handles massive datasets, and offers powerful performance and scale.
Dell generally focused on the filer game, but they dabble in object storage and have a very old offering, ECS. That makes sense; it was a step up from tape and wasn’t suited for dynamic workloads like HDFS modernization or database workloads. Needless to say, AI was out of the question. For a few years now, Dell has been
Faced with skyrocketing compute costs, MinIO data scientist Archana Vaidyanathan leveraged the power of the data lakehouse, which allows for flexible compute choices without overhauling storage. AIStor enhances this model, delivering speed, scalability, and cost savings.
Simplify your data streaming architecture with WarpStream, a cloud-native, Kafka-compatible platform, recently acquired by Confluent, that cuts costs and complexity. Paired with MinIO's high-performance object storage, it's a powerful alternative to Kafka for scalable, cost-effective streaming.
Large numbers of small files present big challenges for application performance.
Confluent's WarpStream acquisition highlights the future of data streaming built on object storage. WarpStream’s cloud-native design cuts costs by 85% over traditional Kafka. Believe the hype: object storage drives low-cost, scalable performance.
Pairing the Iceberg table format with AIStor creates a powerful, flexible and extensible lakehouse platform. The Iceberg Table Spec declares a table format that is designed to manage “a large, slow-changing collection” of files or objects stored in a distributed system.
MinIO introduced its conditional write feature long before AWS S3’s recent announcement. This powerful tool offers greater control in high-concurrency environments, ensuring data consistency and reliability, especially in AI and ML workflows.
Databricks CEO Ali Ghodsi Decouple storage and compute for more control, lower costs, and scalability. Modern datalakes, built on high-performance object storage like MinIO, empower you to handle AI/ML workloads with flexibility and performance, without relying on proprietary platforms.
Take advantage of cloud native, Kubernetes-oriented, microservices-based architectures with object storage.
When you think about object storage workloads and storage types, databases are increasingly a core workload. The changes are driven by two forces: the availability of high-performance object storage and the explosive growth of data and specifically its associated metadata. Because of these two forces, almost every major database vendor now includes S3-compatible endpoints. Further, for many
In this post we’ll show you how to get a production-grade MinIO cluster up and running in just a few seconds. Not only that, but we’ll also show you how to expand that cluster in just a few seconds as well.
Microblink is an AI company specializing in image detection. They got their start in the identity space with products like BlinkID, BlinkID Verify, and BlinkCard. Most recently, their image detection capabilities have led to products that can process other types of images. For example, product detection can be performed on receipts, whereby product descriptions on a receipt are used to
We really like the team over at Packet Pushers. Their podcast is one of the best in the industry, and they cover technology from the top of the stack to the bottom. We recently had the opportunity to sponsor the legendary Tom Lyon for an interview with Ethan Banks and Drew Conry-Murray. The team at Packet Pushers was intrigued by
This post first appeared on The New Stack on July 29th, 2024. Artificial Intelligence is in the middle of a perfect storm in the software industry, and now Mark Zuckerberg is calling for open-sourced AI. Three powerful perspectives are colliding on how to control AI: 1. All AI should be open-source for sharing and transparency. 2. Keep AI closed-source and
MinIO is the fastest object storage available, but how do you know that the underlying infrastructure is free from bottlenecks?
Our client, a global financial institution headquartered in Japan, recently completed an ambitious Hadoop replacement project with MinIO and Dremio. You can see them present it in this talk from Subsurface, but we thought we would write it up as well. Like most banks, the firm had built out a large Hadoop footprint to power its analytics and risk management
The rise of lakehouse functionality is reshaping data management. ParadeDB's pg_lakehouse extension lets PostgreSQL integrate with object storage, enabling scalable, secure analytics. This makes the modernization of data infrastructure possible without extensive overhauls. Welcome to the future!
We often talk about how good, fast and reliable access to data is paramount if you want to have the upper hand in your AI/ML game. Why is this the case? Because hardware failures happen at different levels.