Making the Most of Streaming with Kafka Schema Registry and MinIO

Make your Kafka topics performant and efficient with Kafka Schema Registry.
A collection of 68 posts tagged with "Modern Data Lakes"
The Supermicro GrandTwin™ SuperServer is solid, well-designed NVMe-class hardware that we recommend for MinIO workloads.
Build your on-prem data lake with Apache Iceberg, Dremio and MinIO
Learn how to get started with Dremio and MinIO on Kubernetes for fast, scalable analytics.
We live in a cloud-native world where edge architecture must be consistent with cloud architecture, and where data can be retrieved with the same API call regardless of where it lives.
You have probably heard of data formats like Parquet, ORC, Avro, Arrow, Protobuf, Thrift and MessagePack. What are they, and how do you choose the right one?
Learn how to use DataProfiler, an OSS project, to identify sensitive information, then use MinIO object storage to protect that data.
Do you need to find a way to replace Hadoop in your data lake and add cloud-native capabilities?
Learn how to build a multi-cloud data lake with the Delta open storage format and MinIO object storage.
This post focuses on how Iceberg and MinIO complement each other and how various analytic frameworks (Spark, Flink, Trino, Dremio, and Snowflake) can leverage the two.
Enabling and tuning transparent data compression in MinIO.
Learn how to build a cloud-native analytics and visualization stack backed by MinIO.
Working with a data lake of ZIP files? Learn how to download individual files from ZIP archives saved on MinIO.
Stellar performance-at-scale, flexibility and consistency make object storage the best choice for cloud-native enterprises.
Migrate data from HDFS to MinIO and enjoy the benefits of cloud-native architecture.
With today’s announcement of Starburst’s support for MinIO, it made sense to revisit the architectural trends that are becoming the standard for analytics workloads. Starburst provides a perfect example as we shall see shortly. The architecture follows the model of disaggregating storage and compute. Modern, high speed networks have obsoleted the old approaches espoused by the many defunct
How to pair fast and efficient search with high-performance Kubernetes-native object storage.
When you think about the cloud, it helps to think about the types of businesses that have been built with elastic compute, networking and storage as a foundational component and self-service/multi-tenancy as the vehicle for customer engagement. For the most part, those businesses succeeded at scaling by focusing their efforts on building their product, almost exclusively on a single
Document management is a core requirement for all sorts of regulated institutions - finance, telecom, healthcare, government and others. These institutions need to manage and retain an ever-growing number of documents, and regulatory guidelines often require these documents to be stored for a very long term (7-10 years). Take, for example, KYC (Know Your Customer) documents. Anyone starting
With the introduction of Apache Arrow, a language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations, MinIO data lakes can be much more powerful. This article explains how to make use of Apache Arrow by using ArrowRDD.