Pairing the Iceberg table format with AIStor creates a powerful, flexible and extensible lakehouse platform. The Iceberg Table Spec declares a table format that is designed to manage “a large, slow-changing collection” of files or objects stored in a distributed system.
Read more
Discover how to seamlessly migrate from HDFS to modern object storage without ripping out all of your current systems. Learn valuable strategies to retain essential tools and modernize your infrastructure for AI/ML.
Read more
Introduction
In 1997, Clayton Christensen, in his book The Innovator’s Dilemma, identified a pattern of innovation that tracked the capabilities, cost, and adoption by market segment between an incumbent and a new entrant. He labeled this pattern “Disruptive Innovation.” Not every successful product is disruptive - even if it causes well-established businesses to lose market share or even fail
Read more
Kafka and Spark Structured Streaming are used together to build data lakes/lake houses fed by streaming data and provide real time business insights.
Read more
In this blog post, we will build a Notebook that uses MinIO as object storage for Spark jobs to manage Iceberg tables.
Read more
Apache Spark and MinIO are powerful tools for data lakes and analytics. Learn how to run them in Kubernetes.
Read more
Learn how to build a multi-cloud data lake with the Delta open storage format and MinIO object storage.
Read more
Migrate data from HDFS to MinIO and enjoy the benefits of cloud-native architecture.
Read more
This is a guest blog from our friends at Guardant Health
[http://www.guardanthealth.com/].
Guardant Health is the world leader in comprehensive liquid biopsy. Oncologists
order our blood test to help determine if their advanced cancer patients are
eligible for certain drugs that target specific genomic alterations in tumour
DNA. Each test produces huge amounts of genomic data that
Read more