In this post we look at how search, and specifically OpenSearch can help us identify patterns or see trends in our ever growing data.
Read more
This post first appeared on The New Stack on January 16th, 2025.
Often, while accessing the legitimacy of a new technology receiving a lot of hype, studying existing core capabilities and history is helpful. If the new technology in question is not based on existing or imminent capabilities, we can label it as “hype” and move on.
Another litmus test
Read more
2025 has inherited a slew of geopolitical concerns that started years ago. U.S. Foreign policy, U.S. - China Relations, China’s geopolitical maneuvers, Conflicts in the Middle East, Russian Ukraine war, and cybersecurity threats. Additionally, new leadership in the United States adds to the uncertainty created by these concerns. And, as if all this were not enough, the
Read more
dbt’s acquisition of SDF Labs reinforces a powerful trend: the modern data stack is open. Learn why this matters for performance, interoperability, and future-proofing your data strategy.
Read more
Running AIStor on OpenShift enables enterprises to achieve cloud-native elasticity on their hardware or cloud instance of choice, balance cost, capacity and performance.
Read more
The evolution of data roles never stops—first, we were all data scientists, then data engineers, and now, DataOps engineers. But is DataOps really new, or just a fresh take on the same mission: delivering business value through data?
Read more
Discover the power of Apache Iceberg and AIStor in transforming data lakehouses! From multi-engine compatibility to time travel, schema evolution, and blazing-fast performance, this guide dives deep into how Iceberg unlocks the full potential of modern AI and analytics workloads.
Read more
In November of 2023, Amazon announced the S3 Connector for PyTorch. The Amazon S3 Connector for PyTorch provides implementations of PyTorch's dataset primitives (Datasets and DataLoaders) that are purpose-built for S3 object storage. It supports map-style datasets for random data access patterns and iterable-style datasets for streaming sequential data access patterns.
The S3 Connector for PyTorch also includes
Read more
Lately, there has been a trend in the industry to bring data “closer” to home. The result is that organizations now want to keep their data on servers that they own, in their own datacenter or at a colocation provider.
Read more
Earlier this month, Amazon held their re:Invent conference in Las Vegas, Nevada, from December 1st to 5th - a 5-day event. If you have never been to a re:Invent conference, then the word that describes it best is “huge” - not just in terms of the number of attendees (60,000) but also the breadth of topics covered.
Read more
2024 was a HUGE year for MinIO. Not only did we release AIStor, the most powerful version of MinIO to date, but we also attended 54 events, wrote 159 blogs, won over 10 awards and so much more. And all of it wouldn’t be possible without the support of our amazing MinIO community.
So, as a shoutout to you
Read more
In November of 2023 Amazon announced the S3 Connector for PyTorch. The Amazon S3 Connector for PyTorch provides implementations of PyTorch's dataset primitives (Datasets and DataLoaders) that are purpose-built for S3 object storage. It supports map-style datasets for random data access patterns and iterable-style datasets for streaming sequential data access patterns.
In a previous post, I introduced the
Read more
AWS recently unveiled Amazon S3 Tables, claiming to optimize Iceberg analytics on S3. Yet, these "special buckets" mainly fix AWS's own limits—like request caps—not universal object storage issues. With AIStor, you get unmatched performance, no vendor lock-in, and no extra costs for table maintance.
Read more
Deep dive into AIStor mindset on how we do and recommend updates and restarts with AIStor.
Read more
AIStor's Prompt API transforms healthcare data—analyze MRI scans, uncover trends in medical records, and accelerate research with natural language prompts. From automating image analysis to streamlining patient care, it empowers better outcomes for providers, researchers, and patients
Read more
How does Exness handle massive data volumes and demanding AI/ML workloads? By moving to an on-prem infrastructure powered by MinIO. From scaling their data lake to managing traffic peaks of 200 Gbps, MinIO supports their AI workflows, disaster recovery, and more.
Read more
Your DevOps Engineer’s customer should be your AI/ML Engineering Team. The DevOps Engineer is there to ease the friction points in infrastructure so AI/ML folks can focus on the task at hand. Any issues that come with the infrastructure should be the responsibility of the DevOps Engineer.
Read more
Almost a year ago (actually 11 months ago), I wrote about the “Starving GPU Problem” and how the horsepower of Nvidia’s Graphic Processing Units (GPUs) could be so powerful that your network and your storage solution may not be able to keep up - preventing your expensive GPUs from being fully utilized. Well, in those short 11 months, a
Read more