Big Data

Data is at the core of today's modernisation efforts. It is growing at a rapid pace, and modern technologies are evolving to address that growth. Follow along to read more on cloud-native Big Data developments.
Feb 26
SQL Query on MinIO

Full-fledged analytical applications, AI and ML workloads, and dashboards all need a high-performance query engine that understands standard SQL.
3 min read
Jan 16
SQL Query on Parquet Files with DataFusion

The Rust big data ecosystem is all set for the big time, with Arrow and its surrounding projects (DataFusion, Ballista) leading the pack.
3 min read
Dec 28
Big Data ecosystem turning to Rust: an overview

Java is synonymous with the last generation of Big Data tools and technologies. But a lot has changed since the 2000s. Latest
3 min read
Dec 01
The Curious Case of Small Files

Background: Most files, by virtue of their average size and usage patterns, are clearly cut out for
4 min read
Sep 01
Streaming Data Tools & Techniques

Introduction: Streaming data is exactly what it sounds like, a continuously flowing stream of data generated by one or multiple
6 min read
Aug 01
Deploy Spark on Kubernetes

Introduction: YARN has been the default orchestration platform for tools from the Hadoop ecosystem. This has started changing in recent times.
6 min read
Jul 01
Modern Data Lakes Overview

Background: As data volumes grow to new, unprecedented levels, new tools and techniques are coming into the picture to handle this
6 min read
May 01
Persist Kafka Messages to MinIO

Learn how to use Kafka and MinIO to ingest huge data volumes and store them persistently, so the data remains available for later analysis and consumption.
3 min read