Nitish Tiwari

Nitish Tiwari

Nitish works as a distributed system engineer. He is focussed on building and scaling Data products on Modern Cloud Native Software Platforms like Kubernetes.
Feb
26
SQL Query on MinIO

SQL Query on MinIO

Full fledged analytical applications, AI, ML workloads, dashboards - need a high performance query engine, that understands standard SQL parlance.
3 min read
Jan
16
SQL Query on Parquet Files with DataFusion

SQL Query on Parquet Files with DataFusion

Rust big data ecosystem is all set for bigtime - with Arrow and surrounding ecosystem (DataFusion, Ballista) leading the pack.
3 min read
Dec
28
Big Data ecosystem turning to Rust: an overview

Big Data ecosystem turning to Rust: an overview

Java is synonymous with last generation of Big Data tools and technologies. But a lot has changed since 2000s. Latest
3 min read
Oct
02
Kubernetes Hardening Guide

Kubernetes Hardening Guide

Organisations all around us, large and small, are adopting or planning to move to Kubernetes. As number of Kubernetes installations
2 min read
Jul
15
Understanding Horizontal Pod Autoscaling

Understanding Horizontal Pod Autoscaling

Autoscaling is an important aspect of running applications on Kubernetes at scale. Not only does it ensure your applications smoothly
4 min read
Dec
01
The Curious Case of Small Files

The Curious Case of Small Files

BackgroundMost of the files, by the virtue of their average size and usage patterns are clearly cut out for certain
4 min read
Nov
01
Storage: Complete Overview for Developers

Storage: Complete Overview for Developers

IntroductionBeing a developer today requires a working understanding of major computer technologies, storage being one of them. Yet, storage is
6 min read
Oct
01
Get Started with Computer Science Papers

Get Started with Computer Science Papers

IntroductionOne of the best ways to learn and improve in your field is doing something on your own, either as
2 min read
Sep
01
Streaming Data Tools & Techniques

Streaming Data Tools & Techniques

IntroductionStreaming data is exactly what it sounds like, a continuously flowing stream of data generated by one or multiple sources.
6 min read
Aug
01
Deploy Spark on Kubernetes

Deploy Spark on Kubernetes

IntroductionYarn has been the default orchestration platform for tools from Hadoop ecosystem. This has started changing in recent times. Especially
6 min read