Terminology confusion: Column Stores and Column oriented databases

This is my attempt to clear the air in the subjects of Column Stores and Column oriented databases (both at terminology and at understanding level). I will be talking a bit about how terrible is the idea of grouping column oriented databases as flavour of NoSQL data stores. What is a column store really ? There is no scope… Continue reading

“Exactly-once” with a Kafka-Storm Integration

Update 4, Nov 2016: When I first wrote this post it was outright mockery and contempt. But the Google Data flow paper (The Unified google framework for Batch (FlumeJava) and Stream processing (MillWheel)) and the Google MillWheel paper clearly explains that this is exactly the same approach google team has taken to solve the duplicate events problem…. Continue reading

Open Innovation Platform

This is a bit off topic for this blog, but still i felt compelled that having a solid platform for innovation is inevitable and required for any engineering discipline. so here is it – my thoughts on building a solid automated (most parts) innovation platform. Feel free to leave your comments and thoughts.