Someone asked me for what problems MapReduce is not good for, I am flipping the question and answering what problems it works best for. Say, someone asked when to use recursive logic vs iterative logic, there is a bit of grey area there even though some problems clearly lend itself to recursion like graph traversal,… Continue reading
Post Category → MapR
ElasticSearch on Hadoop
Some background on how ElasticSearch indexes documents For those who already have a back ground on Elastic Search, it is just a special purpose full text search document based data store with real time indexing abilities. How does Elastic Search does this? If you are already aware of the Search engines and the underlying data… Continue reading
Scheduling recurring jobs (cron like) with oozie
Any one familiar with cron will appreciate the fact that it is possible to configure a workflow to schedule any type of jobs like a map-reduce jobs or HIVE job or sqoop jobs in Hadoop using Apache oozie. Some good examples here