https://labs.spotify.com/2016/02/25/spotifys-event-delivery-the-road-to-the-cloud-part-i/
Tips and Tricks to build a Hadoop eco system. References to good articles on Hadoop based solutions. Topics include: Hadoop architecture, Hive, SQL on Hadoop, Compression, Metadata.
Sunday, February 28, 2016
Friday, February 26, 2016
Apache NiFi aka DataFlow
http://www.infoworld.com/article/2975833/hadoop/hortonworks-buys-better-hadoop-data-flow-management.html
Thursday, February 25, 2016
Hive Streaming
https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest
This leverages Hive transaction capability but is limited to tables with ORC format. Only supports Storm and Flume.
Tuesday, February 23, 2016
Monday, February 15, 2016
Workflow Design and Execution Engines - Luigi Airflow Pinball
http://bytepawn.com/luigi-airflow-pinball.html
Thursday, February 4, 2016
Streamsets - Open Source flume interface
https://streamsets.com
azkaban open source workflow
this has become better now... check this out
https://azkaban.github.io
Subscribe to:
Posts (Atom)