Tips and tricks for building a Hadoop ecosystem. References to good articles on Hadoop-based solutions. Topics include: Hadoop architecture, Hive, SQL on Hadoop, Compression, Metadata.
Thursday, February 5, 2015
BigData startups in India
http://yourstory.com/2015/02/indian-big-data-companies-startups/
Friday, January 23, 2015
Wednesday, January 21, 2015
Hadoop and Security
This is a long-standing topic with no really good solutions yet. Hortonworks has published this article about Ranger + Dataguise:
http://hortonworks.com/blog/hadoop-security-different-paradigm/?mkt_tok=3RkMMJWWfF9wsRovuq%2FOZKXonjHpfsX66%2B8uWaW%2BlMI%2F0ER3fOvrPUfGjI4JSsJhI%2BSLDwEYGJlv6SgFT7TMMbFh1rgNUxc%3D
Thursday, December 18, 2014
Hive 14 Transactions, Inserts, Updates, Deletes explained
Great slides explaining Hive 14 transactional support capabilities.
Monday, December 15, 2014
HDInsight Essentials 2nd edition is coming soon
This is the 2nd edition of my book HDInsight Essentials. This one is more in-depth and goes through the journey of building an enterprise data lake. It is up to date with Hadoop 2.X and HDInsight 3.1.
I also take a real-life project and walk through the ingestion, organization, transformation and reporting phases.
https://www.packtpub.com/big-data-and-business-intelligence/hdinsight-essentials-second-edition

Monday, December 8, 2014
Hive 14 released with useful features for RDBMS offload use cases
Great features in Hive 14 that make it really close to an RDBMS solution based on Hadoop:
http://hortonworks.com/blog/announcing-apache-hive-0-14/
Key features:
- Transactions with ACID semantics
- Cost Based Optimizer
- SQL Temporary Tables
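To make the features above a little more concrete, here is a minimal sketch in Python that only assembles the HiveQL statements one might run to try them out. The table name, column names and the helper `acid_demo_statements` are hypothetical and not taken from the Hortonworks post. The sketch assumes Hive 0.14 behavior: transactional tables must be bucketed, stored as ORC and flagged with `'transactional'='true'`, and the session needs the transaction manager settings shown. Compaction settings on the metastore side are left out here.

```python
# Hypothetical illustration: names below are made up for this sketch.
# Session settings commonly needed for ACID tables in Hive 0.14,
# plus the switch that enables the cost based optimizer.
SESSION_SETTINGS = [
    "SET hive.support.concurrency=true",
    "SET hive.enforce.bucketing=true",
    "SET hive.exec.dynamic.partition.mode=nonstrict",
    "SET hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager",
    "SET hive.cbo.enable=true",
]


def acid_demo_statements(table="customers_acid", buckets=4):
    """Return HiveQL statements that exercise ACID DML and a temporary table."""
    create = (
        f"CREATE TABLE {table} (id INT, name STRING) "
        f"CLUSTERED BY (id) INTO {buckets} BUCKETS "
        "STORED AS ORC TBLPROPERTIES ('transactional'='true')"
    )
    return SESSION_SETTINGS + [
        create,
        f"INSERT INTO TABLE {table} VALUES (1, 'alice'), (2, 'bob')",
        # Update a non-bucketing column; the bucketing column cannot be updated.
        f"UPDATE {table} SET name = 'robert' WHERE id = 2",
        f"DELETE FROM {table} WHERE id = 1",
        # Temporary tables are visible only to the current session.
        f"CREATE TEMPORARY TABLE tmp_{table} (id INT, name STRING)",
    ]


if __name__ == "__main__":
    for stmt in acid_demo_statements():
        print(stmt + ";")
```

The printed statements can be pasted into a Hive session or sent through whichever client you already use. Keeping them as plain strings avoids assuming any particular Python driver.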
Design docs for Hive, if you are interested in the details:
Saturday, November 1, 2014