Thursday, February 26, 2015

Oracle Data Integrator

I have been researching on Oracle Data Integrator and it's strategy on Big Data integration. I'll add further slides, this one below maybe a good start.

Saturday, February 21, 2015

Azure HDInsight now runs on Linux

This is great news from Microsoft, HDInsight can now run on Linux servers. This allows easier migration of current data center driven Hadoop implementations to Microsoft hosted cloud solution. Below are slides from Microsoft's presentation at Strata http://cdn.oreillystatic.com/en/assets/1/event/118/Running%20Hadoop-as-a-Service%20in%20the%20Cloud%20Presentation.pptx

What's coming in Spark 2015

Architectural Considerations for Hadoop

Really great read for Architects trying to understand what components to use while building a Hadoop solution http://cdn.oreillystatic.com/en/assets/1/event/118/Architectural%20Considerations%20for%20Hadoop%20Applications%20Presentation.pdf Check out this link for other conference slides and videos http://strataconf.com/big-data-conference-ca-2015/public/schedule/proceedings

Thursday, February 19, 2015

HDInsight now supports Spark

http://azure.microsoft.com/en-us/documentation/articles/hdinsight-hadoop-spark-install/

Tuesday, February 17, 2015

Hive SQL cheat sheet

Check this site on simple comparison between SQL and HiveQL http://hortonworks.com/blog/hive-cheat-sheet-for-sql-users/

Tuesday, February 10, 2015

Hive on Spark

Spark is becoming the next generation MapReduce framework for Hadoop. Hive is now able to run on Tez and Spark as well. See the slides below that detail the Hive on Spark plan

Thursday, February 5, 2015

BigData startups in India

http://yourstory.com/2015/02/indian-big-data-companies-startups/