Monday, April 13, 2015

HDFS permissions explained

Reference: http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsPermissionsGuide.html

In several scenarios, you might want to change the default permissions that HDFS gives to newly created files and directories. The key property to set in hdfs-site.xml is fs.permissions.umask-mode = 022. (Note: there appears to be a bug in Apache Hadoop where the four-digit form is not accepted as advertised, so use three digits.)
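To make the effect of the umask concrete, here is a minimal Python sketch of the arithmetic. It assumes the usual POSIX-style convention that a new file starts from mode 666 and a new directory from mode 777 before the mask is applied. The helper name apply_umask is hypothetical and is not part of any Hadoop API.

```python
def apply_umask(requested_mode: int, umask: int) -> int:
    """Clear every permission bit that is set in the umask."""
    return requested_mode & ~umask & 0o777


# Value of fs.permissions.umask-mode, written with three octal digits.
umask = int("022", 8)

# Assumed base modes: 666 for new files, 777 for new directories.
file_mode = apply_umask(0o666, umask)
dir_mode = apply_umask(0o777, umask)

print(oct(file_mode), oct(dir_mode))  # 0o644 0o755
```

With 022, group and other lose write access. A mask of 002 would keep new files group-writable (664), and 077 would make them private to the owner (600).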

Integrating Tableau, Hive and Elasticsearch

A good article on supporting a highly interactive Tableau dashboard with Hive and Elasticsearch: http://ryrobes.com/systems/connecting-tableau-to-elasticsearch-read-how-to-query-elasticsearch-with-hive-sql-and-hadoop/

Monday, April 6, 2015

Microsoft buys Revolution Analytics

Microsoft is getting aggressive on machine learning and has acquired Revolution Analytics. They plan to integrate it with HDInsight. http://blogs.technet.com/b/machinelearning/archive/2015/04/06/microsoft-closes-acquisition-of-revolution-analytics.aspx

Saturday, April 4, 2015

Ecosystem for Data Scientists

http://www.computerworld.com/article/2902920/the-data-science-ecosystem-part-2-data-wrangling.html