This is a good article on Hive Server2 architecture
Tips and Tricks to build a Hadoop eco system. References to good articles on Hadoop based solutions. Topics include: Hadoop architecture, Hive, SQL on Hadoop, Compression, Metadata.
Tuesday, September 30, 2014
Thursday, July 3, 2014
How to kill a MapReduce job
Very common need, kill a long running job. following is the syntax
$ hadoop job -kill <job-id>
Usage: hadoop job [GENERIC_OPTIONS] [-submit <job-file>] | [-status <job-id>] | [-counter <job-id> <group-name> <counter-name>] | [-kill <job-id>] | [-events <job-id> <from-event-#> <#-of-events>] | [-history [all] <jobOutputDir>] | [-list [all]] | [-kill-task <task-id>] | [-fail-task <task-id>] | [-set-priority <job-id> <priority>]
Yarn Capacity Schedular
Good article explaining how to use capacity scheduler
https://support.gopivotal.com/hc/en-us/articles/201623853-How-to-configure-queues-using-YARN-capacity-scheduler-xml-
Sunday, June 29, 2014
Friday, June 27, 2014
Hadoop Summit 2014 Presentations online
Links to keynote and presentations of Hadoop Summit 2014. YARN was the focus of this year and MapReduce will soon become legacy:
- http://hadoopsummit.org/san-jose/keynote-day1/
- http://hadoopsummit.org/san-jose/schedule/
Thursday, June 26, 2014
How to cleanup hdfs /tmp space
my jobs are getting /tmp no space left issue.
has anyone used this?
https://github.com/mag-/hdfs-cleanup
Saturday, June 21, 2014
Subscribe to:
Posts (Atom)