Tuesday, September 30, 2014

Thursday, July 3, 2014

How to kill a MapReduce job


Very common need, kill a long running job.  following is the syntax

$ hadoop job -kill <job-id>


Usage: hadoop job [GENERIC_OPTIONS] [-submit <job-file>] | [-status <job-id>] | [-counter <job-id> <group-name> <counter-name>] | [-kill <job-id>] | [-events <job-id> <from-event-#> <#-of-events>] | [-history [all] <jobOutputDir>] | [-list [all]] | [-kill-task <task-id>] | [-fail-task <task-id>] | [-set-priority <job-id> <priority>]

Yarn Capacity Schedular

Friday, June 27, 2014

Hadoop Summit 2014 Presentations online

Links to keynote and presentations of Hadoop Summit 2014.  YARN was the focus of this year and MapReduce will soon become legacy:

  • http://hadoopsummit.org/san-jose/keynote-day1/
  • http://hadoopsummit.org/san-jose/schedule/

Thursday, June 26, 2014

How to cleanup hdfs /tmp space

my jobs are getting /tmp no space left issue. has anyone used this? https://github.com/mag-/hdfs-cleanup