本帖最后由 jttsai 于 2014-8-15 16:38 编辑
2014-08-15 16:12:29,364 INFO ActionStartXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] Start action [0000001-140815155144199-oozie-oozi-W@:start:] with user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2014-08-15 16:12:29,365 WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] [***0000001-140815155144199-oozie-oozi-W@:start:***]Action status=DONE
2014-08-15 16:12:29,365 WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] [***0000001-140815155144199-oozie-oozi-W@:start:***]Action updated in DB!
2014-08-15 16:12:29,454 INFO ActionStartXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Start action [0000001-140815155144199-oozie-oozi-W@mr-node] with user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2014-08-15 16:12:30,512 INFO MapReduceActionExecutor:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] checking action, external ID [job_1408075517794_0010] status [RUNNING]
2014-08-15 16:12:30,515 WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] [***0000001-140815155144199-oozie-oozi-W@mr-node***]Action status=RUNNING
2014-08-15 16:12:30,515 WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] [***0000001-140815155144199-oozie-oozi-W@mr-node***]Action updated in DB!
2014-08-15 16:12:46,983 INFO CallbackServlet:539 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[-] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] callback for action [0000001-140815155144199-oozie-oozi-W@mr-node]
2014-08-15 16:12:47,129 WARN ActionCheckXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Exception while executing check(). Error Code [JA017], Message[JA017: Unknown hadoop job [job_1408075517794_0010] associated with action [0000001-140815155144199-oozie-oozi-W@mr-node]. Failing this action!]
org.apache.oozie.action.ActionExecutorException: JA017: Unknown hadoop job [job_1408075517794_0010] associated with action [0000001-140815155144199-oozie-oozi-W@mr-node]. Failing this action!
at org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1134)
at org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:180)
at org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:55)
at org.apache.oozie.command.XCommand.call(XCommand.java:280)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:724)
2014-08-15 16:12:47,130 WARN ActionCheckXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Failing Job due to failed action [mr-node]
2014-08-15 16:12:47,132 WARN LiteWorkflowInstance:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Workflow Failed. Failing node [mr-node]
2014-08-15 16:12:47,161 INFO KillXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[-] STARTED WorkflowKillXCommand for jobId=0000001-140815155144199-oozie-oozi-W
2014-08-15 16:12:47,171 INFO KillXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[-] ENDED WorkflowKillXCommand for jobId=0000001-140815155144199-oozie-oozi-W
2014-08-15 16:13:03,603 INFO CallbackServlet:539 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[-] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] callback for action [0000001-140815155144199-oozie-oozi-W@mr-node]
2014-08-15 16:13:03,613 ERROR CompletedActionXCommand:536 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] XException,
org.apache.oozie.command.CommandException: E0800: Action it is not running its in [FAILED] state, action [0000001-140815155144199-oozie-oozi-W@mr-node]
at org.apache.oozie.command.wf.CompletedActionXCommand.eagerVerifyPrecondition(CompletedActionXCommand.java:77)
at org.apache.oozie.command.XCommand.call(XCommand.java:251)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:724)
我已经把把jobhistory的配置相关信息放在oozie的conf/hadoop-conf/core-site.xml中,jobhistory也已经开启了,我的配置如下:
<property>
<name>mapreduce.jobhistory.address</name>
<value>master1.hadoop:10020</value>
</property>
<property>
<name>mapreduce.jobhistory.webapp.address</name>
<value>master1.hadoop:19888</value>
</property>
<property>
<name>mapreduce.jobhistory.intermediate-done-dir</name>
<value>${hadoop.tmp.dir}/mr/history-tmp</value>
</property>
<property>
<name>mapreduce.jobhistory.done-dir</name>
<value>${hadoop.tmp.dir}/mr/history-done</value>
</property>
|
|