Oozie hourly coordinator timing out on future actions
At the 5-minute mark of every hour, data from the past hour is loaded into HDFS. I thought I could set up a coordinator job to run at the 10-minute mark of every hour to process this data, with a check that the directory for that hour exists. What actually happens is that the coordinator processes the past hour's data normally at submission time, keeps working fine for the next 2 hours, and then future actions go from 'waiting' to 'timedout'. My guess is that there is a default maximum for how long an action can stay in 'waiting'. It seems counterintuitive that this limit is measured from a single absolute point in time for all actions, including ones scheduled far in the future. Anyway, here's a sample of the coordinator.xml. I'm looking for suggestions on either how to design this in a way that makes more sense or how to raise the default timeout.
Specifically, the <throttle> property limits how many actions can be in 'waiting' status at once. Setting it to 1 means the timeout only applies to the next action that is in 'waiting' status, so later actions are not put into 'waiting' early and left to time out. <timeout> sets the timeout limit (in minutes) for 'waiting' actions, while I believe <concurrency> limits how many actions can be running at once.
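For illustration, here is a minimal sketch of where these three properties sit in a coordinator.xml. This is not the asker's actual file: the app name, the ${startTime}, ${endTime} and ${workflowAppPath} parameters, the schema version, and the timeout value of 1440 are all assumptions chosen for the example, and the dataset and input-event definitions for the hourly directory check are left out.

```xml
<!-- Hypothetical hourly coordinator; names and values are placeholders. -->
<coordinator-app name="hourly-process"
                 frequency="${coord:hours(1)}"
                 start="${startTime}" end="${endTime}" timezone="UTC"
                 xmlns="uri:oozie:coordinator:0.4">
  <controls>
    <!-- Minutes an action may stay in 'waiting' before it is timed out
         (1440 is an example value, not a recommendation). -->
    <timeout>1440</timeout>
    <!-- At most one action running at a time. -->
    <concurrency>1</concurrency>
    <!-- At most one action held in 'waiting' at a time. -->
    <throttle>1</throttle>
  </controls>
  <action>
    <workflow>
      <!-- Assumed parameter pointing at the workflow application in HDFS. -->
      <app-path>${workflowAppPath}</app-path>
    </workflow>
  </action>
</coordinator-app>
```

With <throttle> at 1, only the next action should be sitting in 'waiting' at any moment, so the <timeout> value only needs to cover the gap until that action's data arrives.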
Asked in February 2016. Viewed 1,949 times. Voted 8. Answered 1 time.