Quantcast
Channel: Apache Timeline
Viewing all articles
Browse latest Browse all 5648

Is it possible to fix MR jobs order in Pig?

$
0
0
I have a Pig script that was divided by the Pig framework in two MapReduce
jobs. So far so good.

One of these jobs was always failing. When I checked the logs I realized
that Pig is executing the "2nd" job before the "1st".

Well, I think this is happening because the second part of my script
doesn't depend explicitly on the first part. But I'd like it to be executed
before the other part. Is it possible?

I know Pig tries to optimize several things, but changing the order of the
MR jobs is not something nice. Are pigs "domestic animals" are all?

By the way, how much control do we really have over Pig's internal DAG?

Thanks,
Rodrigo Ferreira.

Viewing all articles
Browse latest Browse all 5648

Trending Articles