Quantcast
Channel: Apache Timeline
Viewing all articles
Browse latest Browse all 5648

running pig against yarn cluster

$
0
0
Hi,
I am running Pig against a CDH5 Yarn cluster. I am not running pig with the command line, but instead running it programmatically through java code.

The server I am running it from does NOT actually have Pig/Hadoop etc. installed on it, and so there is no mapred-site.xml, yarn-site.xml, etc. All config I do is through a set of properties set at launching time from java.

I need some help figuring out what properties to set. Currently I have these properties set as pig properties:
yarn.resourcemanager.hostname
yarn.resourcemanager.addressmapreduce.framework.name

When I run like this through java, pig doesn’t actually fail but reverts to running in local mode. Of course I don’t want that, it just made it harder to spot what happened.

High level summary of question: what pig properties should I set to run against an external yarn cluster when I cannot rely on any local hadoop config that pig might normally pick up.

Thanks!
Adam

Viewing all articles
Browse latest Browse all 5648

Trending Articles