Quantcast
Channel: Apache Timeline
Viewing all articles
Browse latest Browse all 5648

order guarantees in bags

$
0
0
Hey All,
I have a question about Pig’s guarantees around the order of tuples in bags. I am trying to decide how paranoid to be about this.

Documentation says that bags are unordered. But, in practice, I have never seen Pig re-order the tuples in a default data bag and nothing about the current implementation suggests they can get out of order.

Also, if you look at PigAvroStorage or JsonStorage (at least the elephant-bird version), both read in arrays as bags. Does that mean they implicitly don’t care about maintaining order in arrays? Or are they counting on the current implementation to keep them in order.

Thanks for any insights on this!
Adam

Viewing all articles
Browse latest Browse all 5648

Trending Articles