Nested foreach with order by

Hello everyone,

I have a FOREACH statement with a nested ORDER BY, and I pass the ordered bag to a UDF. For example:

logs = LOAD 'raw_data' USING org.apache.hcatalog.pig.HCatLoader();

logs_g = GROUP logs BY (date, site, profile) PARALLEL 2;

service_flavors = FOREACH logs_g {
    t = ORDER logs BY status;
    GENERATE group.date AS dates, group.site AS site, group.profile AS profile,
             FLATTEN(MY_UDF(t)) AS (generic_status);
};

The problem is that I get duplicate results. I understand that MY_UDF runs on the mappers, but shouldn't each mapper receive exactly one group from logs_g? Is something wrong with the ORDER BY? I tried adding PARALLEL to the ORDER BY, but I get syntax errors.

The problem goes away if I use GROUP logs BY (date, site, profile) PARALLEL 1; but that is not a scalable solution. Can someone please help? I am using Pig 0.11.

Cheers,
Anastasis
