Thoughts

Hi everyone,

I'm developing a fairly complex system using Pig and I'd like to confirm
a couple of ideas if possible. They are not really questions, more like
thoughts.

1) I'm creating my "input" data using Pig itself. That is, the actual
input is a small file with a few rows (few meaning not Big Data), and for
each of these rows I generate a lot of data (my real input).

Since generating the real input is CPU-bound, I decided to create a
separate file for each row and LOAD them separately, so that Pig launches
a different map task for each of them and I hopefully get some
parallelism. Is that OK?
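
For illustration, the splitting step could look like the sketch below. It
is only a minimal sketch in Python, not what I actually run: the function
name split_rows, the "row_" file prefix and the file names are all
hypothetical, and it assumes the seed file is plain text with one row per
line.

```python
import os
import tempfile


def split_rows(input_path, output_dir, prefix="row_"):
    """Write each non-empty line of input_path to its own file in output_dir.

    Returns the list of files written, in row order. The naming scheme
    (prefix plus zero-padded index) is an assumption made for this sketch.
    """
    os.makedirs(output_dir, exist_ok=True)
    with open(input_path, encoding="utf-8") as src:
        # Skip blank lines so that no empty input file is produced.
        rows = [line.rstrip("\n") for line in src if line.strip()]

    written = []
    for index, row in enumerate(rows):
        path = os.path.join(output_dir, f"{prefix}{index:05d}.txt")
        with open(path, "w", encoding="utf-8") as dst:
            dst.write(row + "\n")
        written.append(path)
    return written


if __name__ == "__main__":
    # Small demonstration with a throwaway seed file of three rows.
    with tempfile.TemporaryDirectory() as tmp:
        seed = os.path.join(tmp, "seed.txt")
        with open(seed, "w", encoding="utf-8") as f:
            f.write("alpha\t1\nbeta\t2\ngamma\t3\n")
        for part in split_rows(seed, os.path.join(tmp, "parts")):
            print(os.path.basename(part))
```

Each resulting file would then get its own LOAD statement. One caveat, as
far as I know: Pig can combine small input files into a single split (the
pig.splitCombination property), so whether every file really gets its own
map task may depend on the configuration.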

2) I have a UDF that I call in a projection. This UDF communicates with my
S3 bucket, and the relation produced by the projection is never used. It
seems that the Pig optimizer simply discards the UDF call. My workaround
was to make the UDF return a boolean value and STORE the result on S3 (a
lightweight file). This way it gets executed. Any thoughts on this?

Thank you. I'll come back later on with other ideas. I hope this reasoning
may help someone :)

Rodrigo Ferreira
