Quantcast
Channel: Apache Timeline
Viewing all articles
Browse latest Browse all 5648

How to read a CSV with Latin small letter thron as delimiter

$
0
0
Hi All,

I want to load data from a file which uses Latin small letter thorn 'þ' as delimiter. However pig cannot detect this separator. In Hive I could solve this issue using '\-2' in but did not work for pig. 

data = LOAD '/home/hadoop/data/test.log' USING org.apache.pig.piggybank.storage.CSVExcelStorage('þ', 'YES_MULTILINE', 'NOCHANGE', 'SKIP_INPUT_HEADER') AS (time:chararray,userid: chararray, visit int);

Thanks for your help,
Hanif

Viewing all articles
Browse latest Browse all 5648

Trending Articles