Quantcast
Channel: Apache Timeline
Viewing all articles
Browse latest Browse all 5648

Hadoop stream gzipped file with AvroAsTextInputFormat

$
0
0
I have successfully stream Avro data file to Python mrjobs using the
library AvroAsTextInputFormat

-inputformat
org.apache.avro.mapred.AvroAsTextInputFormat

However, unlike text file,
it does not seems to handle gzipped file automatically. What can I do to
stream a gzipped Avro file?

Wai Yip

Viewing all articles
Browse latest Browse all 5648

Trending Articles