Hi everyone,
I have a very basic question on the Apache SGD implementation. My training
set has about 50 features, most of which are categorical. Some of these
categories are binary, but others can have an unknown number of discrete
values (countries, cities, etc.).
Should I be encoding these with the ConstantValueEncoder? The
StaticWordValueEncoder?
Thanks,
*Brian Krebs*
CIO and Co-founder
TapHeaven
Mobile: 443.866.2137
Email: bkrebs [ at ] tapheaven.com
Twitter: @BKrebsTH
LinkedIn: www.linkedin.com/in/briankrebs
tapheaven.com
I have a very basic question on the Apache SGD implementation. My training
set has about 50 features, most of which are categorical. Some of these
categories are binary, but others can have an unknown number of discrete
values (countries, cities, etc.).
Should I be encoding these with the ConstantValueEncoder? The
StaticWordValueEncoder?
Thanks,
*Brian Krebs*
CIO and Co-founder
TapHeaven
Mobile: 443.866.2137
Email: bkrebs [ at ] tapheaven.com
Twitter: @BKrebsTH
LinkedIn: www.linkedin.com/in/briankrebs
tapheaven.com