Preserve contents of keys after running k-means

Hi list

We are trying to do some k-means clustering and are wondering if there's an
easy way to preserve the contents of the keys for the input records.

E.g.

12345: (0,3,79,80)
98765: (1,4,98,90)

where the tuples are the vectors being clustered and each key is an id.

When we run clusterdump with pointsDir specified, we get the vectors but
not the keys. We're looking at NamedVector as one possible solution, and
also at keeping a mapping file between ordered integers and the ids, in
the same order.
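
To make the mapping-file idea concrete, here is a minimal sketch in plain
Python. It does not use Mahout at all; the function names
(build_index_mapping, restore_ids) and the cluster assignments are
hypothetical and only illustrate the bookkeeping: number the records in
order, cluster the bare vectors, then translate the integer indexes back
to ids.

```python
# Example records from above: id -> feature tuple.
records = {
    "12345": (0, 3, 79, 80),
    "98765": (1, 4, 98, 90),
}


def build_index_mapping(records):
    """Assign each id an ordered integer; return (vectors, index_to_id)."""
    vectors = []
    index_to_id = {}
    for index, (record_id, features) in enumerate(records.items()):
        vectors.append(features)          # what would be fed to k-means
        index_to_id[index] = record_id    # what would go in the mapping file
    return vectors, index_to_id


def restore_ids(assignments, index_to_id):
    """Translate {index: cluster} into {id: cluster} using the mapping."""
    return {index_to_id[i]: cluster for i, cluster in assignments.items()}


vectors, index_to_id = build_index_mapping(records)

# Placeholder assignments keyed by integer index. These values are made up
# for illustration; they are not real k-means output.
assignments = {0: 0, 1: 1}

clusters_by_id = restore_ids(assignments, index_to_id)
```

This only works if the order of the vectors is preserved end to end, which
is the main risk of this approach compared with carrying the id inside the
vector itself.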

Thanks for any advice.

Best
Andrew
