I didn't see the size of the cluster described, but watch out for having timesta... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		advisory5739f2 on Aug 21, 2013 \| parent \| context \| favorite \| on: Cassandra vs MongoDB For Time Series Data I didn't see the size of the cluster described, but watch out for having timestamps as your row keys, for you are going to have hot spots (all timestamps in one token range). What is your replication factor and the size of the cluster? This might be improved with vNodes, though I'm not sure how granular and automatic the subnodes are. If they are just an even range (e.g. 256 vnodes across the same 00-ff range), then you will have the same problem. This is the major reason why Datastax pushes random ordered partitioning so much, it's easy to get into hot water with byte-ordered keys.

relistan on Aug 21, 2013 [–]

You're right. As I mention in the article, the row keys are not timestamps, the columns are timestamps. We use the RandomPartitioner for rows.

advisory5739f2 on Aug 21, 2013 | [–]

Sorry, I misread!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact