Added a link to Sebastian's data modeler tool and links to twitter accounts.
r/m old stuff, add PK post
|Deletions are marked like this.||Additions are marked like this.|
|Line 7:||Line 7:|
A blog post by committer [[https://twitter.com/tylhobbs|Tyler Hobbs]] that gives [[http://www.datastax.com/dev/blog/basic-rules-of-cassandra-data-modeling|Basic Rules of Cassandra Data Modeling]].
!DataStax has a free self-paced online course [[https://academy.datastax.com/courses/ds220-data-modeling?dxt=blogposting|DS220: Data Modeling with Apache Cassandra]].
A blog post by [[https://twitter.com/syllogistic|Sebastian Estevez]] describing a web based tool he created to help visualize data models (pre-3.0) as well as gives a tailored cassandra-stress configuration file to test the model. [[http://www.datastax.com/dev/blog/data-modeler|Using the Cassandra Data Modeler to Stress and Size Cassandra Instances]]
|Line 21:||Line 15:|
|1. [[http://www.planetcassandra.org/blog/the-most-important-thing-to-know-in-cassandra-data-modeling-the-primary-key/|The most important thing to know in Cassandra data modeling: The primary key]]. (Blog post)|
Cassandra is a partitioned row store, where rows are organized into tables with a required primary key.
The first component of a table's primary key is the partition key; within a partition, rows are clustered by the remaining columns of the PK. Other columns may be indexed independent of the PK.
This allows pervasive denormalization to "pre-build" resultsets at update time, rather than doing expensive joins across the cluster.
Patrick McFadin's data modeling series: