cassandra-glossary

cassandra-glossary#glossary

column
- page 221: each key points to another map of key-value pairs called columns (#)
- page 221: all columns for a key is stored together, physically, making it inexpensive to access ranges of columns (#)
- page 221: columns can differ from key to key (#)
- page 222: columns are sorted (#)
- page 223: because the columns are ordered and stored together, slice operations over a range of columns are very efficient. (#)
- Cassandra can store up to 2 billion columns per row, but if we're storing data every millisecond you wouldn't even get a month's worth of data (#)
column family
- Page 221: column family is analogous to tables in relation ab databases (#)
- page 222: schemas only have to be created once per column family (#)
clustered columns
- slice: A set of clustered columns in a partition that you query as a set using, for example, a conditional WHERE clause. (#)
clustering
- Clustering is a storage engine process that sorts data within the partition. (#)
key
- page 221: if you consider a column family a giant map, keys are the top-level entries in that map (#)
- page 221: cassandra uses keys to partition a column family across a cluster (#)
- page 221: each key points to another map of key-value pairs called columns (#)
- can partition keys randomly or preserve order (#)
keyspace
- Cassandra has the concept of a keyspace, which is similar to a database in a RDBMS. A keyspace holds data objects and is the level where you specify options for a data partitioning and replication strategy. (#)
partition key
- The partition key determines which node stores the data. It is responsible for data distribution across the nodes (#)
row
- Cassandra can store up to 2 billion columns per row, but if we're storing data every millisecond you wouldn't even get a month's worth of data (#)
schema
- page 222: schemas only have to be created once per column family (#)
secondary index
- In Cassandra, if you want to query columns other than the primary key, you need to create a secondary index on them (#)
- (for a high-cardinality column) It would probably be more efficient to manually maintain the table as a form of an index instead of using the Cassandra built-in index (#)
slice
- page 223: because the columns are ordered and stored together, slice operations over a range of columns are very efficient. (#)
- slice: A set of clustered columns in a partition that you query as a set using, for example, a conditional WHERE clause. (#)
timeuuid
- timeuuid: ideal for use in applications requiring conflict-free timestamps (#)

cassandra-glossary#glossary

Referring Pages

cassandra-organized-conceptual-glossary