Cassandra Interview Questions and Answers

Share This Post

Best Cassandra Interview Questions and Answers

Cassandra is now highly preferred by most of the companies and so there is a huge demand for Cassandra certified professionals. As the need for Cassandra professionals are increasing day by day, they are paid highly too. This is a one-stop resource for all the aspirants who are preparing for Cassandra interview. Here, we have compiled a list of top 50 Cassandra interview questions and answers in guidance with the top recruiters in the industry. We have almost covered all the important topics of Cassandra that are frequently asked by the interviewers. So, just go through all these top 50 Cassandra interview questions and answers to brush up your Cassandra intellect before you attend an interview.

These top 50 Cassandra interview questions and answers will assist you in cracking the interviews more confidently. Both freshers and experienced Cassandra professionals can make use of these interview questions to upgrade their career. So, if you are an expert looking a career in the fields of Cassandra, then just make use of our top 50 Cassandra interview questions and answers and achieve your career goal. We wish you all success in your future career.

1. Can you explain what Cassandra is in detail?

Cassandra is simply an open-source and free to use distributed NoSQL database management system by Apache which is highly utilized to handle and maintain huge volumes of data without any single point of failure. This big data model ensuring high availability and scalability was generally developed by Facebook. Among the several different NoSQL database management system, Cassandra comes under key-value store and column-oriented database.

2. How are MongoDB and Cassandra different from each other?

MongoDB	Cassandra
It ensures a document like data model	It supports a data model like Google Bigtable
Multi-indexed is used to query a data	Data querying can be made using scan or key

3. Differentiate Cassandra and the Traditional RDBMS

Cassandra	Traditional RDBMS
It is masterless and therefore there is no single point of failure in Cassandra	It comes under master-slave core architecture and there might be a point of failure
It is highly available	It generally replicates with master-slave
It can hold dynamic, structured or unstructured data	It comprises of structured data and legacy RDBMS

4. Can you figure out some of the essential features of Cassandra?

Cassandra has several important features while some are listed below:

5. List out the various types of data model present in Cassandra

The different types of data models present in Cassandra are as follows:

Conceptual data model
Logical data model
Physical data model

6. Can you list out the various database elements that are found in Cassandra?

Listed below are the different database elements found in Cassandra:

Cluster
Keyspace
Column family
CQL (Cassandra Query Language) Table

7. What operating systems does Cassandra support?

Windows and Linux are the two operating systems that are being supported by Cassandra.

8. Explain what do you know about clusters in Cassandra

Cluster is the outermost structure of Caasandra which acts as a container of keyspaces and can also be denoted as a ring and the main reason behind this is that Cassandra actually assigns particular data to the nodes present in the cluster by assembling them in the form of a ring which contains different types of replication of data.

9. Define what keyspace is in Cassandra?

The outermost container of data present in Cassandra is known as the keyspace which is a collection of column families. A keyspace is identical to a relational database which consists of a name and a group of attributes that denotes the keyspace-wide behavior.

10. What are the key parameters of keyspace found in Cassandra?

The key parameters that are used in developing a keyspace in Cassandra are as follows:

Keyspace Name
Replication Strategy
Replication Factor &
Durable Writes

Looking for Best Cassandra Hands-On Training?

Get Cassandra Practical Assignments and Real time projects

11. Write down the syntax used to create a keyspace in Cassandra?

CREATE KEYSPACE <identifier> WITH <properties> is the syntax used to create a keyspace in Cassandra.

12. Is it possible to add or remove a column family present in a cluster?

Yes, we can either add or remove a column family in a cluster but look to that all the below listed conditions are met before you do so:

The commitlog should be cleared completely with the nodetool drain
Check whether commitlog is completely free from data by turning off the Cassandra
The SStables across the removed column families should be deleted for certain

13. What is the technique used to iterate all the rows present in a column family?

get_range_slices is the command used to iterate all the rows found in a column family. The iteration can be initially started with an empty string, at the end of each iteration; the last key read will act as a start key for the following iteration.

14. What does the term “Durable writes” refers to in Cassandra?

Durable writes offers commands to Cassandra either to make use of commitlog or not in order to update the recent keyspace. The default value of durable writes will always be TRUE and this is not mandatory, it can be changed too.

15. Explain when should you go for Alter Keyspace

If you are in need of changing the replica counts or to alter the durable writes property of the keyspace, we can make use of the ALTER KEYSPACE command.

16. Explain the feature “Tunable Consistency” in Cassandra

This is one of the most awaited characteristics that have made Cassandra an essential preference by most of the Analysts, Developers and Big Data Architects. The main function of “Tunable consistency” is that is synchronizes and keeps all the data rows on their respective replicas up to date. This consistency level can be opted by the users as per their preference and requirement. There are two types of consistencies present in Cassandra namely – Strong consistency and Eventual consistency.

17. Explain the function of “Capture” and the “Consistency” command in Cassandra

The “Capture” command in Cassandra is used to capture the output data and then affixes it to a particular file while the “Consistency” command in Cassandra either displays the current consistency level or it can be used to set a new preferred consistency level.

18. How can you write a query in Cassandra effectively?

To write a query in Cassandra, we can make use of CQL which is known as the Cassandra Query Language. In order to effectively interact with the database as and when required, Cassandra makes use of CQLSH.

19. Can you define what CQLSH is and list out the functionalities of CQLSH too?

One of the Cassandra query languages that are highly used by the users to effectively communicate with the database is known as CQLSH. The main functionalities of CQLSH are as follows:

It can define any specific scheme
It can be used to insert a data
It can be effectively used to execute any particular query

20. What is the use of Drop table command in CQLSH?

In order to view all the tables that consist of data even from the keyspace, we can make use of the drop table command in CQLSH.

Become Cassandra Certified Expert in 35 Hours

Get Cassandra Practical Assignments and Real time projects

21. Why is truncate table command used in CQLSH?

In order to truncate a table and then to permanently delete all the rows of the table, we can make use of the truncate table command in CQLSH.

22. What is YAML file and what is the use of YAML file in Cassandra?

One of the main configuration files of Cassandra is YAML file. If you have done any changes to the cassandra.yaml file consider rebooting the nodes such that the changes are updated.

23. Do you know what replication factor is in Cassandra?

Cassandra contains copies in the form of replicas of each of the rows in association with the row key. The term “replication factor” denotes the number of nodes that represents the copies or replicas of all the rows of data.

24. Is it possible to change the replication factor of a live cluster?

Yes, we can change the replication factor of a live cluster but it is necessary to run a repair in order to modify the existing data’s replica count.

25. What does the term “Replication Strategy” refers to? And, list out the different types of replication strategy too

The strategy that indicates how the replica will be assembled in the ring is known as the replication strategy. Cassandra comes with several types of replication strategies that determine which node will be provided with which set of copies of which specific key. The types of replication strategies are as follows:

Simple strategy
Network topology strategy

26. Define what is simple strategy in detail

Simple strategy makes use of the simple single datacenter cluster which arranges the foremost replica in the node that was identified by the partitioner. The left over replicas will be assigned to the further nodes in the form of a ring in a clockwise manner without taking into account the datacenter location and the rack.

27. Can you explain what network topology strategy is?

Network topology strategy is considered while deploying a cluster in association with several datacenters. This is the initial prerequisite to affix a replica. It can achieve the requirement without causing datacenter latency and it can also be used to handle failures.

28. How are node, cluster and datacenter different from each other in Cassandra?

A node is nothing but a single machine that can be used to run Cassandra while a cluster is a collection of nodes that consists of identical groups of data. We can group several nodes of cluster into various datacenters which can be used to serve clients located in wide geological areas.

29. Define what is a row in Cassandra and what are the elements that are present in a row

The collection of or a group of sorted columns are known as a row in Cassandra which is the smallest unit that consists of the relevant data. All the components of a row can hold either a data or a metadata. And, the key elements of a row present in Cassandra are as follows:

Row key
Column keys
Column values

30. What do you mean by column family in Cassandra?

A column family is nothing but a collection of rows in an ordered manner which can itself be denoted as an ordered collection of columns too. As per our needs, we can add any number of columns at any time to the column family.

Become a master in Cassandra Course

Get Cassandra Practical Assignments and Real time projects

31. What are the values that can be stored in the Cassandra column?

The values that can be stored in the Cassandra column are as follows:

Column Name
Value
Time Stamp

32. What do you know about the primary key and list the different types of primary keys too?

A column that is uniquely used to determine a row is known as the primary key and the three different types of primary keys are as follows:

Single primary key
Compound primary key
Composite partitioning key

33. Explain the different types of primary keys present in Cassandra

The simple definitions of the different types of primary keys are illustrated below:

The single primary key has only a single column which can be defined as a primary key.
The data will be initially partitioned and will be later clustered in a compound primary key.
In order to develop several partitions to a particular data, this composite partitioning key can be used.

34. Can you explain what partitions in Cassandra are?

Partition is nothing but a hash function used in Cassandra which is located on every node. It actually hashes the tokens that are represented from particular values of the rows that are being affixed. A partition can also be used to convert a variable input length into a fixed length.

35. List out the several partitioner types found in Cassandra

Partitioners in Cassandra are of different types and they are as follows:

Murmur3 Partitioner
Random Partitioner
Byte Ordered Partitioner

36. How is a static and dynamic CQL table different from each other?

A static table is actually identical to a relational database table that makes use of a static set of column names while a dynamic CQL table grants permission to the users to pre-compute the result sets and then stores those sets in a single row in order to make it simpler to retrieve data as and when required.

37. What should you highly consider while creating a table in Cassandra?

Primary keys that consist of one or more table columns are highly important and are mandatory while creating a table in Cassandra.

38. What are the prerequisites that you should look for while adding a new column in Cassandra?

Check to that all the below mentioned points are met while creating a new column in Cassandra:

The name of the new column name should not be similar to any other existing column names
Check to that the table is not limited to any particular compact option

39. What are the techniques available for Cassandra to write a data?

Listed below are some of the ways with which Cassandra can write a data at ease:

Commitlog write
Memtable write
SStable write

40. What do you mean by commit log in Cassandra?

A crash recovery technique that supports Cassandra in achieving its durability goals are known as commit log.

Looking for Cassandra Hands-On Training?

Get Cassandra Practical Assignments and Real time projects

41. Can you explain SStable in detail?

SStable can also be defined a “Sorted String Table” which is one of the Cassandra data files whose main ativity is to store the data that has been flushed by the memtable. SStable is different from memtable as it will not delete any data or it will not add any further information once the data is written.

42. List out what does SStable is comprised of

SStable in Cassandra is composed of the following:

Index file which includes Bloom filter and key offset pairs
Data file which includes the actual data of the column

43. Can you explain the characteristics of the bloom filter in Cassandra?

An off-heap data structure that is associated with an SStable to detect the availability of data in the SStable in order to perform specific I/O disk operations is the main functionality of the bloom filter in Cassandra.

44. What do you know about Memtable in Cassandra?

Memtable acts as a storage engine in Cassandra that holds the data written temporarily. It stores the data in the form of a key or a column. Each memtable has a separate column family and it extracts column data from any particular key that are specified.

45. How is write operation performed in Cassandra?

A write request when reached the node, the following operations will be done:

The request first enters the commit log where the data will be collected and saved in to the memtable
When the memtable storage is full, it actually flushes the data into the SStable
The writes in Cassandra will be partitioned automatically and it will be replicated in the clusters. Cassandra frequently consolidates and discards the irrelevant data from the SStables.

46. Can you explain the term “Snitch”?

The one that denotes to which rack or datacenter the specific node belongs to is known as the snitch.

47. Write some of the different types of snitch found in Cassandra

Snitch is of different types and some of them are listed below:

48. Whether Cassandra supports ACID transactions?

No, Cassandra does not support ACID transactions while relational database does it.

49. Can you explain the major difference between the column and super column in Cassandra?

Both column and super column are executed with the tuple concept that consists of both the names and the values but column has values in the form of string while super column is actually a map of columns that consists of several data types.

50. Explain the main function of source command in Cassandra

In Cassandra, the main aim of the source command is to run a file that consists of CQL statements.

Cassandra Interview Questions and Answers

Best Cassandra Interview Questions and Answers

Looking for Best Cassandra Hands-On Training?

Get Cassandra Practical Assignments and Real time projects

Become Cassandra Certified Expert in 35 Hours

Get Cassandra Practical Assignments and Real time projects

Become a master in Cassandra Course

Get Cassandra Practical Assignments and Real time projects

Looking for Cassandra Hands-On Training?

Get Cassandra Practical Assignments and Real time projects

Related Courses

Cassandra Training

MEAN Stack Training

MongoDB Admin Training

MongoDB Training

Node.js Training

Our Recent Blogs

AngularJS Interview Questions and Answers

AWS Interview Questions and Answers

Blue Prism Interview Questions and Answers

MongoDB Interview Questions and Answers

Python Interview Questions and Answers

Selenium Interview Questions and Answers

Leave a Comment Cancel Reply

Head Office

Trending Courses

Courses

Company

Company Policy

Work With Us

🚀Fill Up & Get Free Quote