Properties supported in this sink are shown below (* indicates a required field).
| Property | Description |
| --- | --- |
| Name* | Name of the data sink. |
| Description | Description of the data sink. |
| Format* | The value of this field is always 'org.apache.spark.sql.cassandra'. Default: org.apache.spark.sql.cassandra |
| Processing Mode | Select for batch, deselect for streaming. Selecting 'Batch' sets the switch to true; selecting 'Streaming' sets it to false. Default: true |
| Connection* | Pre-defined Cassandra connection. |
| Table* | Cassandra table name to write to. Example: table_test |
| Select Fields / Columns | Comma-separated list of fields/columns to select from the inputs to the sink. Example: id, name, city, state, zip. Default: * |
| Output Mode | In batch mode, the value must be one of Append, Overwrite, ErrorIfExists, or Ignore. In streaming mode, it must be one of append, complete, or update. Default: Append |
| Partition By | Comma-separated column names to partition by. Example: year, month, day |
| Output Batch Grouping Buffer Size | Number of batches per single Spark task to be stored in memory before sending to Cassandra. Example: 500. Default: 1000 |
| Batch Grouping Key | Determines how insert statements are grouped into batches. Default: partition |
| Output Batch Size | Maximum total size of a batch in bytes. Overridden by spark.cassandra.output.batch.size.rows. Default: 1024 |
| Output Batch Size Rows | Number of rows per single batch. The default is 'auto', meaning the connector adjusts the number of rows based on the amount of data in each row. Default: None |
| Output Concurrent Writes | Maximum number of batches executed in parallel by a single Spark task. Default: 5 |
| Enable Pushdown | Enables pushing down predicates to C* when applicable. Default: true |
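For reference, a minimal sketch of how these properties map onto a batch write with the Spark Cassandra Connector. The session setup, input path, and keyspace ('test_ks') are assumptions for illustration; in the sink itself, connection details come from the pre-defined Cassandra connection.

```scala
import org.apache.spark.sql.{SaveMode, SparkSession}

// Hypothetical session setup; the host would normally come from the
// pre-defined Cassandra connection rather than being set here.
val spark = SparkSession.builder()
  .appName("CassandraSinkBatch")
  .config("spark.cassandra.connection.host", "127.0.0.1")
  .getOrCreate()

// Placeholder input; "Select Fields / Columns" becomes a plain select.
val df = spark.read.parquet("/path/to/input")
  .select("id", "name", "city", "state", "zip")

df.write
  .format("org.apache.spark.sql.cassandra") // Format
  .mode(SaveMode.Append)                    // Output Mode (batch)
  .option("keyspace", "test_ks")            // assumed keyspace
  .option("table", "table_test")            // Table
  // Tuning options from the table above (values shown are the defaults):
  .option("spark.cassandra.output.batch.grouping.buffer.size", "1000")
  .option("spark.cassandra.output.batch.grouping.key", "partition")
  .option("spark.cassandra.output.batch.size.bytes", "1024")
  .option("spark.cassandra.output.concurrent.writes", "5")
  .save()
```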
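The streaming case (Processing Mode deselected) is similar, assuming a connector version that supports Structured Streaming writes (2.5+) and a streaming DataFrame whose schema matches the target table; the checkpoint location and trigger interval below are hypothetical.

```scala
import org.apache.spark.sql.streaming.Trigger

// streamingDf: any streaming DataFrame whose schema matches table_test.
val query = streamingDf.writeStream
  .format("org.apache.spark.sql.cassandra")
  .outputMode("append")                                     // Output Mode (streaming)
  .option("checkpointLocation", "/tmp/cassandra-sink-ckpt") // required for streaming sinks
  .option("keyspace", "test_ks")
  .option("table", "table_test")
  .trigger(Trigger.ProcessingTime("10 seconds"))            // hypothetical trigger
  .start()

query.awaitTermination()
```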