How do I run cassandra-stress against AWS Keyspaces with Sigv4 authentication? - cassandra

I want to use Cassandra-stress tool to load test on AWS keyspaces but can't figure out how to use sigv4authprovider. Can someone please help with the steps?

Not sure if it is helping, but you can define a service account with classic authentication (login, password) against keyspaces. This is the best way to test cassandra tools/libs compatibility. SIGV4 would require some extra work with actually overwriting the way said libs/tools create their connection.

Related

How to connect to Flink SQL Client from NodeJS?

I'm trying to use Apache Flink's Table concept in one of my projects to combine data from multiple sources in real-time. Unfortunately, all of my team members are Node.JS developers. So, I'm looking for possible ways to connect to Flink from NodeJS and query from it. In Flink's documentation for SQL Client, it's mentioned that
The SQL Client aims to provide an easy way of writing, debugging, and submitting table programs to a Flink cluster without a single line of Java or Scala code. The SQL Client CLI allows for retrieving and visualizing real-time results from the running distributed application on the command line.
Based on this, is there any way to connect to Flink's SQL client from NodeJS? Is there any driver already available for this like Node.JS drivers for MySQL or MSSQL. Otherwise, what are the possible ways of achieving this?
Any idea or clarity on achieving this would be greatly helpful and much appreciated.
There's currently not much that you can do. The SQL Client runs on local machines and connects to the cluster there. I think what will help you is the introduction of the Flink SQL Gateway, which is expected to be released with Flink 1.16. You can read more about that on https://cwiki.apache.org/confluence/display/FLINK/FLIP-91%3A+Support+SQL+Gateway
Another alternative is to check out some of the products that offer a FlinkSQL editor on the market, maybe that is useful path for your colleagues.
For example:
https://www.ververica.com/apache-flink-sql-on-ververica-platform
https://docs.cloudera.com/csa/1.7.0/ssb-overview/topics/csa-ssb-intro.html
Note that this is not exactly what you asked for, but could be an option to enable your team.

Using Cassandra online - no installation

Is there a simple way to use and explain Cassandra without having to install it.
I mean a solution that with be ready to exploit over the cloud or whatever (like the ready to use compiler for sql in W3schools for example)
Astra DB is a Cassandra-as-a-service offering which has a free tier you could use exactly for that purpose.
You can launch a Cassandra cluster on the cloud in literally just 5 clicks and there's no credit card required to create an account. Cheers!

How to create a mock Oracle ADW database in Python for integration testing?

I am currently migrating data from AWS Redshift to Oracle ADW. I use postgres to create a mock database and run integration tests to simulate how my queries would run in production environment. Postgres is a good candidate for mocking Redshift database as they are similar, but that is not the case for Oracle ADW. I'm wondering if anyone has suggestions on how I could create a mock database that has the same syntax constraints as Oracle ADW.
I already know how to create mock connections to write unit tests. However, these integration tests help us validate pipelines end to end.
I would recommend using the Oracle XE database. It is free to use, is "real" Oracle, and supports most basic and several advanced features, with some resource limitations. It can be deployed in a variety of ways; not sure what would work best for you, but you can check it out here: https://www.oracle.com/database/technologies/appdev/xe.html

stubbed cassandra for data storage

I need an embedded cassandra for my project and I was wondering if I can use Stubbed Cassandra for data storage. Because I need a system to simulate CQL requests and responses.
Thanks everyone.
You cant use it as a real datastore. Use real cassandra as a real cassandra datastore. check out ccm which is probably more what your looking for.
There are wrappers for it in dtests (python) and the java driver uses it for testing and has a java wrapper.
I don't really have any experience at all with SCassandra but I worked on several projects using Apache Cassandra and there are some use cases like multidatacenter infrastructure to experiment and I don't think SCassandra can do it. So if you plan to do simple tests, that's fine, But advanced use cases really need to be tested in a real cassandra distribution.
As others have mentioned, you will need the real Cassandra for data storage. However, if you want to test CQL requests/responses then you can use this library:
Cassandra-Spy
It runs an actual embedded Cassandra and also can simulate failures for inserts/selects. This helps you test your app's behaviour in failure cases. I wrote the library to address this specific use case.

Cassandra stress tool authentication

I can't seem to get cassandra-stress working with authentication. I'm using simple username/password authentication. When I run cassandra-stress, I get auth failures (naturally). Anybody know how to specify username / passwords for stress-tool. I'm trying out apache cassandra 2.1.0-rc4 btw.
I know this is an old one but cassandra-stress got authentication support in 2.1.3 with CASSANDRA-7985.
I turned off authentication (stopped Cassandra, in conf/cassandra.yaml changed authenticator value from PasswordAuthenticator to AllowAllAuthenticator, then started Cassandra), then cassandra-stress tool worked.
Maybe it just doesn't support authentication? That would be a little disappointing (version 2.0.9)
Ben

Resources