where do i find dse.jar for cassandra-spark join operation? - apache-spark

i'm working on cassandra joins using sql. i found that dse.jar is needed for the operation but couldn't find the path to download the jar. i googled and found that we need to maually download and add it to the project, but no one has said where to download the jar from.
[http://docs.datastax.com/en/datastax_enterprise/4.6/datastax_enterprise/spark/sparkJavaApi.html?scroll=sparkJavaApi__dsejar_unique_1][1]

You can't download the jar individually. It's usually under /usr/share/dse/dse.jar in package installations of Datastax Enterprise

Related

Why does HDInsight cluster does not come with pre-installed Scala?

on HDInsight's masternode, $scala -verion returns an error. It is easily installed via
$apt-get install scala
but shouldn't scala be installed there by default?
Thank you for suggestion. What's the scenario where you need scala to be directly installed on the node? For example, in spark there are couple of other common scenarios that already work:
Running Spark commands in command line. This is accomplished through spark-shell which has built-in scala interpreter.
Building spark project. This is ussually done through maven or sbt project definition file. Those tools would automatically download correct scala version and compiler based on the project dependencies.
As you said it's not hard to preinstall scala, but we would like to understand the need to do that. In the discussions with customers this didn't come up before.

Unable to find custom indexer class 'com.stratio.cassandra.lucene.Index'

I am using apache-cassandra-3.0.10. I have placed cassandra lucene jar with version 3.0.10.3 in cassandra lib folder. When i am trying to create lucene index it is showing the message Unable to find custom indexer class 'com.stratio.cassandra.lucene.Index'. As per the lucene documentation 3.0.10 jar is compatible with cassandra version 3.0.10. Then why this error is occuring. Can any one help me out of this please?
Put the stratio lucene jar into all of your cassandra node's lib folder and
Restart all the node.
The cassandra lucene jar 3.0.10 was downloaded from maven repository and it was broken. I generated own jar file from their github repository and it was working fine

Where to download Spark SQL's spark-sql_2.10-1.3.1.jar from?

Please bear with my dumm question.
When I download Spark package (spark-1.3.1-bin-hadoop1.tgz)I don't see the Spark SQL JAR (spark-sql_2.10-1.3.1.jar) under lib folder. Where can I get that? Isn't included by default on the spark package?
Please help.
You can download the jar from:http://mvnrepository.com/artifact/org.apache.spark/spark-sql_2.10/1.3.1

External Authentication for Cassandra in DSE 4.7

We are trying to implement external authentication to Cassandra on DSE 4.7. Followed few of the guides where we have to extend IAuthenticator class but after doing that there is less documentation on how to integrate.
Is it more of plug and play where we extend IAuthenticator class build a jar and place it in lib(/usr/share/dse/resources/cassandra/lib) and change the yaml file accordingly or is it take a source code from Github build entire tree and then use?
If so is Datastax's Cassandra available on Github?
What do we need to do to build external authentication other that LDAP and Kerberos in DSE 4.7?
extend IAuthenticator class build a jar and place it in
lib(/usr/share/dse/resources/cassandra/lib) and change the yaml file
accordingly
^^ yes, this is the right approach.
Datastax's Cassandra available on Github?
Not exactly. You'll see the version of c* that ships with DSE in the release notes, you can check the source in the apache/cassandra github and it will match (up to and excluding the build number). The exact c* build under DSE will have some critical patches from future versions and that exact source code is not avaliable. However, the dot release in apache/cassandra is good enough for all intents and purposes.
I.E. look at https://github.com/apache/cassandra/tree/cassandra-2.1.8 for 4.7.1
As mentioned by #Mikea we need to override ISaslAwareAuthenticator and while using Cassandra in DSE need to be very sure of Cassandra version and then dig into appropriator Github repo.

CouchDB, Fauxton - installation of, locating files

I'm using CouchDB 1.5 on Ubuntu Desktop 14.04. I'm having trouble understanding how the Fauxton app got installed (as I don't recall installing it.)
I found the UI at [myhostip]:5984/_utils/fauxton/ but I can't locate all of the files on disk. So the questions are:
1) I'm thinking the Ubuntu Software Center installed some sort of package, but wondered if someone could confirm that?
2) Where the application files on disk?
Fauxton is a part of couchdb, in fact it is the default user interface for 2.0. Most likely your fauxton is in
/usr/local/share/couchdb/www/fauxton
to find out for sure open your your config file
http://localhost:5984/_utils/config.html
and look for
__utils in httpd_global_handlers section and it will tell you where couchdb utilities are installed. In the utilities directory you should see a fauxton folder which contains the fuaxton files.
Don't worry though no third party installed it. It is a part of couchdb package. Here is the repo on github.
Went further based on guidance from #Akshat; it turns out that the Fauxton application code was on disk in /usr/share/couchdb/www/fauxton/js in a file named require.js. All code and templates were there, minified and munged.

Resources