not able to use webgraph command in nutch 1.2 - nutch

I am quite new to nutch . Thing is I have crawled a site successfully using nutch 1.2 .Now using cygwin I am working on crawldb and segments . Problem is when I am using webgraphdb command it is showing "Error: Could not find or load main class WebGraph". Please suggest me that what I need to do to use this command properly.

Related

Custom Plugin for hawtio 2.x

I am trying to write a custom plugin for hawtio 2.x.
The first step that I tried building the entire project (https://github.com/hawtio/hawtio) as instructed in https://github.com/hawtio/hawtio/blob/master/BUILDING.md
Later, I tried the example (springboot sample) located at https://github.com/hawtio/hawtio/tree/master/examples/springboot
The example seems to run fine, except for the fact that I do not see a trace of the sample plugin written in this application (https://github.com/hawtio/hawtio/blob/master/examples/springboot/src/main/webapp/app/sample-plugin/js/sample-plugin.js)
Right now, I am not trying to do fancy stuffs. I am simply trying to run this sample application and see how the plugin works.
Below is the screenshot of the web console. I have also checked in the browser console. I do not see any traces of the initialization of this plugin.
Am I missing something specific?

p:photocam disable warnings

I am using photocam from Primefaces to capture a simple photo.
When a computer that has no webcam access the web page, an error
"Webcam.js Error: Could not access webcam."
shows up. Is it possible to supress this warning when the computer does not have a webcam? How can I do it?
I am using Java, a Web Maven Project with JSF and primefaces 6.0.
The solution was to edit the js file related to photocam (photocam.js).
To do that, I used 7-zip to access the file (inside Primefaces jar, Maven Repository) and extracted it.
After the extraction and the modifications, I replaced the older file.
The modification was to return null instead of the error message.

error message of java.net.UnknownHostException for nutch 2.3

I am trying to crawl web data using nutch 2.3 on Linux Mint 17.2, but get the following error message
“Failed with the following error: java.net.UnknownHostException:”
I'd like to know what causes this error and how to resolve it. My observation is that this error sometimes comes with another error message "Couldn't get robots.txt." What are the relationships between these two errors and how to solve this? Thanks.
Usually this happens when Nutch can't resolve a given URL, does this happens with specific URLs? or when you run some specific Nutch commands?

Joomla Hello World tutorial not working

Short version:
I installed this Hello World component in my Joomla 2.5 environment and although everything went fine during the installation, I am getting a 500 server error when I am trying to access it.
Can anyone let me know whether loading the above mentioned component in their Joomla 2.5 environment let me know whether it is working fine for them? And if accessing this URL works (the hostname and Joomla path may have to be adjusted):
http://localhost/joomla/index.php?option=com_helloworld
At least I will know it whether it is a problem with my Joomla installation.
Longer version:
I read this Joomla tutorial: http://docs.joomla.org/J2.5:Developing_a_MVC_Component/Adding_configuration
and downloaded the code from the "archive" link on that page. I uploaded the component on my Joomla 2.5.14 installation, but the component does not seem to work. Well, the administrative part of the component works, but not the end-user part.
Here is how I access it:
http://localhost/~myusername/joomla/index.php?option=com_helloworld
However this page gives me a 500 error:
Error: 500
You may not be able to visit this page because of:
an out-of-date bookmark/favourite
a search engine that has an out-of-date listing for this site
a mistyped address
you have no access to this page
The requested resource was not found.
An error has occurred while processing your request.
Please try one of the following pages:
Home Page
Search this site
If difficulties persist, please contact the System Administrator of this site and report the error below..
Call stack
# Function Location
1 JSite->dispatch() /Users/myusername/Sites/joomla/index.php:42
2 JComponentHelper::renderComponent() /Users/myusername/Sites/joomla/includes/application.php:197
3 JComponentHelper::executeComponent() /Users/myusername/Sites/joomla/libraries/joomla/application/component/helper.php:351
4 require_once() /Users/myusername/Sites/joomla/libraries/joomla/application/component/helper.php:383
5 JController->execute() /Users/myusername/Sites/joomla/components/com_helloworld/helloworld.php:23
6 JController->display() /Users/myusername/Sites/joomla/libraries/joomla/application/component/controller.php:761
7 HelloWorldViewHelloWorld->display() /Users/myusername/Sites/joomla/libraries/joomla/application/component/controller.php:722
8 JError::raiseError() /Users/myusername/Sites/joomla/components/com_helloworld/views/helloworld/view.html.php:33
9 JError::raise() /Users/myusername/Sites/joomla/libraries/joomla/error/error.php:251
I am running this on OS X (10.8.4) with PHP 5.3.15 and MySQL 5.5.27.
Am I using the proper URL to call this component, or am I supposed to supply more query string parameters?
One thing I have noticed is that the tutorial is for Joomla 2.5, but the archive is for Joomla 1.6. Are Joomla 1.6 components supposed to work with Joomla 2.5? Joomla did not complain when I installed the component though.
Francois,
welcome to the world of Joomla. You are correct when you say that joomla 1.6 components should work fine on a 2.5 but that's not all true. There has been many complaints on this issue and it seems that some components are not compatible with higher/lower Joomla versions.
This is due to some sort of conflict inside of the code and different calls made/used to the components.
I believe it would be possible to fix these fatal errors since I have gone through this myself but you would have to go through the code and killing the code to see where it passes and where not and change those calls which are not compatible.
Sorry for not having an 100% solution but this could give you an idea of what the problem may be.

Documentation for bin/nutch elasticindex

There is a lot of documentation and examples using the bin/nutch solrindex command, but the bin/nutch elasticindex command is lacking in coverage. I am struggling to combine an instance of Nutch 2.2.1 with Elasticsearch 0.90.2. I've tried to use this plugin to disguise Elasticsearch as a Solr instance, but any bin/crawl jobs crash from internal server error. What I am looking for is an example of bin/crawl modified to use Elasticsearch or a detailed description of the bin/nutch elasticindex command (the nutch wiki doesn't have a page for it). Can I simply replace every occurrence of the phrase solrindex with elasticindex freely?
I've modified bin/crawl to remove the bin/nutch solrdedup command, and replaced all mentions of solrindex with elasticindex.
I don't think it's possible to make Nutch 2.2.x work with Elasticsearch. But I don't see the added benefit of 2.2.x compared with 1.8. The only thing is that Nutch 2.2.x uses Gora to save the crawled pages in a database of your choice. Since you are using Elasticsearch to index the results I assume you don't need the database.
I made Nutch 1.8 with Elasticsearch 0.90.11 and you can find the bundle on my GitHub account:
https://github.com/andreivisan/NutchElasticsearch

Resources