I have a simple HTML site with 100+ pages or so. I want to add a search bar at the top so the user can search the site. I know about Google Custom Search, but it shows ads unless you pay at least $100. Obviously I'd like ad-less search on my site for free if at all possible!
I've also heard about Lucene/Solr, but they do not actually crawl the site. For that I would apparently need Nutch.
Anyway, the site I have runs on a Microsoft IIS6 server, but I have basically no knowledge as to how Solr, Nutch, etc. gets "installed" on the server.
Also: I'd like to point out that I do have a local copy of the site. Perhaps I can do one big initial nutch "crawl" locally that will create an .xml for Solr?? That would help me get "up and running", but probably wouldn't be a good long-term solution.
..so should I just use Google Custom Search? or is there a not-extremely-painful-to-implement alternative? The brain hurts folks.
You did not mention how many search requests you want to handle but if you use the json-rest-api of google's custom search you have 100 searchqueries a day for free and you can display them without any ads on your page.
An simple example request can be found here.
Here is an easy way that works pretty well, although you may be looking for something more than this.
http://sitecomber.com/getsitecomber/
You can create code to paste into your site in about 2 minutes. It doesn't get easier than that. Search is powered by Google, but results are isolated to your website.
EDIT: This no longer works.
Related
Greetings fellow developers,
I would like to ask for help regarding the following problem: Is there a way to request removal of stored website data from search engines? Most of the links that show up when searching my domain are old and non-existent.
What I've found from personal research regarding this question/problem:
From my personal research I have found that removal requests can be made individually to the well-known search engines such as Google, Yahoo and Bing, but this is not what I am looking for, since I am well-aware that it would take a lot of time for the requests to be processed and the removal of the data to be done. Also, I wasn't able to find this "removal-request" webpage for the other search engines.
To be more precise/clear...
... I want to request this website-data-removal to all (most) search engines at once, so that when I upload my new website (to the same domain), working and functional links (URLs) would be displayed. Can this be anyhow achieved and, if so, how? Also, how much time would it take for this removal to be finished?
Hope my question is clear enough, and any answer/help would be very much appreciated.
No, there is not a way to do this for all search engines at once. You will have to request it from each site individually. As for the smaller search engines you can try and find any contact information or customer support however their is a chance they will ignore your request (heck, some sites ignore the robot.txt file and just search your site anyways... it's just a part of being on the web).
How do i actually start making a website? Every time I think of making a website I stuck as from where should I start?Where to start writing a code?How to make it interactive? Hoping to get the answer that help me.
I have tried in the past but I really want to make a website of my own as I am looking to start my online company.
Use wordpress. It is content management system with milions of themes and plugins to do whatever you want for your own web site.
Hi I am running a delivery shop and using delivery.com for my shop.
I also want to have my own website which user can order service or choose delivery time on the website like delivery.com does.
I wonder if is it better to develop from scratch or can I use wordpress and customize it? or are there plugins or widgets I can use for my website?
Please give me some advice what will be the best way to build my website.
Thank you.
Well technically anything is pretty much possible in Wordpress but it won't happen out of the box, unless you already have a theme in mind which serves this purpose. I won't be surprised if there is a theme out there which you can use to do this.
Personally I would build this from scratch as you would be able to build exactly what you want and you can build it with scalability in mind as well.
Wordpress tends to fall off when the site starts getting A LOT of traffic.
Check out Ruby on Rails, you would be surprised how at how simple it can be to build something like this.
The server will go down (if) when your site will visit 1000+ persons, because one of the problem of universal script that can be used (in opinion of creators) every where is that it make 100 sql queryes to display one simple page.
I'd like to code a website where you can find search results from many websites.
So my question is, if this scenario is possible and if yes, if you guys have any suggestions how I would be able to do this.
Here my workflow:
I search for something on my website. For example: "asdf"
My code then executes the search from the other website. for example:
https://www.google.ch/#q=asdf&safe=images
Then there will be shown some results, of course. But how can I directly take the results and show them on my website, without opening the other website?
I have to say, that the websites I'm looking for, haven't got any API for that.
I probably wouldn't recommend to scrap a web page directly in the client.
I'm not even sure if you can achieve it easily without getting some Cross Domain Policy problems anyway.
A solution like APIfy might help you doing what you want:
http://apify.heroku.com/resources
Or you can still make your own server site API "layer" for this particular website?
Keeping in mind that scrapping a web page is always a fragile process where the format can chance at any moment.
The search feature on the site seems pretty awful.
Are there any external sites that do a better job of categorizing projects with tags, etc?
Or maybe I'm just not using GitHub correctly?
Have you tried a Google search with site:github.com included in the query?
I haven't tried this, but I understand that very often Google does a better job of searching a website than the site's own search tools. Have you tried that?
Go to their advanced search page and fill out github.com in the "Only return results from this site or domain" slot.
GitHub indexes a tremendous amount of data (50+ million projects). Taking a moment to understand their search syntax should help you narrow your search.
hubscovery.com is much better.
GitHub's search functionality is terrible. I'm all for everyone having their own opinion, but for such a well-done site in nearly every other way, there is no excuse for lacking a basic sort option. It is to their service.
EDIT: Hubscovery is dead! Long live GitHub search!