Amazon Cloud Search Experience - search

I might get flagged down by this question.. but still will give it a shot..
Since Google Site Search is going out of business and we are not interested in the free version of it - We decided to go with the Amazon Cloud Search option. The challenge though is - it is not straight forward. We have to build a crawler and there are some features that needs to be custom built.
I am trying to see examples where websites have used ACS and worked but i am not able to find anything good.. Have anyone tried using Amazon Cloud search for their Website search. Our website has around 15000 plus pages.
We are .net based solution - so i am thinking to write a crawler.. extract content on nightly basis and send it to Amazon. Would it be the right way?

ACS is based on Solr. If your site is under your control, i think the first step is extracting all useful content out and generate them into xml/json files, then use AWS CLI upload these documents to ACS. ACS has REST APIs to let you to get the query result. You need to define indexes before uploading them.

Related

simple google analytics way to count node.js site visits

I have a node.js website and I seek a simple way to know if anyone is visiting it. I have google analytics 4 on my google account panel (I created that account with google), and I want my node.js website to report site visits to it (like people are doing with Wordpress).
In short, I want to go up to my google analytics panel on google and view the site visits (of a node.js website). I am not trying to go the other way of using node.js to pull data from what I think is a google analytics API. I am not trying to clone the google analytics dashboard on a site.
This is my website.
https://rocky-temple-91974.herokuapp.com
In google analytics stream I have the Measurement ID copied and ready. Do I have to add code to my node.js website and paste in the Measurement ID? Is there a node package I need to install in my site code?
Remember I have google analytics 4 not Universal.
Finally, I see everyone knows how to do this with Wordpress and Shopify. Wordpress has a plug-in that allows you to paste in the Measurement ID. I am thus wondering why no one says or knows anything about using google analytics 4 with node.js. With node.js it seems no one is trying to view the site visits by logging into google analytics and using the google tool.
If google analytics account viewing does not work with node.js I would like to have a code snippet for node.js that would say save counts to a file in my node.js public folder. I am using mongoDB Atlas and another way would be to save visitor counts to mongoDB. Any hack like that would be appreciated.
I have not had any luck finding how to use google analytics for node.js (everyone knows how to use Wordpress to do it).
Thanks everyone! I feel this is a worthy question to ask.

How can I scrape content from a Website with AlchemyAPI?

I would like to scrape Content from a website with AlchemyAPI. I get informed about this feature on http://www.alchemyapi.com/api/scrape/qlang.html
I will implement it in the same way as in the example:"Querying Inside Tables (Selecting a Column Inside a Specific Row)".
Could somebody please help me, how to use this in Node.js and Cquery? Which parameters do I need to get specific fields like price as output?
No, it's not currently possible to do this. Since AlchemyAPI was acquired by IBM, the remaining services have been incorporated into Watson. Most of the AlchemyAPI services are now covered in the Natural Language Understanding (NLU) service: https://www.ibm.com/watson/developercloud/doc/natural-language-understanding/ but there is no feature that allows you to scrape content from a web site per se.
The NLU service does allow you to retrieve text from a web page using the analyze endpoint: https://www.ibm.com/watson/developercloud/natural-language-understanding/api/v1/#post-analyze

Add search feature to simple website without mySQL database

I have a simple HTML site with 100+ pages or so. I want to add a search bar at the top so the user can search the site. I know about Google Custom Search, but it shows ads unless you pay at least $100. Obviously I'd like ad-less search on my site for free if at all possible!
I've also heard about Lucene/Solr, but they do not actually crawl the site. For that I would apparently need Nutch.
Anyway, the site I have runs on a Microsoft IIS6 server, but I have basically no knowledge as to how Solr, Nutch, etc. gets "installed" on the server.
Also: I'd like to point out that I do have a local copy of the site. Perhaps I can do one big initial nutch "crawl" locally that will create an .xml for Solr?? That would help me get "up and running", but probably wouldn't be a good long-term solution.
..so should I just use Google Custom Search? or is there a not-extremely-painful-to-implement alternative? The brain hurts folks.
You did not mention how many search requests you want to handle but if you use the json-rest-api of google's custom search you have 100 searchqueries a day for free and you can display them without any ads on your page.
An simple example request can be found here.
Here is an easy way that works pretty well, although you may be looking for something more than this.
http://sitecomber.com/getsitecomber/
You can create code to paste into your site in about 2 minutes. It doesn't get easier than that. Search is powered by Google, but results are isolated to your website.
EDIT: This no longer works.

Is there any Web API for Google web clipboard?

Google Docs have a feature "Web Clipboard" to help users easily copy and paste cross google documents. (you can refer to https://drive.googleblog.com/2010/02/a-web-clipboard-for-google-docs.html)
But I am wondering that if there's any Web API or official way to access those API.
I try to find in Google Document List APi (https://developers.google.com/google-apps/documents-list/), I can't find any related stuff.
The original goal for me is I want to copy some screenshot/images/text from my Windows OS.
And I want to paste into my GoogleDoc Document.
But it can't be done unless the screenshot/image/text is update/upload into Web Clipboard.
I keep finding any related integrated tools or extension, but I still can't find anyone.
Could someone give me some suggestions?
There is no official way to access this API, sorry.
There is clearly an API of sorts, just not documented. I'd suggest looking at extensions like this one and trying to reverse-engineer the protocol so you can see what endpoints it is calling.
But it sounds like a lot of work.

Adding search for a private website

I have a login-protected website. It's an internal application and it's not avaiable to the general public hence it's not indexed by any search engine.
My application is developed on the Google App Engine.
I would like to add a search engine but obviously without the need to public index it. There's any solution avaiable from Google/Bing/Others for a situation like this?
Have you done this before? What solution did you chose and what are yours results?
Well Google has the Google Search Applicance which is basically a blade server which lives on your internal network and creates a "private" index. But this is meant as an enterprise caliber solution (translation: expensive).
Which framework is your website running on? You may be able to find an indexing/search module.
To answer the latter part of the question... I've used Xapian in a Django based website (via the djapian adapter). It basically creates a full-text index. Results are maybe not as good as Bing or Google would generate but it's still quite good; easy to use API too.

Resources