Removing HyperLinks from search result content (Solr 3.4.0) - search

I'm working on Solr project.. When I search for a word say (google) ,the results will be something like this:
Google
[Sign In] [About us] google is a search engine.... etc
www.google.com
As you can see, in the search results I need to remove the Hyperlinks (About us and Sign in ) from the content.. How can I do this with Solr.
Thank you in advance.

Solr would return you back what you feed into Solr.
So if the link are stored into the field, they would be returned with the response as is.
The filtering part needs to be handled at client side.

Related

does filetype equals pdf in google cse list work in python

I am using google custom search through the api google_api_client python. I want to retrieve the pdf documents for a particular query. Below is the function that does it.
def query_results(service,q=None,startIndex=1,siteSearch=None,fileType=None):
return service.cse().list(
q=q,
cx='000906600611484344115:o9lfdh9y1m5',
start=startIndex,
siteSearch=siteSearch,
fileType=fileType,
safe='off'
).execute()
When i call the above method passing q="alienware", filetype="pdf" and siteSearch="google.com", I get hardly one result. But then when I type the same query filetype:pdf alienware in google search through browser, I got plenty of results. Did I do any wrong ? or Is there an issue with the api itself ?.
When i say I got plenty of results, I mean the below image
Below is the configuration of custom search
Specifying siteSearch="google.com" means you will only search pages on google.com. It is equivalent to searching for site:google.com on Google. Remove the siteSearch and you should get the expected results.

Sharepoint 2013 Asset Library Search isn't working

I'm new to sharepoint so I'm hoping that this is a simple user error:
I'm working in SharePoint 2013, and have added the Asset Library App. I have uploaded about 15 images into the Asset Library app page. They all show up fantastic. I have tagged keywords in them with Adobe Bridge before importing them to Sharepoint. The tags carried over into SP perfectly and are all showing up correctly in the document information under the field for Keywords.
When I try to search for my documents using the Sharepoint search bar, nothing shows up. I have tried search for a single keyword, multiple keywords, document title, document type, a single letter, everything! And yet there are never any results found.
I then found an option to turn on Enterprise Keywords. I did that, and copied my Keywords field over to "Enterprise Keywords" Still no success. No results found for anything at all.
What am I doing wrong?
Ondrej was correct, No crawl had been performed on the new data yet.
Like I said, I'm very new to SharePoint, and didn't know that it was set up to crawl only once a week.
It's been crawled now and the search feature works great!
To search for property saved under Keywords, type keywords: before your tag word, i.e. if you're searching for all items containing keyword Cat, type in the search box:
keywords:Cat

Search keyword from google to my site

If user search a particular keyword in google and google returns set of results with url. Now if user comes to my website after clicking any result, then how can I know that search keyword.
In bing and yahoo I am able to get this using "q" and "p" respectively. Like "http://www.bing.com/search?q=......" and "http://in.search.yahoo.com/search......?p=......&.....".
But in google I am unable to find any parameters to get search keyword. I am using php.
Please help.
EDIT:
It is not possible to get the search string once you click on a search result.
if I search for stackoverflow.. this is what I get.. link not showing up properly in a comment..
so adding an answer.. you should be able to "?q=" as well for google
https://www.google.com/search?q=stackoverflow&rlz=1C1CHFX_enUS524US524&oq=sta&aqs=chrome.3.69i60l3j69i59j69i57j69i65.2857j0&sourceid=chrome&ie=UTF-8

How to place search query in the URL?

With a lot of search engines, you can find the string you are searching in the URL.
However, http://drugcompare.destinationrx.com/Home.aspx does not let me do this. When I search something, the resulting URL is http://drugcompare.destinationrx.com/DrugCompare.aspx no matter what.
Is there any way I can find out whether I can search the website by adding something to the end of the URL, like "?query=searchstring" instead of using the form provided on the page? Basically I need a unique URL.
that website you pointed at uses POST to send data for its search query which means you wont be able to see or append it on the URL bar. The reason for that is either for security or the search query it generates is a complex object or too long and does not fit in a url. websites such as search engines uses GET, with that you can append your search query in the url by following the syntax it generates.

How would I best make this SEO_able?

I have a search engine that searches albums.
For each music album, I have a page.
So, the work flow goes like this:
People search for music titles
The search engine displays a list of albums.
People click on an album to go to a details page.
I want google to index my front page and the details page. I want the details page to be highly ranked. How can I build a sitemap for this?
By the way, I have about 5 million albums (but I want the top 1000 ones to be highly ranked on google)
You would not use a sitemap for that many results. You would want each album to appear as a page with a unique URI to reference that page. That way the search engine can crawl your site by crawling links since search bots cannot submit form data. Each of those URIs should be simple, meaning limited to this part of the URI syntax:
scheme://authority_segment/path
Program your web application to remove and throw away any extraneous data, such as query string or parameters. If you do this you have to be sure that you are watching for URI poisoning or SQL injection even through means of character encoding.
How can I build a sitemap for this?
By pulling the addresses out of your database and creating a XML file with a high priority for some selected pages. Somehow I think that isn’t your real question …
If I wanted to automate building a site map for a site like this, I'd employ Python. I'd pretty much write everything from the ground up (except the data store access). The format is quite simple.
I'm not sure I quite understand your question...

Resources