I would like to use Google to search for certain platforms. The platform I want to search for today is Big Commerce. I just want a list of websites that are using Big Commerce.
I have found a common ground between bigcommerce sites. Most if not all use Bigcommerce for the nameservers. They also have this script in their head tags http://cdn2.bigcommerce.com
I have searched for "http://cdn2.bigcommerce.com" but don't get many bigcommerce sites. How can set a search query within google to find sites that use Bigcommerce for nameservers or have http://cdn2.bigcommerce.com within their head tags?
Experiment with search queries such as:
inurl:http://cdn2.bigcommerce.com
inanchor:http://cdn2.bigcommerce.com
fashion inurl:http://cdn2.bigcommerce.com
Can combine inurl: or inanchor:, e.g.,
inurl:http://cdn2.bigcommerce.com OR inanchor:http://cdn2.bigcommerce.com
inurl:http://cdn2.bigcommerce.com | inurl:http://cdn.bigcommerce.com
where OR and | are synonymous for search for either this or that and can be contained within parentheses if your queries get really complicated, e.g., (this OR that) ("dog house*" | catbird")
Related
We have many different documentation sites and I would like to search a keyword across all of these sites. How can I do that?
I already thought about implementing a simple web scraper, but this seems like a very ugly solution.
An alternative may be to use Elasticsearch and somehow point it to the different doc repos.
Are there better suggestions?
Algolia is the absolute best solution that I can think of. There's also Typesense and Meilisearch of course.
Algolia is meant specifically for situations like yours, so it even comes with a crawler.
https://www.algolia.com/products/search-and-discovery/crawler/
https://www.algolia.com/
https://typesense.org/
https://www.meilisearch.com/
Here's a fun page comparing them (probably a little biased in Typesense's favor)
https://typesense.org/typesense-vs-algolia-vs-elasticsearch-vs-meilisearch/
Here are some example sites that use Algolia Search
https://developers.cloudflare.com/
https://getbootstrap.com/docs/5.1/getting-started/introduction/
https://reactjs.org/
https://hn.algolia.com/
If you personally are just trying to search for a keyword, as long as they're indexed by Google, you can always search with the format site:{domain} "keyword"
You can checkout Meilisearch for your use case. Meilisearch is a Rust based and open sourced search engine.
Meilisearch comes with a document scraper tool ( https://github.com/meilisearch/docs-scraper ) that can scrape content and then also index it.
While using it you need to define what exact content you are searching for in the configuration file for the scraper tool. And then you can run the tool using Docker.
I cannot find any option to achieve a verbatim azure/cognitive/bing Web search.
In my case the difference is trying to sift through tens of millions of irrelevant search results to find the 10 results that actually match my query literally.
Even though I am a paying customer, there is no support available. And the API documentation did not help either.
I would think it should be super easy to provide a verbatim search option. Is there one that I did not see?
I checked further and it seems for the Bing Search APIs - +"phrase" works and returns documents containing this phrase at the top. Just add + in front of what you have been trying. Support link is here: https://azure.microsoft.com/en-us/support/plans/.
Having had the luxury of having my fun and earning my bread well away from MS products for years, I am today trying to programmatically search with Bing Azure (wearing gloves) basically because I thought getting a google api was complex. So I headed down Data Market and issued this (let's say with perl's LWP which has been used to pass credentials):
https://api.datamarket.azure.com/Bing/Search/v1/Composite?%24skip=0&%24top=10&%24format=json&Sources=%27web%27&Query=%27abc%27
which works.
What I am now trying to find out is
1) how to tell the Bing search api to restrict results to a specific domain (e.g. ".org" or even a single website "www.wikipedia.org").
2) how to tell the search engine to restrict results to a specific filetype, e.g. 'PDF', 'XML' (or PDF and XML if that's possible)
3) if there is a simple list of the features/keywords in the GET request of the latest bing search API. Please no MS links if you please - i am really tired.
I have seen "site:.org" working on the bing search website when doing a manual search. And read about "filetype:pdf" working too.
Any hints?
bliako
cracked it:
... Query=%27abc site:.com filetype:pdf%27
at the point when m$ realises it costs to be clumsy
bliako
I have a group of websites I want to check daily for new content and I'm not sure what the best way is. I'm hoping one of you can help me.
With Google Custom Search, I can search a group of websites -- but what I want is to find any content posted in the past 24 hours, not just content related to a specific keyword. I've tried searching with no keyword and I get no results.
With regular Google Search, I can choose a single site (site:www.example.com), use search tools to limit the results to the past 24 hours, enter no keyword and find anything that's new. But that only works for one site at a time, as far as I can tell.
With Google News search, I can find new content from multiple sites -- but that only works for news sources. If I enter nytimes.com, it works; if I enter dcenr.gov.ie/ I get nothing.
Any ideas on another way to approach this?
You can try creating a RSS feed for the webpages and then using a RSS reader to check for updates.
Is it possible to use site: and allintitle: in a google search at the same time? I want to find results with various search terms in the Google returned page title but restricted to a specific site. I know how to do either one but not both together and can't find anything at all looking online? All help shows you is how to do each separately.
No - the allintitle operator can not be used with other operators.
You can use multiple intitle operators instead.
For more on google searching:
http://en.wikipedia.org/wiki/Google_hacking
although allintitle:, allinurl, allintext:, & allinanchor: are exclusive, you can use multiple intitle: & site: filters in the same query to achieve almost the same result as combining them with other filters, e.g.,
intitle:(dog house) (site:cnn.com | site:*.gov)
intitle:((cat | dog*) AROUND() people) (site:cnn.com | site:*.gov)