Why is the sitemap not Fetch in search console? - search

I have a site that is written with php and now when I try to connect the site map to the search console, it gives a coudnt fetch error. The site is already WordPress and the site map itself has no problem.
site address
site map

You can create a XML sitemap of your website upload the file in Search Console

Probably, Google's crawlers do not access URLs correctly
you should use a dynamic sitemap.

Related

Drupal homepage showing the node/ in search results

I have launched a site built in Drupal 8. When building the site the node/2 was assigned as the home page.
Now when I search for the site in Google I see in the results the node in the URL:
e.g. www.domain.com/node/2
I've never seen this before. Is there a way to not get /node/2 indexed?
you can use drupal's pathauto module [https://www.drupal.org/project/pathauto]
where you can set your url by using token also.

New Google Sites in Google Search results

The question applies to only Google Sites New (version) as it currently doesn't support web URL mapping. More info here under section Web address mapping :
https://support.google.com/a/answer/7198710?hl=en&ref_topic=6399171
So I made my site publicly available and can access it via the sites URL. I then did a 302 URL forwarding in my domain DNS so if I type in the domain.com it gets redirected to sites URL -- All good so far.
My question is how come some users are managed to have their URL showing up in Google results. Example Query in Google :
https://www.google.com.au/webhp?sourceid=chrome-instant&ion=1&espv=2&ie=UTF-8#newwindow=1&q=Made+with+the+new+Google+Sites%2C+an+effortless+way+to+create+beautiful+sites.
you will see some sites for example
These sites - ArchSci2020 and Mr. Arnote's new google Site have their URL displayed and Title doesn't contain the word Google Sites. Example if you try the following query where I am looking for the footer phrase
https://www.google.com.au/webhp?sourceid=chrome-instant&ion=1&espv=2&ie=UTF-8#newwindow=1&q=%22Made+with+the+new+Google+Sites%2C+an+effortless+way+to+create+beautiful+sites%22
I am using Google Domains and it only allow URL forwarding -- what are they doing differently?
Thanks.
I think they are indexed by Google with their real domain name because they use iframes or invisibles redirects configured in their registrar panels like Namecheap (for mrarnote.com for example).
According to Namecheap support:
Recently, Google has released a NEW version of Google Sites (it can be accessed through https://sites.google.com/new), however, it has no option to use a custom domain with it. If you still would like to use this version of sites with your domain, you may set up the connection via URL forwarding (more detailed instruction on how to set up the URL forwarding can be found in this article)

Google results shows Meteor.js site as "Loading..."

I submitted a Meteor.js site (based on Telescope) to Google and in the search results, Google have the site pages crawled as having the title Loading... and only static content (instead of dynamically loaded ones) were in Google's description of the site.
mrt list --using shows that spiderable is already being used.
Is it possible to let google retrieve the actual title of the page?
Yes. You should use the spiderable package.
You can check if your configuration works by adding the escaped fragment parameter. For instance for the documentation: http://docs.meteor.com/?_escaped_fragment_=

Google is searching AWS Elasticbeans site( mysite.elasticbeanstalk.com) but not my site (mysite.com). What to do?

I was testing my site on AWS. And it is like mysitetest.elasticbeanstalk.com. But my original site is mysite.com. Now whenever i search for mysite google shows mysitetest.elasticbeanstalk.com links but not my original site. I have done all the verifications on webmaster tool for my site.
Is there any way to make elasticbeanstalk site completely private to me only and it is invisible to google? And if there are more suggestions please give me. All are welcome.
You should set up a robots.txt file for your test site to tell Google/crawlers to redirect the test site to the production site.
Example article: http://www.bruceclay.com/blog/how-to-properly-implement-a-301-redirect/
Your future test sites should have a robots.txt that tells google not to crawl it.

Webmaster Tools Crawler 403 errors

Google Webmaster Tools is reporting 403 errors for some folders on the websites server for example:
http://www.philaletheians.co.uk/Study%20notes/
The folder isnt forbidden so dont understand why it would be 403 errors for Googles Crawler?
How come the Google Crawler is trying to browser the actual folders and not just going straight to the files in that folder? Is this somthing to do with robots.txt ?
Make sure is there any actual place or document to be present if some one request that url. I've browsed through your site and could not found a link that directs to http://www.philaletheians.co.uk/Study%20notes/
Also it seems, all the study notes are inside this "Study%20notes" directory.So actual this link will not work anyway. So check the google web master tools's link from to find where this broken link situate and cure it.
Have you set default document correctly in your web server? In apache, this comes in the DirectoryIndex setting (and defaults to index.html). Also, in general it might be better to strip off spaces etc.. from your traversable directory names (the %20 you are seeing between Study and notes is a url-encoded space character), so as to keep your URLs clean to your visitors and search engine bots.

Resources