HTML meta description: how many characters do engines display? - meta-tags

On Google, results are displayed with the meta description tag on one line, the description on the next, and the URL on the last.
How many characters of the meta description does Google display? I always thought it was around 150 or 160.

Meta description tag provides short information on webpage as a summary. Meta description will not appear in web page but it display in Search Result Pages. Google search engine give much important to Meta Description Tag to display in search result pages, but yahoo and other related search engines give somewhat different important to meta description tag, so you should not neglect it altogether.
Meta Description Tag Length is not the number of words that count. It’s actually the number of characters length. Because, Google will cut off anything more than 155(roughly) characters. Optimizing for in the description and try to limit its length to 25-30 words. Also try to use no more than two sentences.

Related

JavaScript: scrape description from Wikipedia

I'm looking to scrape the main description from Wikipedia when you first enter a page (e.g. the text that first appears when you send a wikipedia link on a website and it embeds).
I initially tried using MediaWiki, but when I tried it, it mostly returns non-relevant data which does not include the description/embed text.
Could there be any endpoint that Wikipedia has that could return the data that I want to make use of?

Correctly set meta description

I am having an issue with the meta tag description in my personal web site, when using it like that:
<meta name="description" content="my content" />
A search in Google for my homepage doesn't display the description I set but some contents of my page.
I read articles about that but I can't figure it out why this is happening.
According to google support, here is a good way to define a description of your site:
<meta name="Description" content="Author: A.N. Author,
Illustrator: P. Picture, Category: Books, Price: $17.99,
Length: 784 pages">
Is this meta content description going to be a standard ? Some kind of formated object ?
Or maybe this is my XHTML way to write tag the problem ?
Google announced a while back that it wouldn't use meta descriptions for ranking.
The meta description is sometimes used as the summary text in search results, but if Google thinks it has found a better summary on your page based on the user's search, it will use that instead.
In some situations this description is used as a part of the snippet shown in the search results
Your meta description should really match up to the page content very well, otherwise it definitely won't be shown. The example you have shown contains structured data - but there is no indication that there is a strong preference for this (i.e. it may not be relevant to all pages).
As always, the algorithms are mysterious and subject to change.

Query wikipedia

I would like to query two or three terms in order to locate them in Wikipedia´s entries. Specifically, I´m trying to see if some terms get repeated in the first paragraphs (abstract) across entries. Could be direct or through dbpedia. Thanks
Using Mediawiki API you can find articles that contain those keywords.
Try the API:Search documentation.
For doing what you want to do, also, you'd probably need to find the articles that have those keywords and then parse the text to check if they are in the first paragraphs.
With this:
?action=parse&page=Nicolas_Cage&prop=text&section=0
you can get the HTML of the first section of a page (see this post).

How can I prevent certain element to get displayed in Google search excerpt?

Currently Google displays elements in the result excerpts that belongs to the functional part of the site. Is there a way to exclude these elements to get crawled/displayed in google?
Like eEdit, eDelete, etc in the example above.
To exclude the pages from Google's index, block them using the Robots.txt file or if it is just the content then use the "rel="nofollow" tag.
Hope this helps.
Update on my particular situation here: I just found out that the frontend code has been generated in a way where the title and the description meta was identical.
Google is smart enough to expect that if a copy is already displayed in the title of the search result there's no reason to add in to the excerpt as well, instead looks for content - believed to be valuable - from the actual page.
Lessons learned:
there's no way to hide elements from google but keep it visible for your users
if you'd like to have control over the content displayed in google searches, avoid using the same copy in your title and description

How to get a description of a URL

I have a list of URLs and am trying to collect their "descriptions." By description I mean what comes up, for example, if you Googled the link. For example, http://stackoverflow.com">Google: http://stackoverflow.com shows the description as
A language-independent collaboratively
edited question and answer site for
programmers. Questions and answers
displayed by user votes and tags.
This the data I'm trying to accumulate for the URLs I have.
I tried parsing the URL's meta-descriptions, however most of them are lacking a meta-description (yet Google and other search engines manage to get a description somehow).
Any ideas? Should I just "google" each link and scrape the data? I have a feeling Google wouldn't like this...
Thanks guys.
Different search engines have different algorithms to get the description out of the page if/when they are lacking the description meta tag. Some ignore the tag even it it's there.
If you want the description Google has, the most accurate way to get it would be to scrape it. Otherwise, you could write your own or look around on the web for code that does it.
These are called snippets.
Google use proprietary (and possibly patented) methods to garner this information, so there is no simple answer.
As you suggest, they will use meta-description information if it is there. (How to set the meta-information to help Google.)
They will also honour requests from the page authors to NOT include snippets. (How to prevent Google from displaying snippets) You should probably respect this too (as well as robots.txt, of course.)
You may have some luck with existing auto-summary packages, such as OTS.
You may want to check AboutUs.org (i.e. http://www.aboutus.org/StackOverflow.com).
But, there's little chance that the site will have an aboutus page and not have a meta description.
Some info that might explain how google does this:
Webmasters/Site owners Help
Adding a URL to google
I am not familiar with Google APIs, but perhaps there is an official way to get such information.
Interesting. some sources are better than others.
For "audiotuts.com" google has a worse description than AboutUs.com.
Google
Nov 18th in General by Joel Falconer ·
1. Recently, an AUDIOTUTS reader asked me about creative process. While this
is a topic that can’t be made into a
...
AboutUs.com:
AUDIOTUTS is a blog/tutorial site for
musicians, producers and audio
junkies! It is the sister site of the
popular PSDTUTS, VECTORTUTS and
NETTUTS.
I hate problems like these... they should be trivial but they aren't!
If you can assume English content, you can first look for Meta Description, and if that doesn't work, you can look for the first two or three sentence-like word sequences.
A product I worked on looked for the first P or DIV that contained more than one sequence of > n "words" delimited by periods. It would use the two or three sentence-like sequences, up to x total words, as a summary paragraph. It wasn't 100% accurate, but good enough for the average case. The number of words was adjusted a few times to eliminate things like navigation elements.

Resources