My home page is the root node, and consists of HTML editable text and repeaters. The page types used by the repeaters have been excluded from the smart search, but editable text isn't showing in results.
Currently my path is set to: /%
Everything else seems fine, i'm getting results that i expect (minus some url alias issues), but this has be stuck.
Some ideas:
Have you used the correct Documents index type?
https://docs.kentico.com/display/K8/Defining+document+indexes
Have you selected the root page type as allowed content (probably CMS.Root)?
Is the Search Preview (located on the Search Index itself) showing the page?
David
Yes, checking for whether page type is included in the index is probably the first thing one should look out for while debugging for Search Index not returning specific page type results.
Related
as expected, there coming more and more with TYPO3. I'm very in TYPO3 and working with the bootstrap package. Therefore I don't understand loads of stuff.
The task is actually very simple (i guess). I just want to include a search field on in my page with indexed search. I included the extension with the extension manager.
So far so good, the search field is shown and it's able to find content on my page, but when I select one of the search results this page is shown:
Error message
Does anyone understand what happened here?
Many thanks in advance for your hints!
cheers,
expikx
I have a Kentico 12 MVC site where the cms and I guess "client" site are in the same server but separate IIS entries. One is called admin.site.com and the other is called dev.site.com.
I'm trying to implement the Smart Search functionality with a Page Crawler index. The reason I want a Page Crawler index is because my content structure is as follows:
Page Container > Page Type "Product"
Then within "Product" page type, I'm pulling in content from a different part of the content tree using widgets/page builder functionality in the Page tab. The Content tab of that page has very little actual content.
If I use Pages Index and search on that, it only grabs the page types that are in the content widget section of the site, so not the pages that implement the widgets which are the actually live pages on the site. I implemented the Page Crawler index and tried a search preview but literally anything I search comes with no results. Please let me know what details you'd need from me to help, I appreciate any help!
Best,
RP
Check the documentation and especially the note:
"We do not recommend using crawler indexes on MVC content-only sites. The crawler only selects pages from the site's content tree in Kentico, which may not match the actual structure of the website (in many cases, content-only pages only store data and do not represent pages on the live site)."
To achieve your need you will need to create your own crawler code and combine it with custom search index.
I have a custom page types (Content Only) for Locations. Then I have a landing page (/company/locations/) with repeater to list all locations and their details. Things work well so far. Now, after adding the smart search, I notice that if I search a location name like "san francisco", the landing page didn't show up in search result, but the content-only page showed with a URL like this /company/locations/san-francisco. The thing is, this URL results in 404 since that page doesn't really exist. What should I do? Should I re-create the page type and change it to a regular page instead of content only before it's too late? Or is there a way to make individual location url (/company/locations/san-francisco) work - considering we can't specify a page template to go with content only page type? Thanks!
There are multiple types of Search indexes in Kentico.
"Pages" scans the data of a document, such as any webparts+properties, editable text, form data, etc. They do NOT scan the rendering on the page though, it doesn't catch any Repeaters (what you're using).
"Page Crawler" will literally load the page, and scan all the content in the page. This will catch Repeaters and dynamic content like that.
Knowing this, you have a couple options.
Use Pages, then Modify the Smart Search Result and add some transformation logic to say something like the below
The Link
Use Page Crawler, tell it specifically to only index the /company/locations.
Use Page Crawler, and also a custom smart search indexer so you can exclude the header/footer or other areas out of the content (it's a bit more advanced)
If you don't want that URL to show then simply exclude those page types from that search index. But if you want them to specifically show, then create a detail or selected transformation for that /company/locations repeater to display when someone navigates to it from the search. This will also be good for google and other search indexes if you plan to have specifics for each location.
Currently Google displays elements in the result excerpts that belongs to the functional part of the site. Is there a way to exclude these elements to get crawled/displayed in google?
Like eEdit, eDelete, etc in the example above.
To exclude the pages from Google's index, block them using the Robots.txt file or if it is just the content then use the "rel="nofollow" tag.
Hope this helps.
Update on my particular situation here: I just found out that the frontend code has been generated in a way where the title and the description meta was identical.
Google is smart enough to expect that if a copy is already displayed in the title of the search result there's no reason to add in to the excerpt as well, instead looks for content - believed to be valuable - from the actual page.
Lessons learned:
there's no way to hide elements from google but keep it visible for your users
if you'd like to have control over the content displayed in google searches, avoid using the same copy in your title and description
I am getting some strange (to me) results when using Google CSE on my site.
Background:
The site has a MAIN search field (Google CSE #1 we'll call it).. that was set up in the Google CSE admin panel.. and set to be a site wide search.
I recently added a page that will be the front end search for a directory of PDF's.
I went back to Google CSE admin page.. made a NEW entry for this PDF sub-directory..
pointed to only this directory and its child/sub-directories..
set it to ONLY return .pdf results..
outside of these two differences, both CSE accounts/entries are set-up identical.
The problem or question I have is..
the 'results' pages (while have the same exact settings, search field on top of results, full row...single page..etc)..are different a bit.
the 'new' page results has a tan colored 'ad block' on top of the results..
and 'more importantly' the results page SEARCH field itself looks different, and the SEARCH BUTTON itself looks different, and has an 'x' (close/clear) button right after it
if the settings are the same,.. how am I getting different looks? if only PDF file type or confined to a sub-directory for indexing/searching are different?.. do those settings somehow give different looking elements like that?
the new page SEARCH/SUBMIT button also have a magnifying glass on it. where original one has search text..
I didnt see anything in the "Look and Feel" section of the admin pages
what is affecting the different looks?
answer: choosing hide ads or not under the BUSINESS section in Google CSE admin page changes the 'looks' of the search field & submit button...
why that should affect the visual results, I have no clue.