Google's crawler doesn't understand my maps. How can I work around this?

我寻月下人不归 2021-02-13 23:18

I found strange words (have, here, imagery, sorry) that are not supposed to be on my site being picked up as keywords by Google's crawler. They appear to come from the map's "sorry, we have no imagery here" placeholder text.

6 Answers
  • 2021-02-13 23:25

    "Solution" applied:

    I had a different map on each page of my site. I ended up converting all the maps to images and kept only one interactive map, on the page where I really needed user interaction with coordinates and mapping features. This approach led me to create and open-source [osm-static-maps](https://github.com/jperelli/osm-static-maps). Hope it helps somebody!

    The site got several improvements:

    • Got rid of these awkward words in Google Webmasters.
    • More relevant SEO: static images with the "alt" HTML img attribute instead of an "unindexable" JS map.
    • Faster page loading (got rid of all mapping libraries and tile loading).
    • Faster JS performance (less JS for the client to process).
    • Improved user experience: scrolling the page used to zoom the map instead of actually scrolling. (You might think disabling scroll-to-zoom on the map would solve this, but then users expected the map to zoom on scroll; both ways were [ok and wrong at the same time][4].)

    On the downside, I found:

    • Less user interactivity (boring page).
    • Less context on the map (less informative map).

    These two things could be "fixed" by loading the interactive map when the user clicks the map image (see the sketch below). The downside is that if the user clicks the image unintentionally, the map loading can come across as unexpected behaviour.
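
    A minimal sketch of that click-to-load idea, assuming Leaflet is already included on the page; the element IDs, image path, coordinates and alt text are placeholders, and the static image could be generated with something like osm-static-maps:

    <img id="map-placeholder" src="/static/map.png" alt="Map showing the venue location" style="cursor: pointer;">
    <div id="map" style="display: none; height: 400px;"></div>

    <script>
      document.getElementById('map-placeholder').addEventListener('click', function () {
        // Swap the static image for the real interactive map on demand
        this.style.display = 'none';
        document.getElementById('map').style.display = 'block';
        var map = L.map('map').setView([-34.917, -57.955], 13); // placeholder coordinates
        L.tileLayer('https://tile.openstreetmap.org/{z}/{x}/{y}.png', {
          attribution: '&copy; OpenStreetMap contributors'
        }).addTo(map);
      });
    </script>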

    Edit2

    I made an open-source project out of this. Check it out: https://github.com/jperelli/osm-static-maps

  • 2021-02-13 23:27

    1) Perhaps setting your meta description inside your <head> tags will supersede the text Google is picking up:

    <meta name="description" content="This is an example of a meta description. This will often show up in search results.">
    

    2) If the meta tag doesn't work, this is possibly because the very first thing rendered in the <body> (or rather, attempted, by the looks of your screenshot) is the Maps display, before any other content is loaded.

    For example, if you place a <div> or <p> tag with some introductory content about your website before the map in your <body>, you may avoid this. However, I am not 100% sure about this; you will have to test and see the results (keep us posted). A rough example is sketched below.
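
    A rough sketch of that ordering, with indexable content ahead of the map container; the IDs, file names and copy are placeholders, and the map is assumed to be initialized by your own script:

    <body>
      <!-- Descriptive, indexable content comes first -->
      <h1>Acme Venue Finder</h1>
      <p>Find our venues on the interactive map below, or browse the full list of locations.</p>

      <!-- The map container only appears after the descriptive content -->
      <div id="map" style="height: 400px;"></div>
      <script src="/js/map-init.js"></script>
    </body>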

    If you plan on doing this and want a) the Google crawler to still pick the text up, and b) to hide the actual block of words from viewers (style="display: none;" or style="position: absolute; left: -9999px;"), do so at your own discretion, since hidden text can be treated as a violation of Google's webmaster guidelines.

  • 2021-02-13 23:31

    Perhaps you can add more specific meta tags, such as:

    <meta name="geo.region" content="US-WA" />
    
    
    
    <meta name="geo.placename" content="Snohomish" />
    
    
    
    <meta name="geo.position" content="-57.954900;-34.917000" />
    

    Also add the meta description Matt Rowles suggested, and set up some of the word filters in Google Webmasters.

  • 2021-02-13 23:35

    This answer won't help you remove the words from the already-crawled pages, but it might prevent them from being picked up again on the next crawl.

    Your problem might be related to the crawler not being able to load a valid map. It's not exactly clear why it can't; the map provider might be blocking Googlebot.

    Anyway, if it's not too hard, I'd have a look at the list of Google crawlers here:

    https://support.google.com/webmasters/answer/1061943?hl=en

    and build a list of the user agents documented there. I'll use 'Googlebot' as an example, but you should check against every crawler user agent you want to block.

    // Real Googlebot user-agent strings contain "Googlebot" as a substring,
    // so test for it instead of comparing the whole string.
    if (!/Googlebot/i.test(navigator.userAgent)) {
      // regular visitor: load the map and other stuff
    } else {
      // crawler: show a picture where the map should be, or do nothing
    }
    

    Googlebot executes JS, so this should prevent errors in case Googlebot can't load the map.

    One thing you could do is change your browser's user agent to 'Googlebot' and load your page. If the map provider is blocking any browser with this user agent, you will see exactly what Googlebot sees. The other caveat is that Googlebot may also have a timeout to avoid loading too much data, and it may not load images.

    Adding these guards might help prevent Googlebot from actually loading the map, if the problem really is in the map.

  • 2021-02-13 23:36

    Did you try adding spider meta tags? They really help a lot; try this out in the head section:

    <meta name="robots" content="index, follow">
    

    The spider will now index your whole website: not only the first page, but all your other pages as well.

    Also try to make your description more unique; it is much more powerful, but don't overdo the keywords.

    thanks

  • 2021-02-13 23:47

    Unfortunately I saw this a lot too...

    My assumption is that Googlebot won't fully evaluate all the JS code on a page, but uses heuristics as well, and thus ends up with no imagery (which gets indexed). Based on this assumption I did the following:

    1. Create a div with a "random" ID (for the map) and style="display: none;"

    2. Create a noscript tag with an img tag in it with the SAME "random" ID (I used a static map image here, which is also good as a no-JS fallback)

    3. Create a (custom) JavaScript function that takes the unique ID, initializes your map, AND toggles the display style of the map element to block (see the sketch below).
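
    A minimal sketch of those three steps, assuming a Leaflet map; the ID, image path, alt text and coordinates are placeholders:

    <div id="map-x7f3" style="display: none; height: 400px;"></div>
    <noscript>
      <!-- Same ID: only this static image is rendered when JS is disabled -->
      <img id="map-x7f3" src="/static/map-fallback.png" alt="Map of the office location">
    </noscript>

    <script>
      function initMap(id) {
        var el = document.getElementById(id);
        el.style.display = 'block'; // un-hide the map container
        var map = L.map(id).setView([51.505, -0.09], 13); // placeholder coordinates
        L.tileLayer('https://tile.openstreetmap.org/{z}/{x}/{y}.png', {
          attribution: '&copy; OpenStreetMap contributors'
        }).addTo(map);
      }
      initMap('map-x7f3');
    </script>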

    So far, none of the maps' "sorry we have no imagery" text gets indexed.

    Hope it helps
