问题
I had my domain set to a parking page with my registrar. The nameservers have been changed (I'm using Azure DNS), I've updated robots.txt
and submitted a sitemap. I've requested Google to crawl my site, and we're now showing in Google's search results.
The problem is that for every title link that shows in the search results, some info from the parking page is tacked on to the end.
So the result link might say
About Us | Example Company - example.com - nameOfRegistrar
.
About Us | Example Company
is correct, but - example.com - nameOfRegistrar
seems to be old info from the parking
page.
The URLs and text descriptions are fine, it's just the title links that are messed up because they add the URL of the site again and the name of the registrar. I think this info was in the title on the parking page.
How do I get rid of this old (presumably cached) info? Do I need to contact Google? My registrar? Shall I ask for another crawl? Why is this happening?
回答1:
So the result link might say "About Us | Example Company - example.com - nameOfRegistrar". "About Us | Example Company" is correct, but "- example.com - nameOfRegistrar" seems to be old info from the parking page. The URLs and text descriptions are fine, it's just the title links that are messed up because they add the URL of the site again and the name of the registrar.
It's better to write a 301 redirect
in your .htaccess file. Go through this and this to understand how it works and implement it. It should fix your old url bugs in no time.
With the 301 redirects
you will be writing instruction at your server level which will enable to redirect it to a page with the url that you mention. So even if your search result point to an old url (until Google fixes it), when your server gets a request pointing to the old url, it would redirect it to the new url that you mention in the .htaccess
file. Also, in your robots.txt
please make sure you allow bots to crawl .htaccess
file too. It would be easy for the bots to know that you have moved urls to newer ones.
Also, if you want to remove
old url which are not present in your site currently, then you can place a request to Google to remove them using this Google Help Answer. Not to mention, it might takes days together for those links to disappear.
回答2:
I fixed this by using Google Search Console. I requested a recrawling of the entire site several times, but after a 5 or 6 days nothing had improved although the site was clearly being slowly crawled. I then requested crawling of individual pages, without crawling other links on that page, for every messed-up result that showed up in Google. That was yesterday and everything seems to be fine now, those pages are showing up fine in Google.
So Google will eventually recrawl your whole site, it just takes time. You can probably speed up the fixing of messed-up results for individual pages by submitting individual pages to be crawled (without requested crawling of other links on that page).
I'm surprised that the crawling process isn't faster, but it's good because it doesn't hit my resources too hard.
As for why it happened, I'm not sure, but I suspect that the DNS info didn't have time to propagate. It doesn't matter now.
来源:https://stackoverflow.com/questions/45748382/old-parking-page-info-is-mixed-in-with-search-results-for-a-newly-crawled-site