Lecture
Today I was analyzing one of the clients' sites, fresh, duck, just hands were falling, how many problems were there. Doubles of content within the site - there were exactly 67% of the total volume of the site. But we will analyze them on the basis of the popular resource ROEM.RU, especially since there are also enough problems here.
Previously, we already had an article on duplicates: "Duplicates of content - myths and problem solving," but today we will touch on Runet and Yandex in particular.
There are no identical sites. Search engines also treat different sites quite differently. What one - like an elephant grain, the other - the last nail in the coffin lid. ROEM.RU is a quite authoritative resource, but it is possible that it can also get into a problematic situation once.
Let's try to calculate how much content there really is on the site, clean, without any slag.
So (on March 24, 2010), Yandex knows 33 thousand pages, Google knows 37 thousand pages, Rambler knows 41 thousand pages. What are these pages?
Let's begin to cut off in order.
We remove user profiles from indexed pages - these are 2 thousand.
Next, cut off the login page - this is 500 pages.
We now turn directly to the duplicate content on the content of the pages.
As we see on each information page there are 5 copies , namely:
Do you want to add? Yes please:
THOSE. if you figure out, we have 5 clear copies that Yandex knows, and you can also make at least +2 more copies, and if you like, I’m sure you can do even more. But let's only deal with what Yandex sees, and not what can be done with the resource.
As we see - "for printing" is a rather large segment on the site - almost 10 thousand pages.
So, to summarize, how much real content is on the site and how much does Yandex know.
There are 33 thousand documents in the search engine index, of which 2 thousand are profiles, 9500 are printable pages, 500 pages are authorization. Now let's subtract ALL duplicate content and system pages. As a result, we get 5-6 thousand pages.
Of the 33 thousand pages - only 5-6 thousand with real content , 3 thousand pages - can be called systemic in the index, and 24 thousand pages are duplicate articles (clarification is ONLY clear duplicates)!
Here on the client's site I have almost the same situation. will have to clean up. On any site there are problems - the main task, get rid of them, so as not to have problems in the future.
Of course, if it is not a satellite for selling links although, just on the site for selling links to a greater degree this problem manifests itself, they just stupidly take off (doubles) and the site falls into GBL. If for a satellite it is just a loss of profit, then for a normal website, a business, the departure of a large part of the pages can be destructive (for the Internet direction).
Analyze your sites, fix problems, do not earn yourself hemorrhoids.
Comments
To leave a comment
seo, smo, monetization, basics of internet marketing
Terms: seo, smo, monetization, basics of internet marketing