Afrikaans Afrikaans Albanian Albanian Amharic Amharic Arabic Arabic Armenian Armenian Azerbaijani Azerbaijani Basque Basque Belarusian Belarusian Bengali Bengali Bosnian Bosnian Bulgarian Bulgarian Catalan Catalan Cebuano Cebuano Chichewa Chichewa Chinese (Simplified) Chinese (Simplified) Chinese (Traditional) Chinese (Traditional) Corsican Corsican Croatian Croatian Czech Czech Danish Danish Dutch Dutch English English Esperanto Esperanto Estonian Estonian Filipino Filipino Finnish Finnish French French Frisian Frisian Galician Galician Georgian Georgian German German Greek Greek Gujarati Gujarati Haitian Creole Haitian Creole Hausa Hausa Hawaiian Hawaiian Hebrew Hebrew Hindi Hindi Hmong Hmong Hungarian Hungarian Icelandic Icelandic Igbo Igbo Indonesian Indonesian Irish Irish Italian Italian Japanese Japanese Javanese Javanese Kannada Kannada Kazakh Kazakh Khmer Khmer Korean Korean Kurdish (Kurmanji) Kurdish (Kurmanji) Kyrgyz Kyrgyz Lao Lao Latin Latin Latvian Latvian Lithuanian Lithuanian Luxembourgish Luxembourgish Macedonian Macedonian Malagasy Malagasy Malay Malay Malayalam Malayalam Maltese Maltese Maori Maori Marathi Marathi Mongolian Mongolian Myanmar (Burmese) Myanmar (Burmese) Nepali Nepali Norwegian Norwegian Pashto Pashto Persian Persian Polish Polish Portuguese Portuguese Punjabi Punjabi Romanian Romanian Russian Russian Samoan Samoan Scottish Gaelic Scottish Gaelic Serbian Serbian Sesotho Sesotho Shona Shona Sindhi Sindhi Sinhala Sinhala Slovak Slovak Slovenian Slovenian Somali Somali Spanish Spanish Sundanese Sundanese Swahili Swahili Swedish Swedish Tajik Tajik Tamil Tamil Telugu Telugu Thai Thai Turkish Turkish Ukrainian Ukrainian Urdu Urdu Uzbek Uzbek Vietnamese Vietnamese Welsh Welsh Xhosa Xhosa Yiddish Yiddish Yoruba Yoruba Zulu Zulu

 

 

Article Navigation

Back To Main Page


 

Click Here for more articles

Google
Google's Duplicate Internet Content Filter in Action
by: Tony Simpson

If you don't believe Google's Duplicate Content Filter exists, I have Dramatic Proof their Internet content filter exists and it's very effective.

On July 5, 2005 I published an article entitled "7 Top Ways to Avoid Link Theft" which was picked up and included as content on other websites.

Before the article was released I checked on Google whether any results already existed for the exact phrase "7 Top Ways to Avoid Link Theft" and there were no listings for that term.

Over the next few weeks I monitored through a search query on Google how many results appeared in Google for the title of my article. One week after publication there were 6,760 results listed in Google, a week later it was 14,100 and it reached a peak of 17,000 results by July 26, 2005.

4 weeks after publication the results in Google had fallen slightly to 16,600.

Almost 6 weeks after publication the results listed in Google had fallen to 44.

In a matter of less than two weeks the number of search results on Google.com for the title of my article had gone from 16,600 to just 44.

In case you're thinking this is because all these other websites dropped by article and replaced it with other content I should add that a search on Yahoo.com on the same day still showed 14,300 results for my article.

What's more of these 44 results on Google, more than half consist of listings from the same websites. In other words some sites have the same article duplicated on different pages on their website.

So Google's Internet Content Filter is not used to remove duplicate listings from the preferred websites it chooses to keep in the search results.

On August 28th, 2005 8 weeks after first publication I distributed the article again to a new list of article sites to repeat the process. After 6 weeks the same article had reached a peak of 5,620 results on Google. Less than 2 weeks later the results had fallen to 217.

For me this was dramatic proof that Google's Duplicate Internet Content Filter is active and very effective. If you're wondering if other major search engines have a duplicate content filter I can confirm that Yahoo certainly does. The same article which was once listed on 14,300 sites on Yahoo, has fallen to 344 over the same time period.

From these results it would seem Google takes about 6 to 8 weeks to remove duplicate content using its Duplicate Internet Content Filter.

But the question remaining is just how does Google decide which out of over 16,000 results does it keep and which does it reject ?

I have witnessed situations where my own articles appear in results on other websites, but are not listed in the results for my own website.

So clearly Google does not take into account who the originator and author of the original article was when deciding which sites will remain in its search results.

It also seems to have nothing to do with where Google first finds the article.

Some articles I have published to my website for several weeks before releasing them for distribution to other websites.

In that time the Google spiders have visited my site several times and Google has had enough time to work out that the article was first found on my site.

It would be interesting to see if it's possible to work out what factors Google is using in its Internet Content Filter to decide which results to keep in its listing and which ones to remove. But that's for another article.

About The Author

Tony Simpson is a Web Designer and Search Engine Optimizer who brings a touch of reality to building a Web Business. A related report on article distribution is at : http://www.webpageaddons.com/stp/announcerclaim Article Announcer Review - Testing Product Claims

This article was posted on October 28, 2005

 



©2005 - All Rights Reserved

JV Blogs Visit free hit counter