Duplicate Content – Misunderstood Not Mythical
Duplicate Content is not a myth. It has a very specific meaning, but it is misunderstood to apply to other conditions to which it simply does not apply. Here is Google’s definition (the only one we care about):
Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Mostly, this is not deceptive in origin.
Duplicate Content often occurs naturally on websites and is usually not deceptive in nature. Ex. The page www.website.com/category/green/canoe having the exact same content as www.website.com/category/canoe/green. These are different URLs and yet they have the same content. This is also known as canonicalization. (what is canonicalization?). It’s for this reason that when Google finds Duplicate Content on websites they do not immediately and in general do not give a penalty. Why would they with the above example? To help out Google, you can use the rel="canonical" link element to refer Google to your preferred page when you have multiple URLs leading to the same content such as in the above example.
You may be wondering now, when does Google take action and give a penalty for Duplicate Content? If after further review they decide the Duplicate Content on your site is deceptive in nature, they will penalize your site. This isn’t rocket science – don’t make a bunch of pages on your site with the same article or content posted over and over again. I’ve been over this a few times on the site now – when does Google slap sites with penalties? Drum-roll please….for SPAM! Posting the same content repeatedly on your site sounds like…..SPAM! Don’t do it.
“But wait, I am a very astute person and noticed that Google’s definition addresses the same content ‘within or across domainS”. Why, thank you for noticing astute reader. The same content (duplicate content) is often found on multiple sites. Google has claimed that approximately 28% of all the content on the web is duplicate. Before moving further, let’s first reaffirm that nowhere in their definition does Google mention that duplicate content is inherently tied to penalization. That being said, the way Google handles such a predicament is pretty simple. They choose one (or sometimes more sources) with that content that seems the most “appropriate.” Yes, the term “appropriate” is quite vague, but it is the best we can get from them. They do this so that you don’t have an entire results page filled with the same content from 10 different pages. That’s it.
If you are going to use someone else’s content you need to address it as such – link back to their site and say where it is from, maybe even ask them first – they will probably be fine with it since it means a relevant “dofollow” backlink for them. Otherwise, don’t be surprised if you get a DMCA infringement notice for stealing someone else’s content – you can be de-indexed entirely for this. So now that we are clear about what duplicate content is actually all about, what is all this crazy myth stuff about?
The myth is about receiving a penalty for having the same content as someone else or for syndicating articles. In all honesty, after clearly explaining what duplicate content really is, we can dispel this myth very quickly.
Part 1) If I submit an article to multiple article directories, is that duplicate content?
No, articles are not duplicated, they are syndicated. Articles play by their own rules. The point of submitting an article to a directory is to have other people republish it! Ever notice how an AP news report gets published by tons of news sites and none of these sites are penalized? It’s because it is syndicated content.
Part 2) What if I post the same content to multiple hubpages/squidoo lenses/blogs?
No problem, they just won’t all show up in the SERPs because Google will pick the one that is most “appropriate.” And to repeat an earlier point, if you post someone else’s content, that’s just plagiarism
Part 3) I can just use copyscape to see if my content will be duplicate content.
Here’s a question for you: Do you think that Google runs everything through copyscape to decide if it’s duplicate content? Sounds silly now doesn’t it. Google has its own way of deciding whether something will be considered duplicate content or not. If you are writing everything yourself there should be no question that your content is unique. If you are spinning articles, copyscape can be helpful, but the final word comes from the Big “G.”
To summarize, you will only be penalized for duplicate content if it is within your site and you use it in a deceptive manner ie spam. If your website has duplicate content in the sense that another website has the same content, there is no penalty, but only the most “appropriate” will appear in the SERPs. Articles are not duplicate content, they are syndicated content.







Very interesting. You no longer need to use Copyscape if this is the ruling. Even if only one of your sites shows up, the others are still getting backlink juice – I assume. . .
Hi,
This is an interesting topic. I actually read an article stating that as long as you publish content on your own blog/website and allow it to get indexed, which may take a day or 2, you are allowed to submit that exact article to any article directory, with no penalty for duplicate content.
So as long as your original article is yours and is published and indexed on your blog. Then you are all set to submit it to article directories.
Thanks for sharing.
Hey Ian,
Getting your content indexed on your site first doesn’t disable the duplicate content penalty, remember the duplicate content penalty only refers to the same content appearing multiple times on the same domain. Indexing your content on your site first encourages Google to view your site as the original source. This means that you won’t outrank yourself in the Search Engines with your article on a directory page rather than the page on your site.
Ben,
This article really clears up a lot of confusion I had about duplicate content. I have subscribed to your blog and look forward to more quality articles about SEO, Keyword Research, etc.
Respectfully, Truman Anderson
Truman Anderson recently posted..Bio Energy Dome
Thanks Truman, I appreciate the feedback. Let me know if there’s any content in particular that would like to see here, just shoot me an email through the contact form or comment here.
Wow, this is a eye opener!
Please add more topics on the myth page.
And do you know good plugins for WP for the canonicalization thing?
P.s. add a subscripe check box so I know if somebody answered my question.
Els recently posted..Happy Parking
I guess that has just cleared my mind about duplicate content. I was on doubt submitting may articles on article submission sites. I haven’t really known that it is the process of syndicating the article. Thanks Ben. I really find your post very helpful and an eye opener.
I seem to get this question so often from clients and readers. Glad to see you’ve covered the topic. People always confuse syndication with duplicate content and because of it are either leery about syndicating, or stress about it after the fact.
Warren recently posted..My Sure Fire Way To Gain More Traffic
Yeah I think it’s an inherently confusing term. Syndicated content IS duplicated in the general sense of the word, but not with the SEO penalty bearing of the word. I think it’s just a matter of being educated towards the terms we use in SEO and how to use them.
Ben. That makes sense at well. I am reading your “Article Marketing & Beyond” and am glad that you are advocating less articles but all well written.
Thanks Mig, you know even after Panda I think article marketing is still a feasible strategy. It’s best to diversify and use all methods of link building.
Hey Ben,
Most of you content are really good, From my first day i follow the white hat SEO. I use to refer many experts content before getting in to real action. I am trained professional by Bruce Clay, But again few thing i have learn myself right.The reason i came here, my boss want me to do article marketing, So i though of duplicate content issue in here. I know the same content used in our websites or server as considered as duplicate content.
This part really helped me man
Part 1) If I submit an article to multiple article directories, is that duplicate content?
Thanks for wonderful content, i have also downloaded AMR, hope every thing should be best. Happy New year 2012, will check back your website soon.
Thanks for the great post and insights of how to avoid common misunderstandings about Google’s duplicate content. By this post, I get great info about it. Thanks for sharing Ben.
Jim Paulson recently posted..Fotokopi
Ben,
Wow, you really do publish some high quality articles on SEO! My white hat is off to you. I have subscribed to your blog because of the high quality of information you have here. I just read your “Article Marketing and Beyond” PDF and it is fantastic, cleared up a lot of things I was struggling with conceptually about article marketing. I’m going to go through your entire online “SEO Course” because what you are putting on this site is so good. Thanks for being one of the “good guys” in Internet Marketing and for practicing what you preach.
-Kevin T. Ryan,SSG(ret.)COARNG
Kevin,
Thanks so much, you just made my day
More good things are on the way!