Posts Tagged ‘Duplicate Content’

The Search Implications of Syndicating Content

August 29, 2008

Online publications share articles for a variety of reasons. But syndication—pushing your content via partnerships or an automated solution to multiple sites to broaden brand exposure, audience reach and boost link-back traffic—has become a top component of an online marketing strategy. But from a search engine perspective, syndication can cause a bit of a conundrum. If an article is a relevant result for a search, the search engine wants to show it to the searcher. It does not, however, want to show it twice (or three times or more). And that makes sense. If you’re searching for something, you don’t want multiple results that all lead to the same content even if that content is on different sites.

Read more in Circulation magazine

Irony

July 11, 2008

A (Google official) blog post about scraped content on a scraper site.

But the original does rank first.

Although not in blog search.

(But that appears to be because the post isn’t indexed in blogsearch at all. Because rivva.de is listed in its place?

Irony

June 12, 2008

A (Google official) blog post about scraped content on a scraper site. [Site is here (www.arsgeek.org/2008/06/11/duplicate-content-due-to-scrapers/), but link removed as the site may now contain malware.]

But the original does rank first.

Although not in blog search.

(But that appears to be because the post isn’t indexed in blogsearch at all. Because rivva.de is listed in its place?

Ranking As The Original Source For Content You Syndicate

May 14, 2008

When you write content on your site, whether it’s a blog post, product description, or an article, you likely want to rank well for it. I’m often asked how best to ensure this when you’re also syndicating that content.

Why Syndicate?
There are good reasons for syndicating content. Syndication can bring traffic, exposure, and sales.

If you’re a blogger, you might syndicate your posts to get wider distribution. If your posts are seen by a bigger audience, you might gain some of those readers for yourself. If your site provides authoritative resources, you might have a partnership with other sites that want to include that content. And if you sell products, you might provide affiliates with content feeds, which in turn brings in additional revenue.

But What Should Rank?
But from a search engine perspective, syndication can cause a bit of a conundrum. If what you wrote is a relevant result for a search, the search engine wants to show it to the searcher. But not show it twice (or three times, or maybe even a thousand times in the case of an affiliate feed). And that makes sense. If you’re searching for something, you don’t want multiple results that all lead to the same content even if that content is on different sites.

So what’s a search engine to do?

Search engines generally identify duplicate results and filter out all but one. They have lots of ways to decide which version to show. They try to figure out which one is the “original” by looking at things like which version was published first and which has the most links pointing to it.

Your content may appear on other sites at times other than when you syndicated it (such as when your RSS feed has been scraped), and search engines try to account for that too by looking at things like which site is more authoritative.

What If Search Engines Get It Wrong?
Generally, search engine algorithms work pretty well and your original version shows up. However, the system isn’t perfect. Michael Gray recently noted that sometimes Google gets it wrong and shows the version from a more authoritative site, even when that is not the original version. He suggested some ways for making sure that the original version shows up first. And he linked to the Search Illustrated column on Search Engine Land that shows a great illustration of how search engines determine the version to show.

How Can You Make Sure Your Site Ranks First?
So what do I suggest you do if you’re syndicating content but want your original version to rank about the syndicated ones?

  • Create a different version of the content to syndicate than what you write for your own site. This method works best for things like product affiliate feeds. I don’t think it works as well for things like blog posts or other types of articles. Instead, you could do something like write a high level summary article for syndication and a blog post with details about that topic for your own site.
  • Always include absolute links back to your own site in the body of the article. This is particularly helpful when your content is scraped.
  • Ask your syndication partners to block their version of your article (via robots.txt or a robots meta tag). Whenever I suggest this, people laugh and tell me that the sites they are syndicating to would never agree to this as they want the content so they can rank for it. I can completely understand this. But as someone who’s providing your content for syndication, you should then just realize you’re in a competition with your syndication partners for ranking and it’s quite possible they can outrank you. If you are able to, put together a syndication agreement that states they get your content as a benefit for their readers, not as a way to acquire search traffic for that content, then you can keep control of ranking for what you’ve written and they can provide a benefit to their audience.

But Make Sure Duplication Is the Issue
In Michael’s case, he explained that he has an agreement with Web Pro News that enables them to syndicate any blog post of his that they’d like for their own site. And in the case he describes, the article on the Web Pro News site is ranking above the version on his blog. He speculates that’s because Web Pro News is a more authoritative site. I am sure that what he describes can happen (particularly since in this case, his Web Pro News version of the article doesn’t have a link back to his original article; at the very least, he should negotiate an introductory paragraph at the beginning of his syndicated posts that explain where the original is located with a link to it, not only for search engine ranking purposes, but to give readers better content), but in his particular case, I’m not so sure that’s the cause.

I can’t find his original post indexed at all. Obviously, if a page isn’t indexed, it has no chance of ranking. I’m not sure why that particular page isn’t indexed. It’s not blocked with robots.txt or a robots meta tag. It sounds like he can see it indexed, so maybe I’m hitting a different data center. If that’s the case, I don’t know if the one I’m hitting was refreshed more recently than the one he’s hitting or if his is.

Don’t Give Away Your Control
His point that syndicating content can be tricky if you want to rank for that content remains, even if the root cause of his particular case is a bit hazy. If search is not yet a large acquisition channel for your site, then you may not mind if another site ranks for your material as you may get more traffic from the syndicated site (so make sure you at least have a link back to your site!). But as you site starts to stand on its own and search traffic starts growing, you will want to have more control. So think of your longer term strategy when you negotiate syndication partnerships and don’t give up all of the control of the content you work so hard to create to others.

Buffy, in duplicate

June 6, 2007

You know you’re traveling too much when you have to fly to the town where you live in order to speak at a conference. But indeed, I flew into Seattle just in time for SMX Advanced. I spoke about duplicate content, but since the panel was a summit format, it was mostly designed to get feedback from the audience about how search engines could make the lives of webmasters easier. And since it was an advanced audience, I figured they mostly knew about duplicate content issues already and I may as well just dive right into getting feedback rather than spend time on the basics. So, I did the logical thing and just talked about Buffy.

Buffy had to deal with duplicate content issues a lot. It’s quite possible she’s gotten tired of all that vampire slaying and is now off doing SEO.

Speaking of duplicates, fuzzy, good Willow or evil Willow? Here are some pictures to help you decide.

Fuzzy Sweater Willow

Willow in a fuzzy pink sweater

Willow in a fuzzy purple sweater

Evil Leather Outfit Willow

Evil Willow in leather

The Second Evil Willow
Willow going after Warren
(A non-Buffy tangent: The question I got most often after the session was about the supplemental index. Does having duplicate content cause sites to be placed there? Nope, that’s mostly an indirect effect. If you have pages that are duplicates or very similar, then your backlinks are likely distributed among those pages, so your PageRank may be more diluted than if you had one consolidated page that all the backlinks pointed to. And lower PageRank may cause pages to be supplemental.)

Now back to your regularly scheduled Buffy.

  • Nine By Fox

    Stories from the online marketing industry, Vanessa's travel adventures, and more. For reference material and analysis, see the Library.
  • Buy the Book!

  • Categories

  • The Latest From Twitter