Mind-Blowing Hack for Competitive Link Research

 
 

Peek-a-boo Links

Last week I presented a method for search result hijacking. The story got much coverage in the SEO community, perhaps due to the fact that Rand Fishkin’s authority pages were also compromised as part of our experiment. One thing I did not elaborate on in the original article was the peculiar way Google Webmaster Tools handle document canonicalisation.

TL;DR

You can see somebody else’s links in your Google Webmaster Tools as if you were the authorised user of that site. The process involves creation of identical document copy and the results are visible in about two weeks.

In the following screenshot we see what Google calls “an intermediate” link. An old link which points to our old domain 301′s to our new domain. Nothing unusual about this.

Redirect

There are other instances of the “intermediate link” in Google Webmaster Tools. One of them is related to document canonicalisation process described in a paper called “Large-scale Incremental Processing Using Distributed Transactions and Notifications” by Daniel Peng and Frank Dabek. This is exactly the same process I used in the result hijack article and the most interesting thing is that it works in reverse! (I’ll get to that later).

Here’s an example of one such case, some of you may remember this website from my hijacking experiment:

Google Webmaster Tools Links

As you can see the “intermediate link” notification suggests that my page above receives a link from Rob’s website but the thing is, it doesn’t. So what’s going on?

Well, the page I created is a replica of the original page on http://www.marketbizz.nl/en/. Google has seen my duplicate version and ‘merged’ it so to speak. Now I not only see the links pointing to my own version but the aggregate list of links for the one canonical/merged document across all applicable domains.

I am seeing the same links the owner of that site would see in their Google Webmaster Tools.

Here’s the interesting part, it works in reverse! You don’t even have to hijack the result for this to work, you can see the results with the ‘loser’ URL. If you create a duplicate page with lower PageRank (not very difficult to do that is it?) of any page on the web, you will be able to see its links in your own Google Webmaster Tools.

To test this concept I copied a PDF from another site and simply got it indexed, in about ten days I saw all its backlinks in webmaster tools, here it is:

DomainsLinks
doc-txt.com38
documbase.com19
pdfqueen.com7
cmu.edu6
msra.cn6
quora.com5
130.203.0.1335
google.com4
berbatek.com3
podpdf.com3
blogspot.com3
seobythesea.com3
writingseo.com3
newyorklastminutetravel.com2
automotivedigitalmarketing.com2
123people.com2
blumenthals.com2
psu.edu1
pitt.edu1
65.54.0.1131
52opencourse.com1
chatmeter.com1
journalogy.net1
vebidoo.de1
c4ads.org1
ryanmcd.com1
ebookbrowse.com1
uic.edu1
rightnow.com1
osti.gov1
informationweek.com1
christopherpotts.net1
sdsu.edu1
keywordspy.com1
psugeo.org1
tagwalk.com1
delib.net1

Now I can take any page/document from my competitor, place it on a domain of my choice, have it indexed and in a few weeks I’m able to see all their backlinks within Google Webmaster Tools.

It took me exactly 14 days to see the link data of Rob’s website.

whoa

Whoa indeed.

Dan Petrovic is a well-known Australian SEO and a managing director of Dejan SEO. He has published numerous research articles in the field of search engine optimisation and online marketing. Dan's work is highly regarded by the world-wide SEO community and featured on some of the most reputable websites in the industry.

More Posts - Website

Copy the code below to your web site.
x 
 

36 Responses to “Mind-Blowing Hack for Competitive Link Research”

  1. Now that’s a find. I wonder if Google will be able to quickly patch it up, or is it too deeply affected by their canonicalization process for a quick fix?

     
  2. I would imagine this would take some time before it’s sanitised.

     
    • Dejan SEO
  3. I’m not at all a technical SEO, or technical anything for that matter, so I’m always amazed (and envious) when I read posts like this. Amazing stuff, and thank you for sharing it with the SEO community.

    Someday I’d love to hear the story about how you made this discovery.

     
  4. I just saw some weird links which weren’t mine. Didn’t stop at that and continued investigating. One thing lead to another and once I’m finished playing I publish my findings.

     
    • Dejan SEO
  5. How is this any better than putting a URL into Majestic or OSE?

    GWT is hardly known for the accuracy of its backlink data.

     
    • Butler
  6. Hi Dan,
    What you found is just great and many SEOs just like me really appreciate the time and efforts you put in this. Just wanted to thank you for this and ask a question.
    When you created a duplicate of the marketbizz.nl or submitted it into webmasters did the guys there receive any notification/email/alert/warning or not?
    Regards…

     
    • Keith Wymer
  7. ok, it’s interesting, but: is it worth spending your time if you have ahrefs, majesticseo, opensiteexplorer, linkdiagnosis and other tools?

     
    • Slava Rybalka
  8. Top stuff again.. thanks for sharing Dan.

     
  9. Simple. I found links which were not in OSE / Majestic.

     
    • Dejan SEO
  10. Thanks Keith. No notification.

     
    • Dejan SEO
  11. Just want to elaborate on what Dan said – OSE is a smaller scraper than Google, so it won’t find a lot of links that are being counted. Besides, you now know for a fact every link that Google is counting :)

     
    • Jon Cooper
  12. Nice work Dan. I’ve had to explain (multiple times) to different employers that Google does not publicly display backlinks for any website, even your own. You’d have to be a verified user and look inside of GWT. This hack certainly circumvents that. Going to put it to the test. Thanks!

     
  13. Hi Dan,

    Another epic post. Do you foresee this tactic being used for evil?

     
    • Jason Mun
  14. Whoah indeed! Great discovery Dan thanks for sharing. Lets see how long Google lets it last!

     
  15. Hmm this looks a little bit evil… ;-)

     
    • Claire
  16. Nice little hack you found there Dan!

     
    • JDIZM
  17. Awesome post, thank you dan

     
  18. Hudge !!!

     
    • Vince
  19. At the risk of looking like comment spam, thanks for the post, this is awesome.

     
  20. Interesting! Let say I try to peek on other website’s inbound links with my own lower PR site than the original, and then it took about 2 weeks before I see the inbound link. If the original uses Copy Scape or some kind of alerts, wouldn’t he/she be able to found the copy before I get the data?

     
    • Alwi
  21. That is scary indeed .. and getting backlink data from GWT for any of your competitors like this is going to create quite a bit of fuss, I wonder how G is going to handle this . I should try this out for a demo site soon. Thanks for sharing this finding with us.

    Love the kind of research you are are doing , keep up the good work.

     
    • Saijo George
  22. Hmm, I wonder…If Google is reporting these as links to your site (even without a actual link), does that mean they’re passing PR to your page? The question seems to be…Is it a false report, or are they exposing how part of their algorithm works? Interesting.

     
    • Adam
  23. I am going to have to try this. My question would be if Google finds out that you are doing this will there be penalizations for manipulation on purpose…guess it wouldn’t matter if they take down the duplicate page. :)

    But I would also like to know (if possible), when you say duplicate are you doing just content because that is a sinch and if this works well, then holy crap I have some work to do before this leak in the damn is plugged.

     
  24. Very interesting, it should be tested

     
    • Pano Kondoyiannis
  25. This is super awesome hahaha!!! Wohooo

     
  26. Dan, I posted about a strange situation in an SEOMoz question: http://www.seomoz.org/q/strange-situation-started-over-with-a-new-site-wmt-showing-the-links-that-previously-pointed-to-old-site

    I think that what you saw is happening to my client’s site. The short version is that we scrapped a Penguin hit site and started fresh. We removed the old from the index and put up a new site with the same content other than the home page. There are no redirects from the old to the new. However, now WMT is picking up all of the links from the old site and attributing them to the new one. I’m worried that the new site will be affected when Penguin rolls out again.

     
  27. I am wondering what is going on in my mind is also going on in your mind. So does it mean we can increase the pagerank of our site with this technique?? For example you copy some good quality posts with good backlinks on your domain, the domain authority and the pagerank of the domain will also increase??

     
    • a_usman
  28. That’s my question too! And if the links to the original page are manipulative (i.e. Penguin like), could they affect your domain?

     
  29. Yes, none of those tools would ever display all the links one would have in their WMT account.

     
    • Rod
  30. Somebody tried it on us :)

     
    • Dejan SEO
  31. Heya, I tried it and it doesn’t work for me.. (waited 2 weeks though). Is it not working anymore ?

     
    • Duracellite
  32. hey man very intresting ,but i doaubt bout some thing,i searched alote ,byt not get better results, my question is if we add our url as ?url=http://www ,this is calculated ad back links?

    example http://seocheki.net/site-check.php?u=http://www.maccopacco.com
    is this count AS backlink for maccopacco ,pls help

     
    • jinu
  33. But WMT does not have this info either

     
    • Jon
  34. Is this hack still working today, or did they find some way around it???

     
    • josephjrobison
  35. Does this still work ? I think 10 monthes are enough for GG to solve this problem ^^

     
    • Cyril KUHM
  36. Hi Dan,

    This is very interesting. I never thought this could work out since I’m not really aware of my competitor’s SEO performance. Anyway, this is really helpful to track the links and make some assessments to my own site. Thanks a lot for sharing this informative article.

     
    • Spook SEO

© 2014 DEJAN SEO. All Rights Reserved. Privacy Policy by TRUSTe - Link to us
DEJAN SEO PTY LTD - ABN 77 151 340 420 - ACN 151 340 420
National Sales Number: 1300 123 736 - International Callers: +61 7 3188 9200

Privacy Policy by TRUSTe Google Partner