Archive for ‘Research’

Anchor Text Harvesting

This article explains how to crawl a single domain and generate a complete list of all external anchor text used.

80legs

  1. Log in to 80legs and create a new job. You can name it whatever you like.
  2. Add the URL of the domain you wish to crawl in the “Simple” mode of the “Seed List of URLs” field.
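
To make the goal of the crawl concrete, here is a minimal sketch of the extraction step itself: pulling the anchor text of outbound links from one already-downloaded page. It does not use 80legs or its API. The class name `ExternalAnchorParser`, the helper `external_anchors` and the example URLs are hypothetical, and "external" is assumed to mean a link whose host differs from the crawled host.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class ExternalAnchorParser(HTMLParser):
    """Collect (url, anchor text) pairs for links that leave the crawled host."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.base_host = urlparse(base_url).netloc.lower()
        self.anchors = []   # finished (href, text) pairs
        self._href = None   # href of the currently open external <a>, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        absolute = urljoin(self.base_url, href)  # resolve relative links
        parsed = urlparse(absolute)
        # Assumption: only http(s) links to a different host count as external.
        if parsed.scheme in ("http", "https") and parsed.netloc.lower() != self.base_host:
            self._href = absolute
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Join fragments and collapse runs of whitespace.
            text = " ".join("".join(self._text).split())
            self.anchors.append((self._href, text))
            self._href = None


def external_anchors(html, base_url):
    """Return a list of (url, anchor text) tuples for external links in html."""
    parser = ExternalAnchorParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.anchors


page = (
    '<p><a href="/about">About</a> '
    '<a href="https://other.example/page">Other  <b>site</b></a> '
    '<a href="mailto:x@y.z">mail</a></p>'
)
print(external_anchors(page, "https://example.com/"))
```

Note that this simple host comparison treats subdomains (for example `www.`) as external; whether that is desirable depends on how you define the domain being crawled.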
 

How to get 25,000 +1′s and PageRank 7 in one week

WARNING: Misleading title.

Now that I’ve got your attention, I’d like to point out just how smart Google has become over the years. It seems to understand when a site has been moved and to consolidate that site’s signals accordingly, without any canonicalisation or 301 redirects.…

 

Penguinflux or Hummingbird?

Today we upgraded Algoroo to include all available SERP flux data from December 2012 to the present. One thing that strikes me as unusual is the impact Penguin 2.0 left on search results. It seems that search result volatility never returned to its pre-Penguin 2.0 level, which hints at the possibility of a whole new real-time search quality algorithm.…

 

Tested: Google’s New Barrier to Entry

Penguin 2.0 did many great things for the quality of Google’s search results, but one of them went largely unnoticed by the mainstream SEO community. Since the last update, several people have approached me after noticing a new layer of protection against spam entering Google’s results quickly.…

 

How to Visualise Very Large Websites

If you’re running a small to medium-sized website, visualising website architecture isn’t difficult. But what do you do if your website exceeds 100,000 or even 1,000,000 pages?

Visio, PowerMapper, SmartDraw and similar applications won’t do the trick. They produce great visualisations relatively quickly, but they fail, break or time out when facing websites such as eBay and Amazon.…

 

Results: Ten Months of Google Algorithm Changes

Data Source: Algoroo

We process search query data for thousands of keywords every day and observe fluctuations in their positions. By adding both upward and downward movements into a single movement score, we’re able to map the level of change on a daily basis.…
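
One way to read "adding both ups and downs into a single movement score" is to sum the absolute position changes between two consecutive days. The sketch below illustrates that reading only. Algoroo's actual formula is not given in this excerpt, and the function name `movement_score`, the keyword-to-position dictionaries and the sample numbers are all made up for illustration.

```python
def movement_score(previous, current):
    """Sum absolute position changes for keywords tracked on both days.

    previous, current: dicts mapping a keyword to the position of a tracked
    result on each day (a hypothetical data layout, not Algoroo's).
    """
    shared = previous.keys() & current.keys()
    # abs() makes a drop of 4 places count the same as a gain of 4 places.
    return sum(abs(current[k] - previous[k]) for k in shared)


# Illustrative data: positions for three keywords on two consecutive days.
monday = {"red shoes": 3, "blue shoes": 10, "green shoes": 5}
tuesday = {"red shoes": 1, "blue shoes": 14, "green shoes": 5, "new keyword": 2}

print(movement_score(monday, tuesday))  # 2 + 4 + 0 = 6
```

Keywords that appear on only one of the two days are ignored here; a real system would need a rule for results that enter or leave the tracked range.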

 

Google’s Results Change During Weekends and Weekdays

We’ve been tracking volatility in Google’s results for a while now, and one thing that keeps popping up is a high level of result changes during weekends. The change typically lasts two days, but it can range from one to four. It looks as though they change the results in time for the weekend and then restore them to a “week mode”.…

 

Introducing: HyperPagerank

A ‘traditional’ link graph is a directed graph consisting of nodes connected by edges: a schematic representation of a document collection connected by directional links (A → B). Link-based search engines (such as Google) rely on directed link graphs to determine varying degrees of (query-agnostic) importance of pages in their index.…
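
As a rough illustration of how a directed link graph can yield a query-agnostic importance score, the sketch below runs a simplified, textbook PageRank-style power iteration over a tiny graph. It is not HyperPagerank and not Google's production algorithm. The function name `pagerank`, the damping factor of 0.85, the iteration count and the three-page graph are assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a directed link graph.

    links: dict mapping each page to the list of pages it links to (A -> B).
    Returns a dict mapping each page to a score; the scores sum to 1.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    if n == 0:
        return {}

    rank = {p: 1.0 / n for p in pages}  # start from a uniform distribution
    for _ in range(iterations):
        new = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its damped score evenly across its out-links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new[target] += share
            else:
                # A page with no out-links spreads its score over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new[p] += share
        rank = new
    return rank


# Illustrative graph: A -> B and C -> B, while B links nowhere.
scores = pagerank({"A": ["B"], "C": ["B"]})
for page in sorted(scores, key=scores.get, reverse=True):
    print(page, round(scores[page], 3))
```

In this toy graph B ends up with the highest score because it is the only page that receives links, which is the intuition behind link-based importance.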

 

Google+ Hashtags: Initial Observations

In my opinion, one of the more interesting aspects of the Google+ redesign is the new way #hashtags are handled. This short post goes over a few basic observations I’ve made so far.

Triggering

What I find interesting is that certain keywords trigger the tag, while others do not.…

 

Crowdsourcing: Googler’s Examples of Bad Links

Google’s webspam team consists of people who look at spammy links on a daily basis and can recognise bad links quickly. These people also help webmasters recover from penalties by guiding them through the reconsideration process. Part of the process often includes examples of remaining bad links and this is what we’re interested in.…

 

© 2014 DEJAN SEO. All Rights Reserved. Privacy Policy by TRUSTe - Link to us
DEJAN SEO PTY LTD - ABN 77 151 340 420 - ACN 151 340 420
National Sales Number: 1300 123 736 - International Callers: +61 7 3188 9200