Google Plus SEO Audit
Update: 8 January 2013
Somebody at Google has decided to use more rel=”nofollow” throughout Google+ and here’s an example:
URL within the post is nofollow, comment URL is nofollow, comment poster URL is nofollow (that was always like that) and the shared post URL is also nofollow.
Wow. OK, let’s switch to activity layer:
“+’ed” by is nofollow and reshared is nofollow. The question remains, what is follow?
The primary post URL is still followed. This makes sense as that’s typically a deliberate link topically related to the post and created by dropping the URL in the initial message field or using the link icon. One bit they missed in their anti-spam patch-up is the +mention as you can see in the example below:
Judging by the posts we analysed the update took place on the 5th of January 2013.
Learn from the best they say, so I decided to run an SEO audit and analyse the architecture of Google+ to see if I can learn anything useful.
Google+ has a number of canonical issues and I will highlight one of them. On my posts page there are 11 instances of links to /u/0/
This causes everything to have two URLs with parameters which don’t change the content of the page. Something webmasters are advised to control through rel=”canonical” and/or parameter settings in Google Webmaster Tools.
Here’s an Example:
Their robots.txt file disallows /s/ which is related to search results, though these URLs can still appear in results, for example: https://plus.google.com/s/SEO appears in this search: https://www.google.com.au/search?q=site%3Ahttps%3A%2F%2Fplus.google.com%2Fs%2FSEO&pws=0
What’s amusing though is that anything that links to a noindexed directory goes in, even uppercase/lowercase variations as illustrated above.
- https://plus.google.com/u/1/+Coca-Cola/posts (info: does not resolve)
- https://plus.google.com/u/0/+Coca-Cola/posts (info: does not resolve)
- https://plus.google.com/u/1/113050383214450284645 (info: resolves to the code based URL)
- https://plus.google.com/u/0/113050383214450284645 (info: resolves to customised URL)
UPDATE: Source code on /posts page contains <link rel=”canonical” href=”https://plus.google.com/+Coca-Cola“> and canonicalises to relevant pages on /about, /photos and /videos.
Unrelated to this audit, but still amusing, is that when you search for Coca-Cola you find Pepsi (who don’t even have their vanity URL yet):
What is interesting is that the “Co” in Pepsi’s page has been triggered as a relevant search term to “Coca Cola“. Why?
Reader Comments and Suggestions
@dejanseo As far as i’m aware /u/0 and /u/1 are used to control which account you want to view if you’re logged into multiple G accounts.
— Mark Hannant (@tripleox) September 10, 2012
Duplicate Content & Hidden Links?
If you investigate links to my Google+ profile you will see the following list:
- Case 1: I +1’d Dan Russell’s post and my link is nofollow, I am not sure why this shows up in the link: if it’s not part of Google’s link graph. In any case I accept that there is a connection.
- Case 2: The hangout post links to my profile directly as I was one of the participants. No problems there.
- Case 3: This one was confusing as it had nothing to do with me. It’s a post about a car to which I did not engage in any way. What’s going on?
Here’s how it looks like:
So how’s this duplicate content? Well, if the same snippet is contained on a number of recent posts in addition to the posts page and the post URL itself we’re looking at more than 3, possibly 10-20 duplicated chunks of content. Now, clearly the ‘hidden links’ are not there for malicious or manipulative reasons so I’m letting Google of the hook for that one. At this stage I am more concerned about their content duplication issues.
Google+ sitemaps have been broken up into small chunks ranging from 0 to 47,999 and referenced from the robots.txt as http://www.gstatic.com/s2/sitemaps/profiles-sitemap.xml
I am not sure about their scheduling but given the amount of new users joining their social platform one would expect fresher data:
As you can see their last modified date is set to 20th of April 2012.
Another unusual thing is their referencing of URL pages:
It could be that users with “profiles.” have not actually joined Google+ yet. I attempted to access the .profiles page with a plus. URL and noticed a neat 301 redirect back to the .profiles URL.
Google’s advice to webmasters is not to worry about PageRank sculpting and that using rel=”nofollow” on internal links is time wasted on something which could have produced better results, for example write good content or engage on social networks. What puzzles me is why Google+ then uses nofollow or many of their internal links and what exactly they are trying to achieve with that.
This is my question for Matt Cutts, if you’d like it answered please vote for it on Google Moderator:
Google+ is using rel=”nofollow” on some of its internal links. Why?
“Sometimes we try to be cautious, just so we don’t become any sort of spam attack vector ourselves. Over time, as we come up with ways to make sure that things are spam-resistant, we might remove that nofollow attribute though.” Matt Cutts, Mountain View
Although I do appreciate the answer I still don’t understand how this would help since they’re not using nofollow on external links and using it on only selected internal links. It just doesn’t strike me as an anti-spam prevention measure.
I started mapping the architecture of Google+ in order to understand why they did this and have initially come up with a few modes of various entities and actions on their social network:
In the next stage I started mapping various scenarios starting with posts, both original and shared.
When you create your own post links will flow naturally unobstructed by nofollow, however when people comment, +1 or share your post, link to their profile will be nofollowed. This applies to both users and pages. Exception to the nofollow rule are +Mentions which flow naturally as well as any links shared in post comments, including links to external sites which is odd. It seems Google is not afraid of linking out and counting on the post author to moderate.
Shared posts operate similar to original posts except that the original author’s profile link gets a nofollow. Why? To be fair the link to the original post is a dofollow.
Photos and images have two distinct modes. They can appear in context of a post (either as tagged, original and shared) and as a gallery item. Tagged people are nofollowed in the post mode, but like most other things are dofollow in the gallery mode. In gallery mode live links to external links are not possible. Pasting a URL in the gallery-mode comments will simply display it as text and not include the hyperlink.
- Gallery Page: https://plus.google.com/photos/106901586566350117179/albums/5534350902409677473
- Gallery Image: https://plus.google.com/photos/106901586566350117179/albums/5534350902409677473/5619813418937964802
- Image Share Post: https://plus.google.com/111588754935244257268/posts/JpSWtTDbiNT
Essentially the same thing as photos, in both shared post and gallery mode. Shared author gets a nofollow, original post is followed, hashtags, +Mentions and live links are also followed.
In the gallery mode for this video URLs are not hyperlinked and everything that is linked up is dofollow.
As you’ve already seen from examples above, comment authors are followed links only in media pages (images, videos) in gallery mode. Everywhere else they’re nofollowed. Comment +Mentions and URL do not contain nofollow.
Events are nofollow free, posts, comments, mentions – the works. Here’s an example.
One thing that confuses me about event posts is the two similar post types created with the event. One is supposed to be an actual post (it contains ripples option, event description and the “on air” button) and the other one is the event itself in form of a post where people can interact (RSVP) their attendance and invite others.
Followed links on event posts (apart from the usual ones) are selected three who have set their attendance to “Going”. Neither of them have a post URL you can easily grab as you would with everything else by clicking on the post date. Instead the URL is available once the event post is re-shared. How strange!
Here you can see examples of shared event and event post which now have available URL:
- Shared Event: https://plus.google.com/111588754935244257268/posts/cxgYrAHadMD (only visible to my extended circles, you’ll have to add me to see it)
- Shared Event Post: https://plus.google.com/111588754935244257268/posts/My9vQhzr7eA (available publicly)
Note that one of them doesn’t have the original “post” link reference whereas the other links to the event page instead.
Hangout participants listed on a public post are all followed profile links:
Links to people in shared circles are nofollow:
Pages & User Profiles
There are three key areas I observed with pages and user profiles on Google+. First one is the fact that the tabs: posts, about, photos, videos…etc basically all represent one page cached in Google and almost simulate URLs for the purpose of usability.
Basically, what I’m saying is that at least two of the above URLs in orange are essentially the same page in Google’s cache (posts and about). Further to that the followers and following segments (in red) doesn’t seem to be in cache nor source code so this means that adding people for the purpose of manipulating PageRank will not work. Finally the page’s official URL (in green) is an enabled link which makes sense.
Engagement on the place page is rewarded by a dofollow link for reviewers who have a Google+ profile activated. The rest are not hyperlinked.
This article may be updated again soon.
Drop me a line if you find something interesting and I’ll add you as the article contributor.