Distribute Your Link Sources

Imagine you want to establish yourself as a true leader in your market space. However, the market space has been around for a few years, and there are other sites that have already established themselves as leaders. A natural thing to do is to backlink those leaders (i.e. use a tools such as LinkScape to get a map of who links to your competitor).

The next thing that people often do is to start contacting people at the best sites (with the most link juice) that link to the competitor. No question that this is a good idea, but the analysis should not stop there. Here are the two major reasons why:

  1. The first problem is that even if you proceed to contact everyone that links to your competitor you are likely to only get about 10% of those people to link to you (10% is actually a very good result). However, we have already stipulated that you want to be a leader, not 10% of a leader. So while this is a good strategy, it should not be the only strategy. You need a strategy that will help you catch up with (and pass) the competitor.
  2. Most people recognize that contacting every single site that links to a competitor will get a little bit tedious. So they target the obvious high end links. This is OK, but there is a real danger that your links will become one dimensional. What I am getting at here is that your competitor defines your challenge by more than just their top links. The scope and breadth of those links is equally important to understand.

Expanding upon the 2nd point above, the search engines already know how to assess the relevance of a link. They also recognize sub-sectors within each market space, such as news sites, blog sites, sites selling related commercial products, academic sites, industry organizations, not to mention the scores of categories of sites that are tangentially related to the space, and therefore refer to link to sites in it from time to time.

A true leader in a market space will most likely have links from all of these sectors. A single focused link building campaign will not give you this type of breadth. To think about it in non-SEO terms (or in a non-search engine world), a single successful marketing campaign does not make you a broadly accepted leader in a space.

You need breadth too. You need to implement campaigns that will touch all of the various segments of the market. Note that you don’t have to do these all at the same time, but over time you want to touch as many different relevant segments as possible.

One great way to do this is with PR related strategies (there are other ways to achieve breadth in links other than via PR, we are just using it for an example here). Create some truly unique content or tool, and then work hard to let the world know about it using PR. Fuel the fire by implementing more great content and tools. This may seem like a hard road, but you can’t get to a leadership position in a market space without doing these types of things.

But, the focus of this post is not to tell you to build great content and promote it well (although you should do that). The focus is on putting together a map of what defines breadth for your market space and implementing a link building and promotional strategy to achieve breadth in your link building. To help do this, there are two questions you should ask:

  1. What does a well distributed link graph in your market space look like?
  2. How did your market leading competitors achieve their breadth, i.e., what were there promotional strategies?

To answer these questions you will need to do more than look at the best links your competitor has received. You will need to map out a broader picture of different market sectors that have linked to them along the way. I don’t have a simple formula for doing this, other than getting detailed backlink reports on your leading competitors, and analyzing hundreds of the links they have and classifying them. What categories to use will vary for each market, but you can certainly start with the list of categories I mentioned above.

The next step will be to figure out what types of content, tools, and marketing you will need to build a similarly broad profile. With this plan in hand you will be well on your way to being able to design a market leadership strategy for your business.

Latest Interview: Microsoft’s Dr. Scott Prevost – Comment Here

This week I am publishing a recent interview with Microsoft’s Dr. Scott Prevost. Scott is the principal development manager in the Powerset group at Microsoft. I used this discussion to get some insight into what Powerset is about, and how Microsoft intends to use it as a part of their core search technology in the future.

Powerset focuses on natural language search, which is a concept that certainly has a lot of potential. At the moment, the amount of this technology that has been integrated into Live Search is realtively small. You can see a test platform running the Powerset technology on Wikipedia at http://www.powerset.com.

Latest Interview: YouTube PMs – Comment Here

One of my recent interviews was with Tracy Chan and Matthew Liu of YouTube. Some of the areas discussed include:

  • YouTube Insight’s Popularity feature
  • YouTube Insight’s Discovery functionality
  • YouTube Insight’s Demographics tool
  • The ability to export data from YouTube Insight
  • How Promoted Videos can help you launch a video
  • Other tactics for driving video views
  • YouTube video optimization basics

All in all, a pretty good primer for learning about how to get started with YouTube.

Latest Interview: John Mueller – Comment Here

This week I get to post an interview with well known Googler John Mueller. John works in the Google Zurich office and has a lot of insight into the inner workings of the Google algorithms as well as their webmaster guidelines and policies.

Our interview was laser focused on the issue of cloaking, as well as First Click Free. We discuss many different aspects of cloaking, such as:

  • Replace bad URLs (with session IDs and superflous URLs) with SE friendly URLS.
  • Multivariate testing
  • Rendering content within a graphic to a search engine in text, including sIFR.
  • Fix up complex javascript/coding, or Flash messes, including SWFObject.
  • Language/country/region cloaking
  • Serving up different content based on cookies.

Check it out!

Trust and Link Building

One of the things that is evolving in my philosophy of SEO is how I look at the role of trust. Trust was not something that was important during AltaVista’s hey day, when keyword density was king, or even during the early days of Google when PageRank in its purest form ruled the day.

Another big thing in my mind these days is that the number of factors, and the importance of each factor, involved in ranking algorithms has changed. When the PageRank paper was published, you pretty much had a blueprint for how it all worked. However, knowledge is power, and in this case the power was in the hands of the spammers.

As a result of these factors, numerous patents have been published by each major search engine, on a variety of topics related to ranking, yet these patents no longer provide a clear roadmap to ranking algorithms. They provide hints as to what the search engines could choose to use a ranking signals, but they don’t tell us what they do use. For this we have to rely on intuition, judgment, and testing. In evaluating ranking signals I believe there are two major factors:

  1. Noisiness of the signal. Does a strong positive always, or nearly always, mean a good, relevant site? Does a strong negative always, or nearly always, mean a poor, or less relevant site?
  2. Importance of the signal. Assuming that we have a signal that is not noisy, how significant an indicator is it when compared to other signals? What made links such a powerful element is that they were, and still are, a powerful indicator of relevance and quality.

One example of a noisy signal is Bounce Rate. In principle, the idea is that when a user goes to a site, and returns to the SERPS after a relatively short period of time, that this is an indicator that the result was not a good one. But the problem with it is that on a reference search (e.g. zip code for Charlotte) the user may have gotten what they want in just a few seconds.

One of the signals that I think has low noisiness and a high degree of importance is trust. One important paper on this topic was published in 2004 by Yahoo! and Stanford University. The paper was titled Combatting WebSpam with TrustRank. The paper proposes that the search engines use human editors to identify a site of highly trusted seed pages. Then, “once we manually identify the reputable seed pages, we use the link structure of the web to discover other pages that are likely to be good”.

The general notion is that the closer a web page is to a highly trusted page (closer as mentioned in number of link hops) the most likely it is to be a trustworthy page. You can think of a Trust Rank factor that reduces the overall trust level of a page based on the number of hops from the human reviewed seed sites. The paper also suggests that links placed on pages with lots of links (even if they are on one of the human selected seed pages) tend to be placed with less care than links on pages with very few links. As a result the trust communicated by two selected seed pages can differ.

The researchers who wrote the paper on TrustRank also authored an interesting paper on a concept they call spam mass. This paper lays out a method for calculating the percentage of a web sites total PageRank that results from being linked to by spam pages. The higher this ratio is the greater the likelihood that the site is itself a spam site. There are some obvious problems with this idea – as your competitor could buy links to your site from thousands of poor quality sites and potentially trash your rankings by giving you a high spam mass. Nonetheless, the concept is an interesting one.

Once you start thinking about these things, it is easy to come up with fresh ideas. For example, the trust damping factor you might apply for each link between a given page and the highly trusted seed pages could vary depending on the TrustRank level of the intervening web pages. It could also vary depending on the trust level of the domains on which those pages reside. For example, the home page of http://www.usa.gov may be deemed to be a highly trusted seed page.

In three hops you may be able to find yourself on links fairly well removed from the seed page. But if those three hops take you to a different page on USA.gov, do you lower the TrustRank as much as you would if you have transitioned to a completely different domain? Maybe not. You can also think about the notion of “Reverse TrustRank”. This is the notion that if your site links to spammy sites that this should lower its TrustRank. This thought should provide ample motivation to make sure that you take care to not link to any bad sites. Better still, you should screen your site for this on a regular basis. After all, the quality domain you link to today may have a different owner tomorrow, and that new owner may have poor intentions. Don’t let this happen to you!

Summary

So we don’t have as clear a roadmap to ranking factors as we used to, but we can still use common sense. Take the time to learn the basic tactics that search engines can use to do their job. Don’t get overly hungup on any one factor. Most of the time, if you build a great site and promote it properly you should achieve good results. But, knowing what things search engines are likely to rely on can really help you understand what you need to do to improve your rankings.

In addition, the way the search engines use and measure trust most likely varies significantly from the papers reference above, but my own experimentation convinces me that they are measuring it, and using it in a significant way. We factor this into our thinking at STC on a day to day basis.

Latest Interview: Sarah Bird – Comment Here

After a brief break from posting regular interviews, I am back at it! This week I have a great one, with Sarah Bird of SEOmoz. As a general counsel, Sarah handles a variety of different legal issues, and there are some issues that face both those who work as SEO professionals and the companies that engage them. For example, we talk a bit about performance based SEO contracts, fair use, DMCA requests and more. Good stuff!