A Mathematical Model for Assessing Page Quality

Guest Post by Ashutosh Garg

In this article, by page quality we refer to quality of a page with respect to a search query and a user who issued the query.

Page quality is a broad concept and depending upon the actual context in which one plans to use quality score, the actual algorithm will vary. Instead of going into a specific algorithm, this article will present a framework in which to think about page quality and how one can morph it to fit one’s unique situation.

Some of the situations where page quality is used are

  1. Search Engines – search engines score a page with respect to query and use this signal to understand if a page might be relevant to a user’s query or not. Additionally, by assigning a numeric score, one can identify one page is “relatively” better than the other one or not.
  2. Ad Targeting – when showing a particular ad to a user, an adnetwork may score the ad and corresponding landing page against user issued query and use it to identify if the ad is indeed relevant to what user is looking for or not.
  3. Discovery – A page can be evaluated, even in absence of query, to understand its quality and thus identifying if this page should be recommended to end user or not.

In this article we will consider the different algorithms used to assess page quality.

The first set of algorithms will compute the score of the document as a function of the actual query issued by the user –

IR score – Information Retrieval community has been researching how to compute the best possible score for a page given a query. This is probably the most important score one can use in evaluating a page. Various open source search engines like Lucene implemented this algorithm. Given a query Q={q1, q2, q3} containing three words and page P, various steps that are used in computing the score of the page are

  1. Come up with a relative weight of each section of the page – A typical webpage can be divided into various components like – title, headings (H1, H2, H3..,) body, bold text, large text, small text (based on font sizes), text above the fold of the page (assuming a certain display), anchor text, boiler plate text, text on pages being pointed to, text on pages user visited to prior to visiting this page, text present in images on the page, URL text etc. Depending upon the application, one can assign different weight to different elements of the page. One rule of thumb is to see how people are going to discover the page and form their first impression. If it is search – people will discover page by reading the title and snippet displayed by search engines. People will form their first opinions by reading the text above the fold.
  2. Generate features based on query – Take the query and break it down into n–grams (a bigram is all phrases of length two). This is followed by assigning weight to each of these n–grams. For e.g. consider a query – “canon digital camera”. In this query, “canon” is an important unigram as it refers to the brand. “canon digital” is a bad phrase while “digital camera” is a good phrase. Traditionally people have used TFIDF (http://en.wikipedia.org/wiki/Tf*idf ) to come up with a weighting. One thing to be cautious is which dataset that is used to compute TFIDF. It should very closely resemble the dataset where weighting is applied.
  3. Document quality for computing TFIDF score – A document that consists of the content of all the pages on the web will match any query. However, it is not a great experience to come across a very large document. At the same time a document which is identical to query is bad as a user won’t learn anything new when (s)he lands on the page. Review what the platform is used by most of the visitors of your website. If they are using smartphones, ideal document length should be less than 500 words, tablet – 1K words, laptop – even longer given the presentation. Some way of normalizing the score by document length should be used. Various papers have been published in IR community to normalize IR score based on document length.
  4. A simple scoring of a document can be

Page P consists of fields di, with weight wi and query Q consists of words qk. Length of page is L, number of phrases in query is Nq

where f is a normalization element function based on doc length.

Which page has a higher IR Score for “Canon digital camera”?

Continue Reading…

Stefan Weitz on Quora Integration and Social Data Sets

Key Points

  1. Quora is one of the latest networks that Bing has socially integrated into search. Quora helps to disambiguate and make sense of some of the different facets of certain search terms.
  2. The goal of using Quora (and other social integrations Bing is doing) is to replicate the very human characteristic of involving others in decision-making.
  3. Actions, opinions, and recommendations are happening all around us, it’s not as if we need to say anymore or do anymore. We need something (like Bing) to find that little needle in the information haystack and bring it right into your search experience.
  4. Bing is continuing to add more networks to it’s social integration. The focus is to find the best method to expose the networks in a way that fits in the search experience.

Full Interview Transcript

Eric Enge: Hi. I’m Eric Enge, I’m the CEO of Stone Temple Consulting. We are a holistic internet marketing optimization firm. Whatever that is! I’m here again with Stefan Weitz in what’s becoming a regular series of discussions about new social integrations from Bing. How cool is that?

Stefan Weitz: It’s very cool, and Eric, I think you can just basically say you’re the premier digital, online, SEO/SEM, marketing service on the whole planet, and you’d be fine.

Eric Enge: That’s excellent. But, it’s much better when you say it than when I do. So, the case of wine or whatever it is you choose is going to be shipped to you…

Stefan Weitz: I’d like some Metromint water. That would be great.

Eric Enge: Yeah. Here we are, by the way, both showing our unshaven looks. So, I think shows that the search industry remains a casual place to hangout.

Stefan Weitz: I did shower.

Eric Enge: Yes, a brief reintroduction to you would be great.

Stefan Weitz: Sure. My name is Stefan Weitz, as it was last time. I work at Bing. I do a lot of work with start-ups, entrepreneurs, academics, and venture capitalists to really understand where search is going and how to advance the state of the art. Then, I work with the product teams both to help incorporate those things into the product, as well as evangelize those things once they actually reach the product.

Quora LogoEric Enge: Cool. I really like that you’re continuing these social integrations. The latest is Quora. Tell us a little bit about that.

Stefan Weitz: Quora is a great one. It’s one of those things that we looked at. We always, in the back of our head, we wanted to go and tap this data set, because it is such a different set than, say, Facebook or Twitter, or what not. In many cases you get these layers of answers and they’re layers for answers on a variety of topics. I’m just looking at the stream here coming in. I see Jodie Foster was one that had a high Quora. There must have been a discussion on Quora about that.

…Quora expands the canvas of what one could talk about when looking for particular things online.

Now, you could get a bunch of information on Jodie Foster in standard search, and you could even maybe see one of your friends liked the movie she was in with our current implementation of social search in the sidebar. But really, Quora expands the canvas of what one could talk about when looking for particular things online.

Continue Reading…