Interview with Google’s Frederick Vallaeys

A couple of weeks back I interviewed Frederick Vallaeys, who is a Senior Product Specialist for Google AdWords. We covered a wide range of topics, with a review of some recent product announcements, and also some tips and tricks. In this post, I will summarize the main points of the interview, but do read the full interview if you want the details.

Universal Search for AdWords: The AdWords team is actively looking at the types of things that have worked in the organic search results. Clearly the introduction of images, videos, maps, and other elements has been a great success in web search, so the AdWords team is beginning to incorporate similar elements. So if you search for a movie, you may get an associated video clip as part of an ad.

Another feature that has been ported over is Sitelinks. As an example of this, check out the search results for Orbitz. This feature will come up in particular when you do branded searchers (and the advertiser has turned it on).

Additional pricing models have been added as well. Try a search on mortgage to see an example of one of them. Google calls these “Comparison Ads”. They offer the user a simple for to fill out. Once the users fill the form out, the information is shared with a few lenders. Note though, one cool additional feature – Google anonymizes the user’s phone number and provides an alternative phone number to the lenders, and when the lenders call that number, Google redirects it to the user’s actual phone number.

Another new pricing model is called “Product Listing Ads”. This is a model for retailers to list products on Google and pay on a cost per acquisition basis. Google pulls matching vendors from the Google Affiliate Network, and given the CPA model this presents little risk to the advertiser. Pretty cool.

As we switched into “tips and tricks”, Frederick led off with the Content Network. As I noted during the interview, the Content Network got off to a bad start because it was bundled so tightly with regular web advertising. This is a problem because the usage of keywords is completely different. Keywords in web search relate to actual user queries. In the world of the Content Network, Google uses keywords to find web pages of participating publishers that have those words on their web pages. If the match is strong enough, the AdSense box with your ad in it will be displayed. A completely different algorithm with many implications.

The other truth about the content network is that the user is in a different mindset. The search user is already looking for something. Someone visiting a web site is probably looking for something else, and you are now trying to get them to look at your product or service. A pretty different mentality, and the best results are obtained if you create pretty different looking ads.

But, if you do these two things well, you can be well off to the races. Frederick reports: “we found that those using the network would typically get 20% of all their leads and conversions from the Content Network”. That is pretty significant. Also, Google has added the ability to show you View Through Conversions. This is essentially analytics data showing you how many of your buyers saw a Content Network ad prior to making a purchase. With this data you can see what sales the Content Network “assited” in getting for you.

Frederick’s next tip was about making use of Conversion Optimizer. This is a free tool that allows you to manage your keywords on a cost per acquisition basis. This is the type of thing that bid management tools do for you, but it is free. In addition, Google can leverage data it has more easily than the pay for bid management tools can, such as geographic data (where the searcher is located) and adapt the bids for your keywords on a per query basis. The tool does not allow management on an ROI basis yet, and the pay for tools offer other features, but for many advertisers, Conversion Optimizer will be enough.

Last up was the search based keyword tool. The tool identifies missed opportunities, such as cases where a company has a page getting organic search traffic related to a product, but there are no keywords being bid on for the same product. The tool presents you with both the keyword and proposed landing page, which makes acting on the suggestions really easy.

There were several other things in the interview, so check it out if you want more.

11 Notes from my Interview with Pankaj Mathur

Just over a year ago I did an interview with Pankaj Mathur, the VP of Sales for InfoGroup. I enjoyed the initial interview tremendously, so I really looked forward to the opportunity to do it again. As with the first interview, our conversation focused almost entirely on the local search data problem. What makes it hard, and some of the solutions that InfoGroup has in place to deal with it. If you are interested in understanding the complexities of extracting and presenting accurate local data, then the first interview, and this year’s interview with Pankaj Mathur is for you.

For those of you who want the CliffsNotes version, here are some of the major points that emerged from the interview:

  1. Pankaj Mathur: (in our data) “we have approximately 15 million businesses in the U.S., 1.4 million in Canada, and between 2.5 and 3 million in the UK. … “In the North American market last year, we made over 25 million phone calls”, … “It’s all operator-driven, although we do use a smart dialer”.
    My Comment: Right away we get a sense of the business problem. InfoGroup, which as a company prides itself on the quality of its data, feels is it necessary to make 25 million phone calls to validate the data in the U.S. and Canada. This is a far from trivial expense, and one which you can assume they undertake because of the inherent difficulty in building an accurate local search data set.
  2. Pankaj Mathur:” We get approximately 6,000 phonebooks every year, but we do not necessarily compile each phonebook every year”
    My Comment: As you will see later, Pankaj does not believe that having businesses submit their own listings is a great solution, because of the inconsistencies in how they maintain that data. But, chances are, that they keep their advertising up to date. Better still, if they go out of business, they probably cancel the ad.
  3. (Note: InfoGroup evaluates data using four metics, and this is the first one) Pankaj Mathur: “Completeness – is a measure of the total number of listings and in some sense reflects coverage”
    My Comment: An important metric (of course). If you are in Saginaw today, and you are looking for a dry cleaner, you want to know that the search you are performing will return the one closest to you.
  4. Pankaj Mathur: “Infogroup has around 15 million companies in U.S., the IRS claims about 20 million, and the Chamber of Commerce claims about 24 million
    from a Chamber of Commerce perspective, if a license was filed back in 1959, it is considered a valid business in 2010 as long as the owner is still alive and has not filed for bankruptcy” … “Infogroup has defined a business as a brick-and-mortar store having a phone number and location address”, and later: “Google, or any search engine, follows a much broader definition of business or point of interest”
    My Comment: This is one of the key problems with local search. What do you define as a business? Here are some related additional points from the interview:

    • Pankaj Mathur: “A restaurant may have a drive-in, bar and an ATM on premises”
    • Eric Enge: “Say a business only has a P.O. Box for an address; is that something that you would count as a valid business”? Pankaj Mathur: “Yes, we do, such scenario can occur for A financial advisor or a tax consultant working from home”
    • Eric Enge: “what about a kiosk-based location like an ordering terminal in a shopping mall or at an airport”? Pankaj Mathur: “When you look at the corporate list, they will tell you that there is a Dunkin’ Donuts or Baskin-Robbins at a particular address, which may actually be a retailer. In this case, what we usually do is make a decision on a case-by-case basis. For the example above, there probably isn’t enough evidence to necessarily route somebody, looking for Dunkin’ Donuts, to a grocery store just because grocery store has a shelf where you could pick donuts of certain brands. … There are cases like an ATM location inside a bank that is still considered a line of business, and we will compile it.”
    • Eric Enge: “If someone is working as a plumber out of their house and they use their house as the brick-and-mortar address, are they counted”? Pankaj Mathur: “Yes, that will work”

    My Comment: These are just some sample scenarios. We have covered here how InfoGroup handles these scenarios, but each local business search player needs to make their own decisions about these things, and are likely to make different decisions. Is your head hurting yet?

  5. Pankaj Mathur: (this is the second metric) “Conformance – in some sense this implies standardization or adherence to structure” … “we may come across a Hilton listed under ‘Banquet Halls’ and no mention of it under the ‘Hotels’ category. We have quality control rules and audits in place that helps ensure that all Hilton locations are assigned to ‘Hotels’ as a primary line of business”
    My Comment: Another layer of complexity. A given business may consider itself to be relevant to many different categories of business. A restaurant may offer catering services, for example.
  6. Pankaj Mathur: “it is possible that a Hilton shows up as a golf course … We can call this Hilton and verify objectively if there is a golf course attached to it, and then assign the appropriate categories to the record”
    My Comment: Expanding on the prior point, not only is it important to make sure that you have identified the primiary line of business, but you do want to categorize the alternative lines of business as well. Someone looking for a golf course might want to know about the one at the Hilton, for example.
  7. Pankaj Mathur: (this is the third metric) “Accuracy. This is the probably the easiest of all four to understand, because it is factual, but Accuracy is also the most expensive aspect for data compilation … we use phone validation to ensure reliability of listing information and Accuracy automatically follows from it!
    My Comment: The biggest problem with accuracy is the rate at which the data set changes. Businesses close, move, change names, get acquired, or new businesses open. When a business closes, you can pretty much guarantee that they are not contact all the data providers out there to tell them that. Even when something like a brand name changes, it is unlikely that the business will update all the places on the web where the old brand appears. Another thing that can happen include is that the person providing the information simply does it incorrectly.
  8. Pankaj Mathur: (this is the fourth metric) “Relevancy can be best correlated to intent. So if I am searching for a McDonald’s, the information on John Doe LLC who owns the location is irrelevant (even if it is accurate).
    My Comment: The searchers intent is a critical element to the puzzle as well. More on this in the next point.
  9. Pankaj Mathur: “The intent is different when I am searching in front of a desktop than when I am searching on my smart phone at 10 O’clock at night. Due to this evolution of LBS, there are additional attributes that are coming to the forefront, like opening-closing hours, credit cards accepted, ratings, reviews, and coupons and so on”
    My Comment: Mobile search brings a whole new layer of complexity to the problem, because the availability of a whole new level of data becomes critical.
  10. Eric Enge: “You recently wrote an article about how merchant-submitted listings are not the solution to the local search problem”. Pankaj Mathur: “The intent of the article is to highlight the fact that data coming from corporate chains may not necessarily comply with the four guidelines namely completeness, conformance, accuracy and relevance … If you are a big chain corporation like McDonald’s or KFC, managing data on over 10,000 locations can be quite a daunting task … even if a particular store location open or closed, there is some lag time when these lists get updated”.
    My Comment: An example of this was provided above, with the Hilton potentially representing itself as a Banquet Hall. And, as Pankaj suggests here, for large chains, keeping track of all their locations can be prpblematic all by itself. This suggests that a layer of human interpretation may be crucial to this process in the long term.
  11. Pankaj Mathur: “Usually there is a perception, largely amongst data compilers who do not invest as much in compilation efforts, that merchant submitted listings are “gold” so take it for its face value. My personal opinion is that merchant submitted listings are at best “okay”; there is lot of crap in there that needs to be cleansed to make it valuable”
    My Comment: His overall conclusion on merchant listings is clear. Decent source of data, but NOT authoritative. This is consistent with what I have heard from Google.