Eric Interviews Microsoft’s Mikko Ollila

When I had the chance to interview Mikko Ollila, senior product manager for Local Search at Bing, I was excited. Bing has done some nice things with their local search experience, and I wanted to learn a bit more about it.

Our discussion quickly shifted into a review of how Silverlight enables immersive experiences. The affect is really cool and worth a closer look, if you have not already looked at it. Consider the process you go through when you get directions to some place you have not been to before. With a few clicks in Bing Maps you can go from seeing a place a map, to getting directions, to seeing what the place looks like. In addition, you can move around the neighborhood around the place you searched on to see what is nearby.

Bing is also reaching out to developers to get them to create map applications based on Bing Maps. Partnerships also are playing a big role in the Bing strategy. Their recent announcement with Facebook is one example of this. Much more in the interview than this, so check it out!

Google’s Carter Maslan on Local Search

Today I am publishing the transcript of my recent interview with Carter Maslan. This post summarizes some of the main points of the discussion. We spoke quite a bit about the new service area business tool from Google. Some of the major points made by Carter about this were:

  1. Many businesses don’t want to have their address listed in local search results. For example, a plumber who works out of his home, but always goes to the customer to provide his services. Carter also noted that service area businesses are “primarily defined by whether or not the business brings its services to the customers”. However, service areas businesses can include consultants that work from home that the customer can call to obtain their services.
  2. The ability to set oneself up as a service area business is “pretty broadly accessible”.
  3. “Even though we don’t have specific numbers to share … there area ton of home-based and service businesses in this country”, and “I think it is at least a third of the overall total”.
  4. “giving the end user a PO Box as a pin on the map is not really helpful”, and “If a business really cares about its customers knowing where its PO Box is, I think it’ll be clear that it should be a service area business”.
  5. I also asked Carter about spam. He indicated that they do more or less the same thing they do with other types of spam. For example, when I asked him about a plumber who declared they would serve anyone within 1,000 miles, he indicated that “There are a lot of signals regarding whether or not this is suitable”.

We also discussed Place Pages. What emerged from the discussion was that the purpose of this was to provide access to all the available information about a given place. This would include, but not be limited to, businesses that have no web site. The information on a Place Page could include information provided by the business, but will also include information found by Google in crawling the web.

I asked Carter whether or not having individual landing pages for each location of a business with many locations was preferred. The answer was yes, provided that there was meaningful information that differentiated on locaiton page from another (inventory info, driving directions, etc.)

Google has also made it quite a bit easier for people to report errors. This is basic crowdsourcing in action. They are happy to take reports even if all they specify is that something is wrong with a listing. Note that Google can also look at user interaction data (with a particular listing) to get signals to possible problems as well.

We talked about training the local search algorithm. One concept we bandied about was that of having humans build a mini-data set (e.g. some number of tens of thousands of hand researched listings), and then running the algorithm to see how its results compared to the hand crafted test set of data.

As always, in the interest of providing a short synopsis, I have passed over many details and other points from the interview. Read the full interview for more.

11 Notes from my Interview with Pankaj Mathur

Just over a year ago I did an interview with Pankaj Mathur, the VP of Sales for InfoGroup. I enjoyed the initial interview tremendously, so I really looked forward to the opportunity to do it again. As with the first interview, our conversation focused almost entirely on the local search data problem. What makes it hard, and some of the solutions that InfoGroup has in place to deal with it. If you are interested in understanding the complexities of extracting and presenting accurate local data, then the first interview, and this year’s interview with Pankaj Mathur is for you.

For those of you who want the CliffsNotes version, here are some of the major points that emerged from the interview:

  1. Pankaj Mathur: (in our data) “we have approximately 15 million businesses in the U.S., 1.4 million in Canada, and between 2.5 and 3 million in the UK. … “In the North American market last year, we made over 25 million phone calls”, … “It’s all operator-driven, although we do use a smart dialer”.
    My Comment: Right away we get a sense of the business problem. InfoGroup, which as a company prides itself on the quality of its data, feels is it necessary to make 25 million phone calls to validate the data in the U.S. and Canada. This is a far from trivial expense, and one which you can assume they undertake because of the inherent difficulty in building an accurate local search data set.
  2. Pankaj Mathur:” We get approximately 6,000 phonebooks every year, but we do not necessarily compile each phonebook every year”
    My Comment: As you will see later, Pankaj does not believe that having businesses submit their own listings is a great solution, because of the inconsistencies in how they maintain that data. But, chances are, that they keep their advertising up to date. Better still, if they go out of business, they probably cancel the ad.
  3. (Note: InfoGroup evaluates data using four metics, and this is the first one) Pankaj Mathur: “Completeness – is a measure of the total number of listings and in some sense reflects coverage”
    My Comment: An important metric (of course). If you are in Saginaw today, and you are looking for a dry cleaner, you want to know that the search you are performing will return the one closest to you.
  4. Pankaj Mathur: “Infogroup has around 15 million companies in U.S., the IRS claims about 20 million, and the Chamber of Commerce claims about 24 million
    from a Chamber of Commerce perspective, if a license was filed back in 1959, it is considered a valid business in 2010 as long as the owner is still alive and has not filed for bankruptcy” … “Infogroup has defined a business as a brick-and-mortar store having a phone number and location address”, and later: “Google, or any search engine, follows a much broader definition of business or point of interest”
    My Comment: This is one of the key problems with local search. What do you define as a business? Here are some related additional points from the interview:

    • Pankaj Mathur: “A restaurant may have a drive-in, bar and an ATM on premises”
    • Eric Enge: “Say a business only has a P.O. Box for an address; is that something that you would count as a valid business”? Pankaj Mathur: “Yes, we do, such scenario can occur for A financial advisor or a tax consultant working from home”
    • Eric Enge: “what about a kiosk-based location like an ordering terminal in a shopping mall or at an airport”? Pankaj Mathur: “When you look at the corporate list, they will tell you that there is a Dunkin’ Donuts or Baskin-Robbins at a particular address, which may actually be a retailer. In this case, what we usually do is make a decision on a case-by-case basis. For the example above, there probably isn’t enough evidence to necessarily route somebody, looking for Dunkin’ Donuts, to a grocery store just because grocery store has a shelf where you could pick donuts of certain brands. … There are cases like an ATM location inside a bank that is still considered a line of business, and we will compile it.”
    • Eric Enge: “If someone is working as a plumber out of their house and they use their house as the brick-and-mortar address, are they counted”? Pankaj Mathur: “Yes, that will work”

    My Comment: These are just some sample scenarios. We have covered here how InfoGroup handles these scenarios, but each local business search player needs to make their own decisions about these things, and are likely to make different decisions. Is your head hurting yet?

  5. Pankaj Mathur: (this is the second metric) “Conformance – in some sense this implies standardization or adherence to structure” … “we may come across a Hilton listed under ‘Banquet Halls’ and no mention of it under the ‘Hotels’ category. We have quality control rules and audits in place that helps ensure that all Hilton locations are assigned to ‘Hotels’ as a primary line of business”
    My Comment: Another layer of complexity. A given business may consider itself to be relevant to many different categories of business. A restaurant may offer catering services, for example.
  6. Pankaj Mathur: “it is possible that a Hilton shows up as a golf course … We can call this Hilton and verify objectively if there is a golf course attached to it, and then assign the appropriate categories to the record”
    My Comment: Expanding on the prior point, not only is it important to make sure that you have identified the primiary line of business, but you do want to categorize the alternative lines of business as well. Someone looking for a golf course might want to know about the one at the Hilton, for example.
  7. Pankaj Mathur: (this is the third metric) “Accuracy. This is the probably the easiest of all four to understand, because it is factual, but Accuracy is also the most expensive aspect for data compilation … we use phone validation to ensure reliability of listing information and Accuracy automatically follows from it!
    My Comment: The biggest problem with accuracy is the rate at which the data set changes. Businesses close, move, change names, get acquired, or new businesses open. When a business closes, you can pretty much guarantee that they are not contact all the data providers out there to tell them that. Even when something like a brand name changes, it is unlikely that the business will update all the places on the web where the old brand appears. Another thing that can happen include is that the person providing the information simply does it incorrectly.
  8. Pankaj Mathur: (this is the fourth metric) “Relevancy can be best correlated to intent. So if I am searching for a McDonald’s, the information on John Doe LLC who owns the location is irrelevant (even if it is accurate).
    My Comment: The searchers intent is a critical element to the puzzle as well. More on this in the next point.
  9. Pankaj Mathur: “The intent is different when I am searching in front of a desktop than when I am searching on my smart phone at 10 O’clock at night. Due to this evolution of LBS, there are additional attributes that are coming to the forefront, like opening-closing hours, credit cards accepted, ratings, reviews, and coupons and so on”
    My Comment: Mobile search brings a whole new layer of complexity to the problem, because the availability of a whole new level of data becomes critical.
  10. Eric Enge: “You recently wrote an article about how merchant-submitted listings are not the solution to the local search problem”. Pankaj Mathur: “The intent of the article is to highlight the fact that data coming from corporate chains may not necessarily comply with the four guidelines namely completeness, conformance, accuracy and relevance … If you are a big chain corporation like McDonald’s or KFC, managing data on over 10,000 locations can be quite a daunting task … even if a particular store location open or closed, there is some lag time when these lists get updated”.
    My Comment: An example of this was provided above, with the Hilton potentially representing itself as a Banquet Hall. And, as Pankaj suggests here, for large chains, keeping track of all their locations can be prpblematic all by itself. This suggests that a layer of human interpretation may be crucial to this process in the long term.
  11. Pankaj Mathur: “Usually there is a perception, largely amongst data compilers who do not invest as much in compilation efforts, that merchant submitted listings are “gold” so take it for its face value. My personal opinion is that merchant submitted listings are at best “okay”; there is lot of crap in there that needs to be cleansed to make it valuable”
    My Comment: His overall conclusion on merchant listings is clear. Decent source of data, but NOT authoritative. This is consistent with what I have heard from Google.

Chris Silver Smith Interview: Comment Here

It seems like forever since I last posted an interview. Take this one as a heads up that I do intend to reinvigorate the interview series over the next month or two. They are too much fun to do, and very useful in digging up information on search.

Today I am publishing an interview with Chris Silver Smith of KeyRelevance. It is a great discussion on the ins and outs of local search. A former SuperPages.com employee, Chris has an insider’s knowledge of how this all works. Don’t miss it!

Latest Interview: Yahoo’s Frazier Miller and Shailesh Bhat

This week’s interview is with Frazier Miller and Shailesh Bhat of Yahoo. This is the third in our series of interviews focused on Local search. The first two were with Google’s Carter Maslan and Microsoft’s Ziya Genceren.

The discussion is heavily focused on Yahoo local search. Check it out!

Latest Interview: Microsoft’s Ziya Genceren

This week’s interview is with Microsoft’s Ziya Genceren. Ziya is Group Product Manager for local search at Microsoft. We go into depth into the best ways to work with Microsoft local search. Check it out!

Latest Interview: Google’s Carter Maslan

Recently I got to speak with Google’s Carter Maslan about Local Search. The discussion should help people interesting in Local Search to more effectively put together their strategy. There is also a little bit of myth busting in this read, as not all of my questions were answered quite the way people might expect.

Feel free to use the space below to comment on the interview.