Archive

Author Archive

SES Chicago Dec 7 - 11th

December 7th, 2009
Chicago, originally uploaded by aserpa.

Well, it’s been a long time, a fantastically busy time and that’s the end of my non-excuse for the lack of posting of late.

I’ve just landed in Chicago to attend SES and discovered some weather for the first time in 15 months (you’d be amazed to hear you can actually miss weather if you move to California!) and a bank of Taxis outside the conference parading around with the new Yahoo! advertising on the top. I have a feeling this is a happy accident, but marketing is rather a ‘dark art’ so I’m not committing to that.

I’m here because I was asked to join a panel (Developments in Information Retrieval on the Web) and talk to the crowd about Semantic Data along with Jamie Taylor (MetaWeb), Martin Hepp (Universität der Bundeswehr München) and Jay Myers (Best Buy).

It’s a great panel - two guys capable of talking about any aspect of RDF and Microformats and two guys who’ve had the pleasure of learning from them and implementing structured data solutions. Jay works over at BestBuy where he’s done a cracking job of integrating structure on a public site with a massive catalog (~500k+ pages) - oh and he’s also been instrumental in developing the GoodRelations spec, so all-in-all a Semantic Superstar! That sort of implementation makes my life so much easier and hopefully, in turn, yours as Search Engines and aggregators start to use this structure to help you find what you want.

If you’re in town and would like to meet for a drink please drop me a mail, IM or reach out on Twitter.

UPDATE:

It’s all over, i’m heading back to California where white stuff doesn’t fall out the sky and coats are for people with holiday cabins in Tahoe. Huge thanks to Sean Golliher (great blog template, Sir!) for organising a great panel, it was a really enjoyable session.

Shame on those of us who felt the delay whilst Martin got his Mac ready would have made a good advert for Microsoft’s ‘I’m a PC’ series - the upshot is this great video of his presentation with slides. http://vimeo.com/8065914 It’s a pity we don’t have the full 126 slide original to compare it against!

In the spirit of sharing, as soon as I get to a stable connection I’ll be adding my slides to SlideShare and asking Jamie and Jay to do likewise…. more soon.

UPDATE 2:

I’ve uploaded my slides from the panel to: http://www.slideshare.net/NickCox/ses-chicago-2009-searchmonkey

UPDATE 3:

I just saw a Tweet from Jay Myers, now his slides from SES are up on Slideshare at http://slidesha.re/4UoQbg. 3 down, 1 to go!

SES Toronto

June 10th, 2009

If you’ve never done it, you’d be amazed at the havoc a 3 hour PST to EST timezone change can play with your body clock. Just as I was getting tired on the first night I realised it was actually 3am local time and I was due at the conference center a little sooner than I’d planned. Whinging aside, my trip to SES Toronto was worth every foggy headed moment.

My thanks to Jed, Anne and Bill for making it an enjoyable panel. We covered a good range of topics from how to utilise rich/social media to improve your branding, to the latest user-session insights and the benefits of using structured data and correctly labelling your content.

I also got a chance to revisit my TV career and remind everyone why I average about 19 years between appearances.  For those who follow these things my last appearance was in 1990 on the great BBC kids program Blue Peter.

Huge thanks go to the Yahoo! Canada team for their warm hospitality. If the chaps at border patrol don’t put out an APB for shifty eyed English guys, you just try and stop me coming back. I need claim my remaining 1min 13seconds of fame!

Author: Nick Cox Categories: Search, Yahoo! Tags: , ,

Yahoo! Placemaker

June 5th, 2009
Yahoo! Placemaker

Yahoo! Placemaker

Recently Yahoo! launched a new Geo API called Placemaker. I’ve been playing with it all week and am continually delighted with the recall and accuracy it’s able to deliver.

Essentially you can pass in a text string or web document (structured or unstructured) and the service will identify, disambiguate and extract the places contained within. For example this sentence includes the location Sunnyvale, California which whilst seemingly completely out of context is where I work. I ran this paragraph through the API and here’s an extract of what was returned:

<document>
<administrativeScope>
<woeId>2502265</woeId>
<type>Town</type>
<name><![CDATA[Sunnyvale, CA, US]]></name>
<centroid>
<latitude>37.3716</latitude>
<longitude>-122.038</longitude>
</centroid>
</administrativeScope>

</document>

Along with the location name, a latitude and longitude of both the centroid and each corner of a bounding box we also have the superb WOEIDs (Where-on-Earth ID). Armed with all this information there’s almost no location based application I can’t build. Indeed sites such as Just Landed which searches Twitter for the text ‘just landed in’ and geocodes the places in order to provide intriguing visualisations just became as simple as tying two APIs together!

As a supporter of all things Semantic, it’s important to highlight that this API goes far beyond some complex string matching. Placemaker recognizes geographic semantic tags, such as the W3C Geo Vocabulary, and microformats such as geo and adr. Pretty neat huh? Drop a note in the comments below and let me know what you think about this and post any links to cool applications it’s allowed you to build.

SES Toronto

May 29th, 2009

I’m excited to be able to confirm that I’ll be speaking at SES Toronto on June 8th. It’s been a few years since I worked a booth at an SES event back in my Mirago days and I’m looking forward to see how much things have changed.

I’ll be part of a panel discussing ‘Universal and Blended Search: Comprehensive Visibility Challenges’ along side Anne Kennedy (Beyond Ink), Bill Tighe (Google), and Jed Schneiderman (Microsoft).

Obviously with the timing so close to the scheduled launch of Bing I’m expecting a storm of questions for Jed about their new approach to highly dynamic federation. Also with both Yahoo! and Google on the stage attendees will get to really understand the benefits of (and increasingly easy ways of incorporating) structured data to visibly impact their representation on the SRP.

Track: Nuts & Bolts
Universal and Blended Search: Comprehensive Visibility Challenges
Search result multiplicity is not a new phenomenon, but recent advancements guarantee that the world of search and marketing will be changing forever. How do the new “blended” search results pages affect your marketing strategy? Do these changes mean that the major search engines are eager to keep the “second click” on web properties owned by themselves? How popular are the new blended search results with users? This session will include research data available only at SES.

Moderator:
Mike Grehan, SES Advisory Board & Global KDM Officer, Acronym Media

Speakers:
Anne Kennedy, SES Advisory Board & Managing Partner and Founder, Beyond Ink
Bill Tighe, Agency Business Development AE, Google Canada
Nick Cox, Senior Product Manager, Yahoo!
Jed Schneiderman, Online Marketing Lead, Consumer & Online, Microsoft Canada

Google Joins Semantic Web

May 28th, 2009

As I highlighted a few short weeks ago, Google has been dropping hints about the Semantic Web so subtle that even us chaps realised something exciting was going on over at the Googleplex. During the Searchology conference (their annual slap in the face to startups who dared think they were on to something unique and exciting) Big G revealed that the Christmastime rumors of data islands were no more and that RDFa was accepted!

The announcement focuses on hCard and hReview, which if found on your page be will be turned in to a visual presentation and added to your result on their SRP. Sound familiar? If it does that’s because, as many bloggers pointed out, it’s incredibly similar to Yahoo! SearchMonkey Structured Objects. Competition aside, this is great news for publishers as it is yet another vindication of the benefits of structured data on your pages.

Google Rich Snippets

Google Rich Snippets

Yahoo! SearchMonkey

Yahoo! SearchMonkey

Where SearchMonkey has focused on complete Objects for presentation - e.g. a Video looks like this whilst a News article looks like that - Rich Snippets, as Google is calling this, call out single key/value pairs which can add value to a standard result. So far however their presentation appears to be behind flood controls as you need to add your domain to a waiting list. My hunch is that Google is treading carefully due to concerns as much about spam as the resulting visual impact on their end users.

Now that the top two engines are adopting public, open-standards we can expect to increasingly enjoy the benefits of ever richer, more accurate results with highly targeted presentations.

Wolfram goes Alpha

May 17th, 2009

Continuing their ‘WTF?!’ launch policy Wolfram|Alpha chose to open the floodgates to their servers late on Friday afternoon, several days earlier than announced. Perhaps this was ploy to reduce the likelihood of their hardware stumbling under the load, if it was it didn’t work - mostly because their target audience doesn’t have much else to do on a Friday night.

As Google proved a few days prior, server problems happen to the best of us and I for one won’t be marking them down for that - it’s an Alpha for a reason. If anything it’s extra marks for thinking ahead and offering an error message aimed squarely at their audience.

Wolfram|Alpha Server Failure Message

Wolfram|Alpha Server Failure Message

Ok, so what are the results like? Overall I’m impressed, the linking of data is frankly excellent even if you get the feeling they’re just showing off at times. For example, knowing the height of the ‘tallest tree’ in the most appropriate unit would be satisfactory. Going on to convert 385ft to miles, yards, meters, km, cm and even fathoms is bordering on the autistic. Another classic example informs me that the speed ‘55mph’ is 0.62 x the speed at which Marty McFly needed to drive the Delorean DMC-12 in order to time travel ( 88 mph ) - now is that geeky, a fun Easter egg or just data because it was there?

Childlike fact telling aside, Wolfram doesn’t offer the most accurate Query Linguistic Analysis engine and that leads to many failed queries which it would appear Wolfram actually does have the answer to. For example ‘average salary’ fails whereas ’salary’ returns average salary information for a set of major occupations. This is something that can be improved dramatically with access to a massive volume of real world queries, this Alpha release and associated ‘Google Killer’ hype will certainly enable the collection of that.

I’m also not going to knock off marks for the user interface or breadth of their dataset, both of those can be fixed over time if the proof of concept warrants it - and the first look suggests that it really does. Whilst Google wanted to index the world’s data, Freebase, Wikipedia and now Wolfram seem to have most of the worlds ‘factual content’ wrapped up.

VoCamp Sunnyvale CA: June 18-19, 2009

May 12th, 2009

I’ve talked recently of my sadness at the lack of a central repository for ontological knowledge on the Web. Until the major players can sort that out (I really don’t expect it to be long coming now) on the Web there is plenty you can do back in the RealWorld(tm).

VoCamps provide a two day forum for vocabulary creation and discussions on the management of the Semantic Web. Unlike Semantic Web meet ups which typically take a few hours and focus on a single presentation, the VoCamp format is open and provides time to members of the community to talk about current issues with vocabularies and semantic interoperability and the chance to work in small groups.

If you live in the Bay Area and want to come along to a VoCamp and help shape the future of the Semantic Web please sign up on the VoCampSunnyvale2009 wiki page. Space is limited, but we will try to expand if necessary. The event is right after SemTech San Jose so you won’t have far to travel, and perhaps best of all it’s free!

Bogged down by Semantics

May 9th, 2009

I’m running massively behind on my Podcasts. The backlog has been building up for the past month whilst I’ve been focusing on that ever present joy - quarterly planning. As you might have guessed from my place of work, planning right now has a few more variables than one might hope for. Digressions aside, I grabbed a few hours this weekend to get psyched about Tech again.

Highest on my playlist was The Semantic Web Gang, and not just because my colleague Peter Mika was taking part this time. This is regularly a great show for anyone wanting to learn more. I ended up a little depressed as the conclusions of everyone on the panel sadly matched those I’ve been coming to for a while.

No one likes to ‘reinvent the wheel’ so before delving in to code most of us look around to see if we need to. When investigating Semantic Objects today there is no clear source of truth as to prior-art for any developer (corporate or personal) wanting to create an Ontology. Whilst this doesn’t surprise me at this stage in the Semantic Web, I am a little shocked that no one has attempted to take ownership of this space.

It’s in the interest of the community to offer a set of complete vocabularies for specific objects and all of us spend a fair amount of time trying to define the next set. With both these thoughts in mind, here’s my elevator pitch for a possible solution:

  • Offer a gallery style view of known and ‘complete’ objects.
  • This gallery would be user contributable.
  • This gallery would allow for comments and feedback to the authors to ensure the needs of the wider world are considered by the authors.
  • This gallery would offer links to ontology creation tools.
  • This gallery would support and allow for group collaboration on the definition of a new object.
  • When an ontology is complete and examples of real world usage were linked to by more than 3 people Yahoo!, MSN, Ask, Google etc.  would extend support for it by adding crawler support (e.g. we would agree to accept this format for our indexes).
  • The entire Ontology set would be made available under CC licenses (or most appropriate alternative) and ‘donated’ to the community to ensure adoption.

Why is ‘something’ like the above useful? It would be a start point for the confused masses. Does an ontology exist for ‘bicycles’? A simple search could return nothing:  You’ll need to go and create something, and here are some tools and access to a community. Or something: Here’s an ontology you can go and use or contribute to in order to extend it as you need.

Well, that’s one possible way to lower the barriers to entry which people are increasingly telling me are too high right now. What do you think, is there a better way?

Wolfram|Alpha

April 28th, 2009

Ignoring the traditional ‘how to launch a new site’ playbook which state you must whore yourself around expert commentators, provide personal updates on your blog for months in advance, build a following among an ever increasing alpha test group and finally issue an overblown PR announcement on the day of launch which preferably includes some quotes hinting ‘Google killer?’ from your new friendly commentators, Stephen Wolfram has seemingly rubbed much of the industry the wrong way the mysteriously quiet run up to the launch of Wolfram|Alpha.

As redundant as this may sound, geniuses are aren’t stupid people. For a while there though I was starting to question the wisdom of the MacArthur genius grant review committee. Whilst Wolframs approach has garnered the biggest swell in anticipation prior to a launch since, well probably since, Teoma and Wisenut back in 2002/3, yesterdays webcast was a bust for me. Scheduled at a time I couldn’t attend I hoped to catch up later in the evening. No such luck. The broadcast appeared to have been replay free until over 30 hrs had passed and we started to see some neat download options began to appear – download the video, stream it or grab the MP3 – cool! Speaking of which, Cuil followed the playbook, everyone seems to hate them, and even if they did publish an MP3 version of their most recent announcement (a timeline presentation seen before a dozen times elsewhere) nobody would have cared. It’s important to add that you don’t get a single screenshot of this ‘amazing’ new product during the entire 90 minute presentation - Stupidity or extreme genius? You decide.

What can it do? It can describe places, like Lexington, Mass., by its vital statistics, like location, population, weather, etc. It can compare Lexington with Moscow. If you type “LDL 180,” it will tell you the percentile of the population with higher or lower cholesterol and show you the answer on a chart. If you tell “LDL 180 male 45,” it will adjust the chart for gender and age group. It can chart the life expectancy of a male age 40 in Italy or tell you who was president of Brazil in 1928

http://bits.blogs.nytimes.com/2009/04/28/wolfram-alpha-veil-lifted

Without visual proof of the thing in action it’s hard to state this with any degree of convicion, but there appears to be nothing in the demo that couldn’t be achieved without a decent query parser and a triple or perhaps, if we wanted to store the context of the data, a quad store. I have seen a few leaked screenshots from the initial webcast and it would seem that many of the examples can be knocked up with Freebase. So is Wolfram|Alpha one of the next generation of Object Data store powered Search Engines? Hard to say from this small ‘preview’, but the indications do hint at it.

To cap the growing excitement with the fateful rubber stamp of ‘Google Killer’, Google themselves came out with a Direct Display for the top of their results to show US Census data. A nothing launch on any day of the week – with the exception of the nice graphing animations thanks to Trendalyzer – this timing got the press buzzing. Do Google think this is a threat? Is Google trying to prove that whatever Wolfram can do they can do better? And so on until you loose the will to care. Well, at least until you get the chance to see for yourself in May when the real launch happens.

[UPDATE May 11th]  According to their blog, Wolfram|Alpha will open to the full force of the Webs interest on 18th May 2009. If you’re lucky enough to stumble in to a test bucket you may be able to experiment already.

Conference Depression

April 24th, 2009

I’ve just returned from a relaxing couple of weeks touring my new home state of California. Since moving to the US 7 months ago I’ve not taken any holiday and still find the SF tourist stuff intriguing. Among the 2,104 emails (real figure) awaiting my return, the award for most depressing email goes to O’Reilly. The Found conference has been cancelled.

This is depressing on two fronts - first (selfishly) I’m obviously sad not to be presenting my talk on the Semantic Web to the SEO community. Secondly, and more importantly, it’s one of the first major signs of the ‘Great Depression 2.0′ here in the Bay Area. Sure people have been laid off in their thousands, property prices have plummeted, and queues have run from Job Centres out on to the street, but oddly this doesn’t seem to be reflected in the traffic on the 101 and 208 each morning.

For those interested in the statement from O’Reilly, here it is in full. 

O’Reilly Found Conference 2009

Due to the challenging economic environment, we’re sorry to announce that we’ve made the difficult business decision to postpone the O’Reilly Found Conference, which was to take place June 9-11 in Burlingame, CA.

We are grateful for the support of everyone involved in the event, particularly program co-chairs Vanessa Fox and Nathan Buggia, sponsor Microsoft Live Search, and the event partners and participants.

O’Reilly will continue to explore the topic of search-friendly architecture for developers, including the possibility of integrating some of the excellent Found program into other offerings from O’Reilly.

If you would like to continue the conversation on making the web easier to find, please visit janeandrobot.com and follow twitter.com/janeandrobot to become part of the community, read the latest on technical SEO issues from industry experts, and attend local technical SEO meetups. 

 

Is this the first of many? I expect so. Have you seen any other cancellations?

Author: Nick Cox Categories: Internet Tags: , , ,