Google Reader+ And Identity vs. Personas

Google has announced that Google Reader will finally get a much-needed revamp. It will now be integrated with Google Plus, and its native isolated social network will be abandoned. See Techmeme for responses from the tech blogger community. The response from Google enthusiasts has been largely positive, as you can see in this Google Plus thread. For non-Google enthusiast responses, see this Hacker News thread.

As a heavy user of Google Reader, I have mixed responses to this announcement.

Positives

  • Google Reader will finally get a much-needed UI revamp. I suspect removing the native social follower-model within Google Reader will make it much faster.
  • Sharing from Google Reader to Google Plus will be much easier. I can quickly share an item from my Google Reader to my “Tech Enthusiasts” circle on Google Plus.
  • There will no longer be an RSS feed of your Google Reader shares. Many people use this RSS feed to auto-post shares to their WordPress/Blogger/Tumblr blogs, in addition to Twitter. Of these, Twitter is where this auto-posting generates the most noise. I have written about this in great detail before.

Negatives

  • No way to follow a highly-curated tech-focused feed of other Google Reader enthusiasts. As a passionate Reader enthusiast who stays on top of tech news all day, my feelings about missing this feed are well expressed by Sarah on TechCrunch.

Understanding the Root Problem

My Google Reader shared feed is a tech-focused feed and nothing else. My Google Plus feed, however, is a mix of personal photos, personal blog posts, shares as a father about my daughter, etc. Where will my Google Reader followers get my tech-focused feed now? No, Google Circles doesn’t solve the problem.

The reason I have this tech-focused blog, and keep a separate personal blog (where I’m currently writing about Western Classical Music appreciation) is that readers of this blog expect to read tech-focused posts, while friends who know me personally enjoy reading my personal blog too. I do not pollute my own Google Reader shared items with my own personal blog posts.

I have two separate Twitter accounts for the same reason. @ScepticGeek is well-known as a tech expert, while people who either know me in real life or are interested in my other non-tech interests follow @Palsule. Different people even call me either “Mahendra” or “ScepticGeek” in real life.

Identity and Personas

Both Google and Facebook are now forcing me to be myself with all my varied interests in all my sharing and engagement on those networks. Twitter allows me to be two different personas. This is a crucial difference, recently described best by Chris Poole, nicely summarized by Tim Carmody here. The money quote:

Both Google+ (with Circles) and Facebook (with Smart Lists) misunderstand the core problem of online identity: It’s not only about who you’re sharing with, but how you represent yourself. “It’s not who you share with, but who you share as.”

On Google Reader I am @ScepticGeek, on Facebook I am @Palsule, on Twitter I can be both, and now I wonder what I am supposed to be on Google Plus.

The Future: Focus on Interest Graph

Does this mean Google Plus necessarily becomes a place of incongruous, irrelevant shares? No. What we need is better filters for relevance. I have written before about how Quora complements the Social Graph with an Interest Graph for greater relevance as well as serendipity. As a general-purpose social network, Google Plus needs to do more.

We need to be quickly able to filter the Google Plus feed by source – Google Reader, Photos, YouTube, etc. Google needs to invent a way to auto-tag/auto-classify Google Plus posts such that I can view a feed of tech news, personal photos, humor, photography, etc. using a simple UI filter.

This problem is understood by Bill Gross, who started Chime.in as a way to “Follow a Part of a Person”, the idea being that you can follow both @ScepticGeek and @Palsule on the same network, and depending on your interests, you will auto-magically see only the shares you are interested in. But with the likes of Google and Facebook in the race for dominance of the social web, it is unclear whether new startups focusing exclusively on this problem stand a chance.

Do you know who is already capitalizing on this problem and is hugely successful? Tumblr. Most people use Tumblr by sticking to a specific area of interest, and the social network makes it easy to follow others sharing your interests. With 850 million Facebook users and 50 million Google Plus users, why are there almost 30 million Tumblr blogs out there with over 10 billion posts? I suspect it is because neither Facebook nor Google Plus is an interest-based social network like Tumblr. The future war of the social web hinges on who creates the most relevant experience for users.


Google Plus: Why Facebook and Quora Should Worry

Google launched Google+ this week, and there have been many excellent posts highlighting its potential as well as its challenges. My first impressions are very positive. I will not regurgitate any of the points already made by others, and will limit this post to what I think has been missed.

Relevance: “The Mother Of All Streams”

Most of the commentary about Google Plus so far has focused on its social feature – “Circles”, a new way of grouping contacts for targeted sharing. But as I’ve said before, the future belongs to the Interest Graph complementing the Social Graph.

Facebook has done a not-so-great job capturing users’ interests. Many people have ‘Liked’ hundreds of pages just because they were asked to do so by their friends. Facebook’s obsession with and overreliance on the social graph has corrupted their interest graph, and this might well be Facebook’s Achilles Heel in the long term.

Google Plus takes a different approach. The goal of Sparks is to capture your true interests. It is in a primitive state at present, but I’m talking about the Big Picture here! As Andrew Tomkins explains:

“Sparks is essentially the stuff that flows to you through the interest graph and the stream is the stuff that flows to you through the social graph”

This is precisely what I described was the secret sauce behind Quora:

Quora’s newsfeed is an interesting showcase of what happens when you mix an Interest Graph with a Social Graph – and the result is the mysterious addictiveness so many have experienced, but found difficult to explain.

Steven Levy goes on to explain how the Google Plus team plans to mix these two to create the “mother of all streams”.

Also: Once Google gets to know you better, it can provide more relevant search results. Take the classic search disambiguation problem: when a user searches for ‘apple’, is it for the fruit or the company? Your interests from Sparks can help Google learn what you’re looking for.
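To make the disambiguation idea concrete, here is a minimal Python sketch. It is not how Google or Sparks works; the sense inventory, the topic names, and the `disambiguate` function are all hypothetical, and the scoring is a simple overlap count between a user's interests and the topics attached to each sense.

```python
# Hypothetical sense inventory: each sense of a query lists related interest topics.
SENSES = {
    "apple": {
        "Apple Inc. (company)": {"technology", "gadgets", "startups"},
        "apple (fruit)": {"cooking", "gardening", "nutrition"},
    }
}


def disambiguate(query, user_interests):
    """Return the sense of `query` that overlaps most with the user's interests.

    Returns None for queries with no known senses. Ties (including a user
    with no matching interests) fall back to the first listed sense.
    """
    senses = SENSES.get(query.lower())
    if not senses:
        return None
    # Score each sense by how many of its topics the user is interested in.
    return max(senses, key=lambda sense: len(senses[sense] & user_interests))


# A tech enthusiast and a home cook type the same query.
print(disambiguate("apple", {"technology", "music"}))
print(disambiguate("apple", {"cooking"}))
```

The point of the sketch is only that an interest profile turns an ambiguous query into a ranking problem; a real engine would use far richer signals.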

Why Quora Should Be Worried

It was reported earlier today that code for Questions has been found in Google Plus. If this comes as a surprise, you haven’t understood Google’s ambitions with Emerald Sea.

Unlike Quora, where users/moderators need to manually tag Questions to fit their taxonomy, Google could easily auto-tag questions. Further, it could easily AutoComplete your Question in a way Quora could only hope. And even further, in many situations, Google could answer your question without waiting for a human being to respond.

Imagine such a Q&A service working across mobile devices, where Google knows your location and much more about your interests and friends.

Why Facebook Should Be Worried

AllFacebook has a great post on how Google Plus is a challenge for Facebook. Some folks have already opined that Facebook has nothing to fear, that the mainstream users are not going to join Google Plus and quit Facebook in droves. But pundits have been wrong before.

I wouldn’t dismiss Google Plus so quickly if I were Facebook. Challenges for Google Plus:

  • Critical Mass: Google Plus needs a critical mass of users if it is ever to gain mainstream acceptance. However, these are very early days, and early adopter response has been largely positive.
  • Games: Mainstream users love games. Google is reported to have invested in Zynga, while Facebook has had a rocky relationship with them. What if the next Farmville were to launch exclusively on Google Plus and not on Facebook?
  • Simplicity: As it stands today, Google Plus is not actually more complicated than Facebook, it just feels like it because it is new. Try introducing Facebook to a first time user and walk through the different features, and you’ll agree that Facebook has slowly evolved to a much more complex service with a plethora of components. Google Plus will need to become simple and intuitive to attract a sizeable mass of followers before adding new features.

There are a lot of unknowns, and my take is that it’s too early to make predictions. In any case, the stickiness factor of Google Plus is a big challenge for Facebook.

I am very impressed with what I’ve seen so far. There are challenges, but for once, I think the Emerald Sea team is seeing things in the right perspective and making all the right moves. In Feb 2010, I explained Why Google Buzz Doesn’t KISS. So far, Google Plus does.


The Age Of Relevance

[This is a copy of my guest post on TechCrunch, in which I have recapitulated and refined many of the concepts discussed in earlier posts on this blog.]

What’s the next big thing after social networking? This has been a favorite topic of much speculation among tech enthusiasts for many years. I think we are already witnessing a paradigm shift – a move away from simple social sharing towards personalized, relevant content. The key element of the next big thing is the increasing significance of the Interest Graph to complement the Social Graph. While Facebook, Twitter, and Google are already working on delivering relevant content, a slew of startups are focusing exclusively on it. Relevance is the only solution to the problem of information overload.

The Information Discovery Matrix

[Image: The Information Discovery Matrix]

The above matrix is a representation of how the process of online information discovery has evolved.

Phase I: The Search Dominated Web

This is how Google began its dominance over the web more than a decade ago, using PageRank to surface the most popular web pages as identified by other web pages that linked to them.

Phase II: Web 2.0 With Social Bookmarking

In the Web 2.0 era, social bookmarking services gained significant traction, surfacing popular content. Sites like Reddit and StumbleUpon are hugely popular even today, driving millions of page views.

Phase III: Personalized Recommendations

Services like Hunch, GetGlue, etc. have focused on building an Interest Graph for users, to deliver personalized recommendations using a ‘taste engine’.

Phase IV: Personalized Serendipity

The latest crop of startups is focusing on personalization using a combination of Interest and Social Graphs. Personalized Serendipity is what Jeff Jarvis calls ‘Unexpected Relevance’. Examples include Gravity, my6sense, Genieo, and TrapIt.

What Exactly Is Relevance?

The battle against information overload is sometimes presented as a choice between Relevance and Popularity, where ‘relevant’ is equated to ‘personalized’ as against popular. However, Relevance does not always mean Personalized. Relevance is very dynamic – it depends on the needs of a person at a specific point in time. There are times when users want to know about the most popular stories, and other times when they seek personalized content.

There are multiple approaches to filtering information for Relevant Content. Google, Paper.li, and PostRank are examples of algorithmic filtering, while Reddit and Hacker News use a crowdsourcing approach. Klout can be used to filter Twitter streams by influence, while Facebook uses social affinity as a filter for its newsfeed and social signals for its new Comments Plugin. Location is another high-impact signal for delivering relevant content, gaining importance in a mobile world.

In other words, Relevance spans across all the quadrants of the Discovery Matrix above, and none of the above approaches to filtering for relevance is the ‘best approach’. There is no killer approach to Relevance. Henry Nothhaft, Jr., CMO of TrapIt, described it as “the myth of the sweet spot”. The competitive edge will be with services that support multiple discovery methods, multiple filtering approaches, have flexibility, and support multiple mobile platforms.

Quora: A Showcase Of The Interest Graph

Quora has pioneered the use of the Interest Graph as a dominant signal for its newsfeed. Quora asks new users to select Topics to follow, as part of its onboarding process, which is the first revelation that Topics are as important as Users to follow.

Quora’s newsfeed is an interesting showcase of what happens when you mix an Interest Graph with a Social Graph – and the result is the mysterious addictiveness so many have experienced, but found difficult to explain. An item pops up in your newsfeed not because you were following a user, but because you were following a related topic. This often leads to Personalized Serendipity – or Unexpected Relevance – which is why Quora gets many people hooked.
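The mixing described above can be sketched in a few lines of Python. This is an illustration only, not Quora's actual feed logic: the `Item` type, the `build_feed` function, and the sample data are hypothetical. An item enters the feed if it matches either graph, and the sketch records which graph surfaced it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    title: str
    author: str
    topics: frozenset


def build_feed(items, followed_users, followed_topics):
    """Keep items that match the social graph OR the interest graph."""
    feed = []
    for item in items:
        via_social = item.author in followed_users
        via_interest = bool(item.topics & followed_topics)
        if via_social or via_interest:
            # Items matched only by topic come from people you do not follow:
            # these are the "unexpected relevance" cases.
            reason = "social" if via_social else "interest"
            feed.append((item.title, reason))
    return feed


items = [
    Item("Scaling a startup", "alice", frozenset({"startups"})),
    Item("Listening to Bach", "bob", frozenset({"classical music"})),
    Item("Weekend recipes", "carol", frozenset({"cooking"})),
]
print(build_feed(items, {"alice"}, {"classical music"}))
```

Here the second item appears even though its author is not followed, purely because of the followed topic.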

The war over the Interest Graph began between Twitter and Facebook last year, as Eric described eloquently. So how did Quora beat them to this game? For starters, Quora is built from the ground up with the Interest Graph being a backbone of the framework. Twitter’s ‘Browse Interests’ is too broad and primitive to be of use, even at present. And while Facebook has a mechanism for allowing publishers to push new items to your feed, most publishers have been unaware of this functionality. This is also the reason why Facebook’s Like Button now publishes a full news feed story. The future clearly belongs to whoever best captures the Interest Graph, as Max Levchin and Bill Gurley put it.

The Future: A Paradigm Shift

The implications of a Relevance-driven web are wide-ranging and broad in scope. Better utilization of the Interest Graph by services will lead to better ad targeting, and a potential decrease in reliance on CPM/CPC-based advertising. Monetization focus will be on higher yields through transactions and subscriptions, as Dave McClure once described. Online media publishers will focus on Relevance Metrics revealing engagement and time spent on site, rather than primitive metrics like page views and traffic. Social media may lose its obsession with follower numbers and traffic, evolving to context-driven reputation systems and algorithms.

Interest Graphs will be used to build Better Social Graphs. Today’s monolithic Interest Graph will get further specialized into Taste Graphs, Financial Graphs, Local Network Graphs, etc., yielding higher relevance for different needs. The Age of Relevance beckons!


Filtering for Relevance with my6sense

As a champion of Relevance over Numbers, I have been happy to see the increasing popularity of my6sense – a mobile app for iPhone and Android that uses Digital Intuition to filter your social streams and provide you with personalized, relevant content.

my6sense is simply the best tool to catch up with updates from your social networks. It works with Twitter, Google Buzz, and Facebook to get content, and you can also import your Google Reader feeds and add any specific websites if you wish. The mobile app allows you to share the content easily on any of the networks you’ve connected. The latest version 1.4 lets you post status updates and features a smart widget.

It is a tricky challenge for me to write an unbiased review of my6sense because I am not a representative user. Hence, I will write about the limitations I experienced when using the app, and also attempt to assess it from the perspective of an average user.

Digital Intuition Engine

There is a period of learning required for the relevance engine to understand your personal preference for content. The app tracks which items you click, how long you spend reading items, and which items you share, to gauge the relevance of items to you.

Within 2-3 days of using the app, I could start ‘sensing’ the digital intuition engine. The more time you spend using it, the better it gets.
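As a rough illustration of learning from implicit signals like these, here is a small Python sketch. It is not my6sense's algorithm, which is not public in this post; the weights, the `InterestProfile` class, and the topic labels are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical weights for each implicit signal; a real engine would learn these.
CLICK_WEIGHT = 1.0
SHARE_WEIGHT = 3.0
SECONDS_WEIGHT = 0.01  # credit per second spent reading


class InterestProfile:
    """Accumulates per-topic scores from clicks, reading time, and shares."""

    def __init__(self):
        self.topic_scores = defaultdict(float)

    def record(self, topics, clicked=False, seconds_read=0.0, shared=False):
        # Combine the three signals into a single strength for this interaction.
        signal = (CLICK_WEIGHT * int(clicked)
                  + SECONDS_WEIGHT * seconds_read
                  + SHARE_WEIGHT * int(shared))
        for topic in topics:
            self.topic_scores[topic] += signal

    def relevance(self, topics):
        # Unknown topics contribute nothing.
        return sum(self.topic_scores.get(topic, 0.0) for topic in topics)

    def rank(self, items):
        """Sort (title, topics) pairs from most to least relevant."""
        return sorted(items, key=lambda item: self.relevance(item[1]), reverse=True)


profile = InterestProfile()
profile.record({"android"}, clicked=True, seconds_read=100, shared=True)
profile.record({"sports"}, clicked=True)
print(profile.rank([("Match report", {"sports"}), ("New phone", {"android"})]))
```

The more interactions the profile records, the more the ranking reflects what you actually read and share, which matches the "gets better with use" behaviour described above.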

I have previously written about different approaches to filtering information for relevance. my6sense uses a combination of filtering based on social graph and algorithms. It assesses whose tweets and Google Reader shares interest you the most from your social graph, and uses semantic analysis of the linked content to determine relevance.

One of the things I liked is that the focus is purely on relevance – so-called influencers are not given a boost and your content is truly personalized.

Why I Am Not A Representative User

Before going further, I need to explain in brief why this app is not part of my daily news reading routine.

  • As technology news editors, we often break stories virtually in real-time before they are covered by tech blogs. my6sense is not meant for discovering breaking news.
  • I have written several times about how I am brutal in curating my sources. I follow a few hundred people on Twitter and between 25 and 30 on Google Reader/Buzz. my6sense is more suited for those who follow a large number of people.
  • I spend only a few hours every morning catching up with news and spend the rest of the day covering breaking news. my6sense is more suited for those who have more than a 12-hour backlog.

Limitations & Recommendations

  1. My most relevant content on Twitter is in my Techmeme Leaderboard Twitter List. I do not follow any of these Twitter accounts. There is no way I can tell my6sense to focus on a specific Twitter List for relevant content. I suspect a lot of heavy Twitter users use Lists to organize their following and I would rate this support to be a high priority requirement.
  2. There is no two-way sync between Google Reader and my6sense. Items read via Streams in my6sense are not correspondingly marked as read in Google Reader. You can share items on Google Buzz but they don’t seem to be shared in Google Reader. As I have a dedicated Google Reader following, this necessitates an unnecessary duplication of effort. It would be great if items shared on Buzz are shared on Google Reader as well.
  3. No desktop or web-based version.
  4. In the Streams view, it would be helpful to have counts of the items similar to unread counts shown in Google Reader. I found myself checking each folder in turn, only to find no items within it.
  5. Imported streams from Google Reader appear to be stale. They continue to show items that I have already read and shared, while new items are not always shown. For example, here are two screenshots of a feed folder taken at the same time while writing this post:

[Screenshot: the “Brands” folder in Google Reader]

[Screenshot: the same “Brands” folder in the my6sense Streams view]

These are essentially the reasons why my6sense is not a part of my daily news reading routine.

The Incredible Potential

I have already written about how my6sense is part of the Personalized Serendipity Quadrant in my Relevancy Matrix – the hottest space for many startups today. After having tried many services aiming for personalized relevance, I can say without hesitation that my6sense’s Digital Intuition Engine is way ahead. Its combination of semantic analysis and social graph filtering provides a unique experience that you can intuitively feel working for you.

The mobile apps are said to be just a demo of the powerful API provided by the backend. It is exciting to think of the possibilities in which this engine can be utilized. From personalized content on publishers’ websites to integration with Twitter clients – my6sense has potential to unlock relevance in the ever-increasing information deluge. With Barak Hachamov’s vision and Louis Gray’s marketing, there is incredible promise indeed.

For most average users, who need to catch up with news and shares from social networks, I would heartily recommend my6sense in the Top 5 ‘must-have’ mobile app category.


Mapping Startups & Services Filtering For Relevance In A Matrix

After looking at the different approaches to filtering for Relevance, I have been seeking a way to map them visually. There are many different startups competing in this space along with the giants, and a way to map them in a matrix would help us see the big picture of how the battle for relevance is evolving on the social web.

What are the fundamental ways in which these approaches and startups differ? These could form the axes around which we can then proceed to map them.

The Popular – Personalized Axis

Filtering either works by showing us the most popular stuff being shared online, or by understanding our individual preferences and surfacing personalized content. Thus, we have the following axis:

[Image: the Popular – Personalized axis]

The Serendipity – Search Axis

You either search for content or you see it serendipitously without seeking anything specific. Search is actively initiated by the user and is goal-driven, while serendipitous discovery is gifted with the user being passive at the receiving end. This gives us our second axis:

[Image: the Serendipity – Search axis]

The Filtering for Relevance Matrix (FORMAT)

We combine these two axes to form the backbone of our visualization. We then place different services within our matrix as per their core filtering approach. The result is the Filtering FOR Relevance Matrix (FORMAT) as seen below:

 

[Image: the Filtering for Relevance Matrix (FORMAT)]

Let us now look at each quadrant closely.

Popular – Search Quadrant

This is the simplest and oldest of all: search powered by algorithms that surface the most popular content online. This also includes Twitter search services like Topsy. These services are powered by algorithms such as PageRank, PersonRank, Resonance, etc. to surface the most popular result relevant to a query.

This approach dominated the Web 1.0 era before the advent of the social web.

Popular – Serendipity Quadrant

Services in this category help you find the most popular content being shared online across different social networks. These were the next to evolve in the Web 2.0 era, beginning with social bookmarking services like Reddit, StumbleUpon, etc.

There is an element of personalization provided by many of these, in that you “follow” some users, but the motive behind such following is less to seek personalized content, more to seek trending, viral content.

Note how Digg is attempting to move from this quadrant to the personalized quadrant, and facing hurdles along the way.

Search – Personalized Quadrant

A breed of services has evolved around delivering personalized recommendations and content tailored for your needs. Hunch learns about you and acts as a “taste engine”, while Blekko allows you to personalize your searches with slashtags. Google is making forays in this space with its Social Search service, which tries to personalize search results based on your social graph.

Personalized Serendipity Quadrant

This is the hottest space where most of the competition is today.

Twitter Lists are personalized (created by you) and deliver fresh, serendipitous content relevant to your interests. Facebook Likes give you serendipitous discovery from your personal friends. Flipboard provides a social magazine based on your personal social circle on Facebook and Twitter. My6sense delivers new content using ‘Digital Intuition’. Vertical networks like Last.fm deliver music recommendations based on your individual taste. Personalized Twitter newspapers give you fresh content filtered by your social graph on Twitter.

Note how Datasift lies at the center of the matrix. This is because Datasift is a platform providing different filtering services and approaches. Developers may use the platform to develop different services and apps that can lie in any of these quadrants.
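The quadrant placements described above can also be written down as data, which makes it easy to ask who shares a quadrant with whom. The Python sketch below only encodes the services named in this post; the enum names and the `services_in` helper are made up for the illustration, and Datasift is left out because it sits at the center rather than in a single quadrant.

```python
from enum import Enum


class Ranking(Enum):
    POPULAR = "popular"
    PERSONALIZED = "personalized"


class Discovery(Enum):
    SEARCH = "search"
    SERENDIPITY = "serendipity"


# Placements as described in the quadrant sections above.
PLACEMENTS = {
    "Topsy": (Ranking.POPULAR, Discovery.SEARCH),
    "Reddit": (Ranking.POPULAR, Discovery.SERENDIPITY),
    "StumbleUpon": (Ranking.POPULAR, Discovery.SERENDIPITY),
    "Digg": (Ranking.POPULAR, Discovery.SERENDIPITY),
    "Hunch": (Ranking.PERSONALIZED, Discovery.SEARCH),
    "Blekko": (Ranking.PERSONALIZED, Discovery.SEARCH),
    "Google Social Search": (Ranking.PERSONALIZED, Discovery.SEARCH),
    "Twitter Lists": (Ranking.PERSONALIZED, Discovery.SERENDIPITY),
    "Facebook Likes": (Ranking.PERSONALIZED, Discovery.SERENDIPITY),
    "Flipboard": (Ranking.PERSONALIZED, Discovery.SERENDIPITY),
    "my6sense": (Ranking.PERSONALIZED, Discovery.SERENDIPITY),
    "Last.fm": (Ranking.PERSONALIZED, Discovery.SERENDIPITY),
}


def services_in(ranking, discovery):
    """Return the services placed in the given quadrant, sorted by name."""
    return sorted(
        name for name, quadrant in PLACEMENTS.items()
        if quadrant == (ranking, discovery)
    )


print(services_in(Ranking.POPULAR, Discovery.SERENDIPITY))
```

A startup deciding which quadrant to target could add itself to `PLACEMENTS` and list its direct competitors with one call.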

How does FORMAT help?

So what is the point of this exercise? Using FORMAT:

  • We see the big picture of how services providing relevance and filtering are evolving.
  • We see how personalized serendipity is the holy grail of the social web right now.
  • We see how different services relate to each other and who is competing with whom and how.
  • We see how identifying the target quadrant is important for any new startup in this space.
  • We see how users provide friction when a service tries to change quadrants (Digg).

If you are involved in a startup aiming to provide filtered, relevant content to users, which quadrant would you target? See how FORMAT helps?


DataSift Curation Engine Aims for Relevance in Real-time

As I have said many times previously, if 2009 was all about the hype of Real-time, the future is all about capturing Relevance in real-time. Datasift has partnered with Twitter to get the full Twitter firehose and is building a platform to enable curation and filtering in real-time.

An introductory video about Datasift was posted in their first blog post, which didn’t reveal much about how the platform works. Now, uber-geek Robert Scoble has posted a video of an extensive discussion with Datasift’s founder, Nick Halstead.

Robert Scoble with Datasift founder Nick Halstead

This post is a summary of Datasift as discussed in the video, concluding with my own thoughts.

The Basics

Twitter’s firehose at present has around 800 tweets/sec, or 70 million tweets/day. Datasift can filter this firehose using over 20 variables. Examples of these variables include:

  • Profile information like name, location, bio, number of follows, followers, lists, etc.
  • Text and language of tweets
  • Geo-location of tweets
  • Verified users
  • Source of tweets – web, Seesmic, TweetDeck, etc.
  • Number of Retweets
  • Whether tweet contains a hyperlink

Datasift is a rules-based engine that can filter this firehose using thousands of complex rules and provide a filtered stream in real-time within milliseconds. It is built using a Service Oriented Architecture and has an API.

The Rules

Rules can comprise any combination of filters using the above variables. Rules can be combined and merged, or added and subtracted, into a single new rule. Stream outputs from Datasift using such rules can become columns in Twitter clients like TweetDeck.

Here are a few examples of how rules can be used:

  • Show me tweets containing “google” from users who don’t have “social media” in their bio, and who have more than 500 followers.
  • Show me tweets from my curated Twitter list of tech brands that have more than 100 Retweets.
  • Show me tweets originating from within a radius of 5 miles from the location of XYZ Conference that don’t have swear words, irrespective of whether their tweets contain the hashtag for the conference.
  • Show me tweets originating from Starbucks shops around the world, of users who are “Verified Accounts”, irrespective of what they’re about.
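To show how such rules might compose, here is a minimal Python sketch of the first example above. It does not use Datasift's actual rule language or API, neither of which is described in this post; the `Tweet` fields, the `Rule` class, and the helper functions are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Tweet:
    text: str
    bio: str
    followers: int


class Rule:
    """A predicate over tweets that can be combined with &, | and ~."""

    def __init__(self, predicate: Callable[[Tweet], bool]):
        self.predicate = predicate

    def __call__(self, tweet: Tweet) -> bool:
        return self.predicate(tweet)

    def __and__(self, other):
        return Rule(lambda t: self(t) and other(t))

    def __or__(self, other):
        return Rule(lambda t: self(t) or other(t))

    def __invert__(self):
        return Rule(lambda t: not self(t))


def text_contains(word):
    return Rule(lambda t: word.lower() in t.text.lower())


def bio_contains(phrase):
    return Rule(lambda t: phrase.lower() in t.bio.lower())


def followers_over(count):
    return Rule(lambda t: t.followers > count)


# Tweets containing "google", from users without "social media" in their bio,
# who have more than 500 followers.
google_rule = (text_contains("google")
               & ~bio_contains("social media")
               & followers_over(500))


def filter_stream(tweets, rule):
    """Lazily yield only the tweets that satisfy the rule."""
    return (tweet for tweet in tweets if rule(tweet))
```

Because every combination is itself a `Rule`, a rule written by one curator can be merged with or subtracted from someone else's, which is the reuse the post describes.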

Datasift’s website is intended as a community website for curators and developers to collaboratively work on developing these rules. You can leverage rules created by others to avoid duplication of effort. Rules are classified with tags, and Datasift provides search, ranking and trending for easier discoverability of rules.

Partnerships for Influence Tracking and Sentiment Analysis

Datasift has partnered with PeerIndex and Klout to enable filtering using their influence and authority scores. It has also partnered with a firm for real-time sentiment analysis.

Thus, any of the above rules can be filtered further using such scores, and a stream of tweets with negative sentiment about a brand or product, combined with any other rules, can be monitored in real-time.

Alerts and Analytics

For esoteric rules that may provide a result infrequently, alerts can be set up. The example discussed is of any politicians from a Twitter list tweeting the word “scandal”. Developers can send these alerts as email, SMS, or notifications on smartphones.

The resulting streams from all rules applied by the engine are stored by Datasift. This data can be extracted, segmented, and analyzed later. For example, this can be used to track the performance of social media campaigns.

Relevance Filtering of Links

Datasift can use TweetMeme and other databases to check the links in tweets, and determine whether they are relevant to a specific topic. There are not many details on how this is achieved, but Nick says that all sites are already classified into different subjects by TweetMeme and other such databases.

Blekko-style Twitter Search

Datasift has developed a prototype of Twitter search along the lines of Blekko’s slashtags. Thus, along with your query text, you can use filters such as “/nolinks” to get tweets without links, or “/California” to get tweets originating from CA.
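A slashtag-style query is easy to take apart. The Python sketch below is only an illustration of the idea; the `parse_query` function is hypothetical and says nothing about how Datasift's prototype or Blekko actually parse queries.

```python
def parse_query(query):
    """Split a query into its free-text part and a list of slashtag filters.

    Tokens that start with "/" are treated as filters (lower-cased, without
    the slash); everything else is kept as search text.
    """
    terms, filters = [], []
    for token in query.split():
        if token.startswith("/") and len(token) > 1:
            filters.append(token[1:].lower())
        else:
            terms.append(token)
    return " ".join(terms), filters


print(parse_query("earthquake /nolinks /California"))
```

Each extracted filter name would then be mapped to a filtering rule, for example dropping tweets with links or restricting by location.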

RSS Feeds

Compared to the massive volume of the Twitter firehose, the volume of RSS is minimal. Datasift plans to have their own PubSubHubbub server. Developers and third-parties can plug in any RSS feeds and use Datasift’s filtering rules to get an output feed.

Revenue Model

One option is free access to the stream with in-stream ads. Ads will be tailored and designed for the target form factor – desktop/mobile/tablet/etc.

The second option is selling data B2B for developers and brand companies, charged by volume of data consumed.

Prospective Partners

Datasift is seeking to work with startups like Flipboard, who are creating new ways for curated content consumption. This can also include any of the startups focusing on Relevance, such as TwitterTimes or Paper.li.

My Thoughts

When I compared approaches to filtering information for relevance, I had suggested that the service most likely to succeed would be the one that supports multiple approaches and platforms. We can easily see that Datasift supports all platforms and several approaches like crowdsourced filtering, influence filtering, location filtering, etc. It is easily the most powerful relevance filtering engine I have seen yet.

The market of end-users for curated real-time content is at present unknown. Startups involved in creating pleasant experiences for consuming content have yet to find a monetization strategy. The degree of Datasift’s success from an end-user perspective is largely dependent on:

  • The creativity of developers and curators to create compelling experiences, and
  • How the monetization strategies of presentation apps fare and how Datasift is able to work with them

Nevertheless, with the amount of content being created online growing exponentially, curation and filtering will eventually become necessities for any social media client. It is just a matter of time.

I also see a bright future on the B2B front. By partnering with influence and authority tracking companies, combined with sentiment analysis, Datasift may already be a compelling choice for brand monitoring and social media reputation tracking.

Lastly, thanks to Robert Scoble and Nick Halstead for the interesting interview.


In a previous post, we looked at the big shift From Numbers To Relevance. There are dozens of apps/sites that are focusing on filtering information today, but which of them will succeed?

To attempt to answer that question, let’s first look at the different approaches employed by such apps/sites today in the search for Relevance. This is a topic that is usually the subject of scholarly research papers in academia; this is only a layman’s overview.

The different approaches I observe are:

  • Algorithmic Filtering
  • Filtering Based on Social Graph
  • Human Filtering
  • Crowdsourced Filtering
  • Shared Sources Filtering (Meta)
  • Influence Filtering
  • Social Search
  • Location Filtering

Algorithmic Filtering

If you tell us what you want or like, our software can show you what you will like.

[Image: Google Suggest]

The predominant use of algorithmic filtering is in web search, where Google has dominated and driven the web economy for the past decade. You search for something and Google’s search algorithm filters billions of web pages to find the most relevant results.

Google also uses algorithmic filtering to suggest items in Google Reader’s “Sort by Magic” feature.

Pros: Highest relevance when searching for information.
Cons: No serendipity. Only useful for the goal-oriented task of search. No personalization (search engines are typically unaware of demographic information).

Filtering Based on Social Graph

If your friends like it, you’ll probably like it too.

This is the dominant approach being used today by various apps and websites. For example, Facebook uses the EdgeRank formula to determine what to display in your news feed:

[Image: the EdgeRank formula]

The key driving factor is the affinity score between you and the source.
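
As publicly described at the time, the EdgeRank formula is usually written as a sum over the "edges" (interactions) attached to a story. The notation below is my reconstruction from those public descriptions and should be read as a sketch, not as Facebook’s actual implementation:

```latex
\text{EdgeRank} = \sum_{\text{edges } e} u_e \, w_e \, d_e
```

Here u_e is the affinity score between the viewing user and the creator of the edge, w_e is a weight for the type of edge (a comment, a like, a tag, and so on), and d_e is a time-decay factor that shrinks as the edge gets older. A story with many recent interactions from people you are close to scores higher than an old story from a distant acquaintance.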

Google also uses this approach when recommending posts in Google Buzz.

Most of the apps listed in my previous post, as well as the new Digg, use this or a similar approach that employs your Twitter or Facebook friends to recommend items.

Pros: High serendipity. Helps being “in the know”, a socially cool factor. Higher personalization.
Cons: Relevance depends on social graph, which often is not optimized for relevance, as Kevin Anderson noted.

Human Filtering

I trust a specific person to share all of the good stuff I like to know.

Some people make it a habit to go through news items every day and share what they deem to be the most significant ones. Others begin relying on them as trusted news sources.

Pros: High serendipity. Easy to use. Quickly become part of an influencer’s social circle.
Cons: Unreliable. Susceptible to preferences and agendas of other people.

Crowdsourced Filtering

Quickly see what’s most important to know.

TweetMeme, OneRiot, Digg, and many other social bookmarking services aggregate the actions of millions of people to surface the most popular stories. Techmeme and MediaGazer add human curation to the aggregation of thousands of websites to surface the most important tech and media stories.

Pros: Be up-to-date with the most important/popular need-to-know information.
Cons: No personalization. Popular doesn’t always equate to relevant.
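
To make the idea concrete, here is a minimal sketch of crowdsourced filtering: count how many distinct people shared each item and surface the top few. The function name, the (user, item) event format, and the sample data are all hypothetical; none of the services above is known to work exactly this way.

```python
from collections import Counter


def most_popular(share_events, top_n=3):
    """Rank items by how many distinct users shared them.

    share_events is an iterable of (user, item) pairs. This format is an
    assumption made for illustration, not any real service's data model.
    """
    # Deduplicate so one user sharing the same item twice counts once.
    unique_shares = set(share_events)
    counts = Counter(item for _user, item in unique_shares)
    # Sort by share count (descending), then by item name for a stable order.
    ranked = sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))
    return ranked[:top_n]


events = [
    ("ann", "story-a"), ("bob", "story-a"), ("cat", "story-a"),
    ("ann", "story-b"), ("bob", "story-b"),
    ("ann", "story-b"),  # duplicate share, counted once
    ("cat", "story-c"),
]
print(most_popular(events, top_n=2))
```

Note that every reader gets the same ranking, which is exactly the "no personalization" drawback listed above.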

Shared Sources Filtering (Meta)

If you read from sources similar to someone else, you’ll probably like their other sources too.

Facebook uses this approach to suggest new Fan Pages that you may like because your friends like them. Google Reader also uses this to recommend new RSS feeds. Toluu also compares your subscribed RSS feeds with other users to help you discover new feeds.

[Image: Google Reader recommendations]

Pros: Useful for discovering new sources in social networks.
Cons: Filters sources, not actual news items, hence limited in scope.
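
One simple way to sketch shared sources filtering is to measure the overlap between my subscriptions and someone else’s, then suggest their feeds that I do not have yet. The example below uses Jaccard similarity for the overlap. That choice, the function names, and the sample feeds are my own assumptions; I do not know what Google Reader or Toluu actually use.

```python
def jaccard(a, b):
    """Overlap between two sets of feeds, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend_feeds(my_feeds, other_users, top_n=3):
    """Suggest feeds I lack, weighted by how similar each user's list is to mine."""
    scores = {}
    for feeds in other_users.values():
        similarity = jaccard(my_feeds, feeds)
        for feed in feeds - my_feeds:
            scores[feed] = scores.get(feed, 0.0) + similarity
    ranked = sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))
    # Drop feeds that only come from users with nothing in common with me.
    return [feed for feed, score in ranked[:top_n] if score > 0]


me = {"techmeme", "techcrunch", "rww"}
others = {
    "ann": {"techmeme", "techcrunch", "gigaom"},  # overlaps with my feeds
    "bob": {"cooking-daily", "garden-weekly"},    # nothing in common
}
print(recommend_feeds(me, others))
```

The output is a list of sources, not of news items, which is the limitation noted in the cons above.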

Influence Filtering

Only read what influential people are saying/sharing.

This approach uses influence scores of sources to filter the news feed. An example of this is HootSuite, which uses Klout to let you filter tweets according to their authors’ Klout scores.

[Image: Klout filtering in HootSuite]

Pros: Flexibility. High serendipity. Helps being “in-the-know”.
Cons: Influence metric is unreliable. Currently only available for real-time feeds like Twitter.
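
In its simplest form, influence filtering is a threshold on a per-author score. The sketch below assumes a plain dictionary of scores and a hypothetical tweet format; it does not call the Klout or HootSuite APIs, and the numbers are made up.

```python
def filter_by_influence(tweets, scores, min_score):
    """Keep tweets whose author's influence score is at least min_score.

    Authors with no known score are treated as having a score of 0.
    """
    return [t for t in tweets if scores.get(t["author"], 0) >= min_score]


# Hypothetical influence scores on a 0-100 scale.
scores = {"ann": 72, "bob": 35}
tweets = [
    {"author": "ann", "text": "New API launched"},
    {"author": "bob", "text": "Lunch time"},
    {"author": "cat", "text": "Hello world"},  # no score on record
]
kept = filter_by_influence(tweets, scores, min_score=50)
print([t["author"] for t in kept])
```

Raising or lowering min_score is what gives this approach its flexibility, but the result is only as good as the score itself.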

Social Search: Algorithms + Social Graph

Let your social circle find the most relevant results for you.

Social Search uses a combination of algorithms and social graph to find relevant results.

[Image: Social Search]

Pros: High relevance. Combines goal-orientation of search with serendipity of social. Very useful for news items from recent past.
Cons: Requires searching. Less useful for fresh, real-time news.
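
A toy version of social search can be sketched as a keyword match whose score gets a boost for every friend who shared the item. The scoring rule, the social_boost parameter, and the document format are all assumptions for illustration; real social search engines are far more elaborate.

```python
def social_search(query, documents, friends, social_boost=0.5):
    """Rank documents by keyword overlap, boosted by shares from friends."""
    terms = set(query.lower().split())
    results = []
    for doc in documents:
        words = set(doc["text"].lower().split())
        text_score = len(terms & words) / len(terms) if terms else 0.0
        if text_score == 0:
            continue  # it is still a search: unrelated items are dropped
        friend_shares = len(friends & set(doc["shared_by"]))
        results.append((text_score + social_boost * friend_shares, doc["title"]))
    # Highest combined score first, title as a tie-breaker.
    results.sort(key=lambda r: (-r[0], r[1]))
    return [title for _score, title in results]


docs = [
    {"title": "A", "text": "facebook privacy settings guide", "shared_by": []},
    {"title": "B", "text": "facebook privacy changes explained", "shared_by": ["ann", "bob"]},
    {"title": "C", "text": "best pizza in town", "shared_by": ["ann"]},
]
print(social_search("facebook privacy", docs, friends={"ann", "bob"}))
```

Both A and B match the query equally well, but B ranks first because two friends shared it. C is shared by a friend but does not match the query, so it never appears.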

Location Filtering

If we know where you are, we can help you find relevant results.

Location is a treasure trove for relevance. As the mobile web explodes, services that provide information about nearby businesses or friends are gaining increased adoption.

Pros: High relevance. Can be serendipitous with real life impact.
Cons: Privacy concerns. Limited in scope.
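
At its core, location filtering keeps only what lies within some radius of where you are. The sketch below uses the standard haversine formula for distance on a sphere; the place names, coordinates, and the nearby function are hypothetical.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points in kilometres."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def nearby(places, lat, lon, radius_km):
    """Return names of places within radius_km, nearest first."""
    hits = [(haversine_km(lat, lon, p["lat"], p["lon"]), p["name"]) for p in places]
    return [name for dist, name in sorted(hits) if dist <= radius_km]


# Made-up sample points: two close to the user, one much farther away.
places = [
    {"name": "Museum", "lat": 37.8000, "lon": -122.4500},
    {"name": "Cafe", "lat": 37.7790, "lon": -122.4170},
    {"name": "Diner", "lat": 37.8044, "lon": -122.2712},
]
print(nearby(places, 37.7749, -122.4194, radius_km=5))
```

The radius is the "degree of filtering" here: a small radius gives high relevance but a narrow scope.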

Conclusion: Which Approach is the Best?

None. Relevance is dependent on the requirements of an individual at a specific moment in time. These requirements change from time to time and from person to person. There is no killer approach to relevance.

Which app or service is likely to succeed? I think the following factors will make a difference:

  • Support for multiple approaches
  • Flexibility of degree of filtering
  • Number of Mobile Platforms supported
  • Next Step: What can you do with the info? (e.g. Siri lets you take actions)

What do you think? Are there other approaches that I missed? Which other factors matter?


Predicting Tech News in 2015

Last month, Stuart Miles, founder of the gadget and tech blog Pocket-Lint, asked me to contribute to its feature on “FutureWeek”:

What gadgets will we be using in 2015, where will Augmented Reality take us? What about robots, gadgets of the future, super-fast internet speeds, cars, materials of 2015 and much, much more.

The entire set of stories makes excellent reading with insights from thought-leaders around the web. Apart from gadgets, there are other posts on what to expect from the semantic web and how we will consume content in 2015. Being in the technology news business, my thoughts were included in What will be the big tech stories in 2015.

I would like to elaborate on my thoughts here. I must say that these ‘predictions’ are nothing but a reflection of my hopes as well as fears. Further, I sent these on 21st March, after which there have been some interesting developments.

Facebook will not become AOL 2.0. To remain competitive, it will be forced to interoperate with other networks.

There has been discussion on this issue time and again on the web. I personally think the web is resilient to any attempts to dominate it in the long term. I also think the team working at Facebook is wise to learn from the past.

Social Networks will no longer be "places" on the web. Instead, your "social graph" will follow you on the web.

  1. You will control your social graph – choose and add from among different networks – Facebook, Twitter, Google, Windows Live – which will all be interoperable using an open standard. This evolution of social networking will be similar to that of Instant Messaging, where the open XMPP standard became popular, achieving interoperability to an extent.
  2. Rather than social networks wanting you to visit and spend time on their site, they will compete to become an inseparable part of the time you spend online, whether mobile or desktop.
  3. The social graph that follows you will help personalize and customize your browsing experience for everything:
     • Primary Content on websites – for example, which headlines/articles you see
     • Ads – tailored to your social identity and graph
     • Search Results
     • Which friends of yours are online, shown within your browser
     • Reactions/comments from your friends optionally shown for the web page you’re visiting

All the above is pretty self-explanatory. We are already seeing glimpses of this in Facebook chat, Google Sidewiki, and so on. Interestingly, one week after I sent these, there were reports of Facebook planning a “Like” button for any content anywhere on the web, and launching a Meebo-style persistent toolbar. Imagine my reaction when I saw these developments! :)

Websites will personalize according to your social graph using mechanisms like Facebook Connect, Google Friend Connect, Twitter Following/Followers graph, etc.

This is an ongoing trend I see towards a personalized relevant web. Again, a week afterwards, there were reports of Facebook sharing your profile data with external sites, so that these sites will tailor content for you.

I had also pointed out Facebook Connect being a mechanism for precisely this goal, when I wrote in January about Facebook’s non-portable data-portability. Marshall Kirkpatrick now points it out as well: there’s a big difference between opt-in and opt-out “data portability”.

Anti-trust legislation will be a major threat to Google’s dominance both in US and EU. "Will Google split up?" will be a question discussed in the media.

This is speculation. Google’s expansion into virtually every aspect of technology has already brought it under the scrutiny of anti-trust authorities.

Apple’s mindshare will start to decline. As Steve Jobs approaches retirement, questions will be asked about Apple’s survival.

Two weeks after I sent this, the question of what happens after the iPad and after Steve Jobs has been asked. I have my doubts about Apple’s innovation and competitive capabilities in a post-Jobs era, but would be happy if they’re proved wrong.

Privacy and Anti-Piracy will continue to make headlines.

  1. On Privacy: We will move to a public-by-default, private-by-opt-in model.
  2. On Anti-Piracy: Anti-Counterfeiting Trade Agreement (ACTA) will be in place, along with a global version of the DMCA.

These are my fears and they are very real. ACTA negotiations are making progress, and include a global version of the DMCA. The politicians behind these negotiations may not understand technology, and the people who understand the technology are busy writing about other topics that get their blogs more traffic. It is a case where those who matter don’t understand, and those who understand don’t matter.

Do read the other pieces in FutureWeek. And thanks to Stuart for the opportunity to share my thoughts!


The Evolution from Numbers to Relevance

Social media and businesses on the web today are driven by the numbers game – of traffic, page views, and follower numbers. But the trend I foresee is:

The web is evolving from a numbers model to a relevance model.

Paradigm Shift: What is the Relevance Model?

Historically, monetization driven by CPC/CPM based advertising has led to websites and marketers focusing on page views and traffic. This is partly the cause of social media being spammed by internet marketers, ranking algorithms being gamed for traffic, and so on.

Numbers Model        | Relevance Model
# of Followers       | Context-driven Lists
# of Clicks          | # of Interactions
# of Page Views      | # of Returning Visitors
# of Ads Displayed   | Time spent on site
# of Ads Clicked     | # of Subscriptions Gained
Obnoxious Ads        | Relevant Ads
Influence Management | Dynamic Social Graph
Sharing Orgy & Noise | Curation
Information Overload | Filtered, Relevant Information
Traffic Economy      | Attention Economy
SEO and SMO          | Personalization

The above table lists different attributes of this paradigm shift. The “Influence Management” entry links to a post by Mia Dand who describes how leveraging social media is often about using a handful of influencers (read: with large follower numbers) to spread your message. Contrast that with Dynamic Social Graphs as described by Robert Scoble, where influence is dynamically determined based on relevance and not just numbers.

The Facebook Kingdom was built on Relevance

The king of the social web, Facebook, was not built on numbers, but relevance.

Facebook has succeeded, garnering over 400 million users, because it grew on a base of real-life friends who were relevant in the users’ social circle. Other networks have failed to challenge Facebook partly because they have tried to go the other way around – from numbers to relevance.

Prioritizing numbers over relevance is putting the cart before the horse.

Even as its explosive growth continues unabated, Facebook has not compromised on relevance. It knows that its success depends on users finding relevant content on Facebook and is willing to sacrifice advertising revenue to avoid becoming irrelevant.

I’ve touched upon various aspects of this ongoing theme while tracking the Google vs. Facebook race towards a relevant real-time. It’s becoming increasingly apparent that relevance wins over real-time.

While Facebook has never been in the numbers game, other networks like Digg are now moving from the numbers model to the relevance model.

Relevance vs. Real-Time in Location Check-ins

Consider the hottest trend of check-ins via location services, such as Foursquare or Gowalla.

When I check in at a restaurant, the real-time check-ins of my friends in other places are irrelevant. What is more important and relevant to me are the tips from friends who have checked in at the same place where I am right now.

In all cases, my friends are relevant in real-time only if they are at the same location as me. My other friends NOT at the same location become irrelevant.
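
Put as code, the rule is a one-line filter: of all my friends’ check-ins, keep only those at my current venue. The check-in format and names below are hypothetical and are not taken from the Foursquare or Gowalla APIs.

```python
def relevant_tips(my_venue, friend_checkins):
    """Return (friend, tip) pairs only for check-ins at my current venue."""
    return [
        (c["friend"], c["tip"])
        for c in friend_checkins
        if c["venue"] == my_venue and c.get("tip")  # skip check-ins without a tip
    ]


checkins = [
    {"friend": "ann", "venue": "Luigi's", "tip": "Try the gnocchi"},
    {"friend": "bob", "venue": "Airport", "tip": "Long security line"},
    {"friend": "cat", "venue": "Luigi's", "tip": ""},
]
print(relevant_tips("Luigi's", checkins))
```

Bob’s check-in may be the most recent one in the stream, but it is dropped because he is somewhere else.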

Relevance wins over real-time.

The Mobile View

While mobile internet access grows, the screen of mobile devices remains constrained by its form factor. This is a major factor driving this evolution. If the content on your screen is constrained by its display, it had better be relevant.

Lifestreaming and Aggregation

As I discussed extensively in my post on why Google Buzz should not simply be yet-another-aggregator, lifestreaming and aggregation have failed to take off and gain mainstream adoption. The reason is simple – lack of relevance.

Which is why it is personally heartening to see the champions of lifestreaming and aggregation turn their focus towards relevance and disaggregation.

Startups focusing on Relevance

Quite a few startups are hoping to capitalize on this trend:

  • my6sense – recently introduced an ‘Attention API’ allowing publishers to deliver relevant content to users
  • Cadmus – auto-filters Twitter/RSS streams by relevance
  • Knowmore – surfaces relevant stuff from Twitter/Facebook
  • TwitterTimes – personalized aggregation from Twitter
  • FeedTrace – personalized aggregation from Twitter
  • VictusMedia – ‘Intelligent Media Manager’
  • MixPanel – tracking what I’ll term “Relevance Analytics” for publishers
  • Cascaad – personalized news stream based on social graph from Twitter/Facebook

From Around the Web

Here are related posts that further elaborate on this evolution:
