Research award for microblog search

When it rains it pours.

After the exciting news that Google funded my application to their digital humanities program, I found out this week that they will also fund another project of mine: Defining and Solving Key Challenges in Microblog Search. The research will focus largely on helping people find and make sense of information that comes across Twitter.

Over the next year the project will support me and two Ph.D. students as we address (and propose some responses to) questions such as:

  • What are meaningful units of retrieval for IR over microblog data?
  • What types of information needs do people bring to microblogging environments and how can we support them?  Is there a place for ad hoc IR in this space?  If not (or even if so) what might constitute a ‘query’ in microblog IR?
  • What criteria should we pursue to help people find useful information in microblog collections?  Surely time plays a role here.  Topical relevance is a likely suspect, as are various types of reputation factors such as TunkRank.
  • How does microblog IR relate to more established IR problems such as blog search, expert finding, and other entity search issues?
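
To make the third question above a bit more concrete, here is a minimal sketch of how time, topical relevance, and reputation might be folded into a single ranking score. Everything in it is an assumption made for illustration, not a method from the project: the `Tweet` fields, the term-overlap measure, the 24-hour half-life, the linear weights, and the `author_reputation` value (a stand-in for a score in [0, 1] from something like TunkRank) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Tweet:
    text: str
    age_hours: float          # time elapsed since the tweet was posted
    author_reputation: float  # hypothetical reputation score in [0, 1]


def topical_score(query: str, text: str) -> float:
    """Fraction of distinct query terms that also appear in the tweet text."""
    query_terms = set(query.lower().split())
    text_terms = set(text.lower().split())
    if not query_terms:
        return 0.0
    return len(query_terms & text_terms) / len(query_terms)


def recency_score(age_hours: float, half_life_hours: float = 24.0) -> float:
    """Exponential decay: the score halves every `half_life_hours`."""
    return 0.5 ** (age_hours / half_life_hours)


def combined_score(query: str, tweet: Tweet,
                   weights: tuple = (0.5, 0.3, 0.2)) -> float:
    """Weighted sum of topical, recency, and reputation evidence.

    The weights are arbitrary placeholders; choosing them well is
    exactly the kind of open question listed above.
    """
    w_topic, w_time, w_rep = weights
    return (w_topic * topical_score(query, tweet.text)
            + w_time * recency_score(tweet.age_hours)
            + w_rep * tweet.author_reputation)


def rank(query: str, tweets: list) -> list:
    """Return tweets ordered from highest to lowest combined score."""
    return sorted(tweets, key=lambda t: combined_score(query, t), reverse=True)
```

A linear combination is only the simplest option. Whether these signals should be combined this way at all, and whether individual tweets are even the right unit to rank, is part of what the project sets out to ask.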

This work builds on earlier work that I did with Gene Golovchinsky, as well as research I presented at SIGIR this week.

For me, one of the most interesting issues at work in microblog IR is: how can we aggregate (and then retrieve) data in order to create information that is useful once collected but that might be uninteresting on its own?

Is it useful to retrieve an individual tweet that shares keywords with an ad hoc query?  Maybe.  But it seems more likely that people might seek debates, consensus, emerging sub-topics, or communities of experts with respect to a given topic.  These are just a few of the aggregates that leap to mind.  I’m sure readers can think of others.  And I’m sure readers can think of other tasks that can help move microblog IR forward.

In case anyone wonders how this project relates to the other work of mine that Google funded (which treats retrieval over historically diverse texts in Google Books data), the short answer is that both projects concern IR in situations where change over time is a critical factor, a topic similar to what I addressed in a recent JASIST paper.


4 Comments on “Research award for microblog search”

  1. Jon says:

    Congratulations, Miles. Great news, and sounds like a really interesting project.

  2. Congratulations! And great catching up with you last week in Geneva.

  3. Congrats! Quite interesting research questions.

    I’m especially interested in the argumentation aspect:
    “But it seems more likely that people might seek debates, consensus, emerging sub-topics, or communities of experts with respect to a given topic.” (I’m working on argumentation on the social web.) Quotes are one way to find the debates and communities. For instance, this week, in response to the Prop8 ruling, the line “Gender no longer forms an essential part of marriage; marriage under law is a union of equals.” has been quoted and referenced considerably, with both positive and negative sentiment.

    Since Twitter posts are relatively brief, I find it hard to separate sentiment analysis from argumentation, though of course there are debates in Twitter conversations!

    So far events (a la “Tweet the Debates”) are the easiest to come across, I think. I’ll be interested to hear more as your project gets underway!

  4. Miles Efron says:

    Thanks, Jodi. Using quotes is a great idea for finding contentious topics. I suspect that will come in very handy!
