Posts

Showing posts from February, 2010

Key/Value Store using Tokyo Cabinet

In this post, the test results show that the Tokyo Cabinet hash table performs remarkably well, which makes it convenient to use as a key/value store for data analysis. pytc is a Python interface to Tokyo Cabinet. pytc does not ship with documentation, but some examples can be found here.
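
As a rough illustration of the key/value pattern for data analysis, here is a minimal sketch that accumulates word counts in an on-disk store. It does not use pytc: it swaps in `dbm.dumb` from the Python standard library as a stand-in so the example runs without Tokyo Cabinet installed. The function names `count_words` and `read_count` and the file name `counts` are hypothetical, chosen only for this example.

```python
import dbm.dumb
import os
import tempfile


def count_words(db_path, words):
    """Accumulate word counts in an on-disk key/value store.

    Stand-in only: dbm.dumb is used here instead of a Tokyo Cabinet
    hash database, so performance will not match the results above.
    """
    with dbm.dumb.open(db_path, "c") as db:
        for word in words:
            key = word.encode("utf-8")
            # Keys and values are stored as bytes; a missing key counts as 0.
            current = int(db[key]) if key in db else 0
            db[key] = str(current + 1).encode("utf-8")


def read_count(db_path, word):
    """Return the stored count for a word, or 0 if it was never seen."""
    with dbm.dumb.open(db_path, "r") as db:
        key = word.encode("utf-8")
        return int(db[key]) if key in db else 0


# Hypothetical usage: the store lives in a temporary directory.
workdir = tempfile.mkdtemp()
store = os.path.join(workdir, "counts")
count_words(store, ["tokyo", "cabinet", "tokyo"])
```

The same open, put, get pattern should carry over to a Tokyo Cabinet hash database through pytc, though the exact calls should be checked against the pytc examples rather than taken from this sketch.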

Aardvark paper in WWW 2010: about social search

Aardvark, which powers vark.com, published an interesting paper entitled "Anatomy of a Large-Scale Social Search Engine" at WWW 2010. The paper describes the social search engine used by vark.com, which is based on social graphs and topics instead of keywords. The paper characterizes search engines like Google as following the library paradigm and Aardvark as following the village paradigm. A village-paradigm search engine gets answers by asking people in the social graph who are experts on the underlying topic. In the library paradigm, the search engine needs to figure out what a user wants based on keywords, search history, and user profile, which is considered a very difficult task. The village paradigm leaves that difficult part to human beings, so the only remaining problem is to find the right person. Aardvark's model considers a user u1 who asks a question q; the search engine should find the right user u2 to provide the answer. Aardvark associates both users and questions with topics. Aa