Skip to main content

Posts

Showing posts from 2009

The Web of Nations

I found a very intering poster in WWW 2009 with a very cool title "The Web of Nations". The previous study found that the web at page level is looked like a bow-tie with regards to SCC(Strongly connected compoent), IN(link to SCC) and OUT(linked from SCC). But are every nations just like the world, are nations like USA affecting the structure of the web of the world? The authors of this 2 pages paper crawl billions of web pages of 200 nations to see what exactly the web of every single nations look like. Some interesting results comes out that nations like India and Vietnam have higher inlinks/outlinks rate than other countries, the inlinks and outlinks in those nations are more than other nations, but those nations have the lowest number of other nations related with their inlinks/outlinks.   I refer the paper as below: Chung, Sukwon and Shiowattana, Dungjit and Dmitriev, Pavel and Chan, Su (2009) The Web of Nations. In: 18th International World Wide Web Confere

PrefixSpan source code in python

The prefixspan is a key algorithm for mining sequential patterns. I have implemented the algorithm in Python. The algorithm is based on the following paper: Jian Pei, Jiawei Han, Senior Member, Behzad Mortazavi-asl, Jianyong Wang, Helen Pinto, Qiming Chen, Umeshwar Dayal. Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach. IEEE Transactions on Knowledge and Data Engineering, 2004. or their conference paper You may download the source code at the following addresses: Link1