Funded by the European Union. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Education and Culture Executive Agency (EACEA). Neither the European Union nor EACEA can be held responsible for them.

Tips

Ranking pages for a web search engine

Globally said, to become famous on the internet, i.e. get a high rank in a search engine, you need many high ranking websites (not dummy ones you make yourself ) referring to your site. The ranking is relative with respect to the structure of the web. To find the ranking requires solving some numerical equality to obtain a fixed point in a procedure. The basis is a square matrix Q with entrances Q_ij? 0 if there is no link from web j to i and a value N_j if there is, where N_j is the total number of outgoing links in web j. Notice that the sum of the columns of Q is 1 and the number of nonzeros in a row gives the number of sites that link to i. If the ranking is given by a vector r, we "simply" have to solve r=Qr. Given that r should sum to one, be nonnegative and Q has billions of rows, it does not feel easy to find a ranking. There is the possibility to use the power method. This method multiplies the iterative vector r_k with the matrix and normilizes it such that the sum is one. The convergence depends in fact on the second eigenvalue. The smaller that is, the faster the convergence.

Example:
A small instance example does of course not reflect what happens on the internet. However, it can easily be constructed. The book of Eldén provides a small example with 6 elements, where you can find the eigenvector of the 6x6 Q matrix with eigenvalue 1 and observewhat happens if one performs the power method. Remember, to get famous, just take care that "famous" websites refer to yours or select a very special name that is unique in the world.

Reference:
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Comput. Networks ISDN Syst., 30:107–117, 1998.
L. Eldén. Numerical linear algebra in data mining. Acta Numer., 15:327–384, 2006.
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Stanford Digital Library Working Papers, Stanford, CA, 1998.

Author of the tip:
Eligius Hendrix
University of Malaga