Search Engines and Page Ranking

Seminal Papers
The anatomy of a large-scale hypertextual Web search engine
S Brin, L Page - Cited by 1087

Authoritative sources in a hyperlinked environment
J Kleinberg - Cited by 1059

Graph structure in the web
A Broder, R Kumar, F Maghoul, P Raghavan, S … - Cited by 354

Scatter/gather: A cluster-based approach to browsing large document collections
DR Cutting, DR Karger, JO Pedersen, JW Tukey - Cited by 405

Search Articles
A modern-day discussion of HITS, PageRank, and the Teoma search engine
A Chronicle Article discussing Google and Teoma

Other Papers
Impact of Search Engines on Page Popularity
J Cho, S Roy (WWW 04 paper)

Building Nutch: Open Source Search
M Cafarella, D Cutting


Beyond Document Similarity: Understanding Value-Based Search and Browsing Technologies
A Paepcke, H Garcia-Molina, G Rodriguez-Mula, J Cho

Implicit Structure and the Dynamics of Blogspace
E Adar, L Zhang, LA Adamic, RM Lukose

Ranking the Web Frontier
N Eiron, KS McCurley, JA Tomlin - Cited by 1

A taxonomy of web search
A Broder

How a Search Engine Works

A UW Information Retrieval Systems course
Stanford Text Retrieval and Mining Course

How PageRank Works

The Google File System
S Ghemawat, H Gobioff, ST Leung - Cited by 14

The connectivity server: Fast access to linkage information on the web
K Bharat, A Broder, M Henzinger, P Kumar, S … - Cited by 78

A SearchEngineWatch Article about A9

Video

Udi Manber, Amazon, UW Lecture 2004, on search and Amazon's Search Inside the Book

Doug Cutting on Nutch

UW Lecture by Soumen Chakrabarti, gives a nice history including HITS, PageRank, etc.
    Mentions how HITS takes the docs that match the keywords and follows links one step in and one step out
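
    The HITS step mentioned in this lecture note can be sketched roughly as below. This is only a minimal illustration: the toy link graph, the page names, and the helper names (expand_root_set, normalize, hits) are all made up for the example, and it leaves out refinements from Kleinberg's paper such as limiting how many in-linking pages get added.

```python
from math import sqrt

def expand_root_set(root, out_links):
    """Grow the keyword-matching root set by one link step out and one step in."""
    base = set(root)
    for page, targets in out_links.items():
        if page in root:
            base.update(targets)              # pages the root set links to
        elif any(t in root for t in targets):
            base.add(page)                    # pages that link into the root set
    return base

def normalize(scores):
    """Scale scores to unit Euclidean length (no-op if all are zero)."""
    norm = sqrt(sum(v * v for v in scores.values()))
    return {p: (v / norm if norm else 0.0) for p, v in scores.items()}

def hits(base, out_links, iterations=50):
    """Iterate hub and authority scores over links inside the base set."""
    edges = [(p, q) for p in base for q in out_links.get(p, ())
             if q in base and q != p]
    hub = {p: 1.0 for p in base}
    auth = {p: 1.0 for p in base}
    for _ in range(iterations):
        new_auth = {p: 0.0 for p in base}
        for p, q in edges:
            new_auth[q] += hub[p]             # good authorities are cited by good hubs
        new_hub = {p: 0.0 for p in base}
        for p, q in edges:
            new_hub[p] += new_auth[q]         # good hubs cite good authorities
        auth, hub = normalize(new_auth), normalize(new_hub)
    return hub, auth

# Hypothetical toy graph: page -> pages it links to.
out_links = {"a": ["b", "c"], "d": ["b"], "b": ["e"], "f": ["g"]}
root = {"b"}                                  # pretend only "b" matches the query
base = expand_root_set(root, out_links)
hub, auth = hits(base, out_links)
print(sorted(base))
print(max(auth, key=auth.get), max(hub, key=hub.get))
```

    In the toy graph, "c" stays out of the base set because it is two steps from the root page, which is the "one distance in and out" point above.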

Google's Linux Cluster, UW 2002 talk by Google's Urs Hölzle

Larry Page in 2002, Google should not be an anomaly. (Can't get it to play, codec problems.)

Search Engine Optimization
Eric Wolfram's page on increasing your page's Google rank.
Google's take on Search Engine Optimization