20071223

The Anatomy of a Large-Scale Hypertextual Web Search Engine

The Anatomy of a Large-Scale Hypertextual Web Search Engine (1998) (641 citations)
Sergey Brin, Lawrence Page
Computer Networks and ISDN Systems


View or download:
stanford.edu/pub/papers/google.pdf
dbs.cs.unisb.de/lehr...googlewww98.ps
From: stanford.edu/pub/papers/
Homepages: S.Brin L.Page

Abstract: In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. The prototype with a full text and hyperlink database of at least 24 million pages is available at http://infolab.stanford.edu/~backrub/google.html. To engineer a search engine is a challenging task. Search engines index tens to...

Cited by:
A Sketch-based Sampling Algorithm on Sparse Data - Ping Li Pingli
Proceedings of 2002 International Conference on Parallel.. - Popularity-Based Ppm An
Utilizing Human Categorisation Ability for Knowledge Management - James Sinclair

Similar documents (at the sentence level):
13.8%: A Survey On Web Information Retrieval Technologies - Huang (2000)

Active bibliography (related documents):
0.0: The PageRank Citation Ranking: Bringing Order to the Web - Page, Brin, Motwani, Winograd (1998)
0.0: A Probabilistic Model for Optimal Searching of the Deep Web - Mukherjee
0.0: Effective Web Crawling - Chapter 2 - Castillo (2004)

Similar documents based on text:
0.2: Linguistic Search Engine - Adam
0.2: Note on Source of this Text - Great Deal Of
0.2: Breadth-First Search Crawling Yields High-Quality Pages - Najork, Wiener (2001)

Related documents from co-citation:
41: Authoritative sources in a hyperlinked environment - Kleinberg - 1997
29: Automatic resource compilation by analyzing hyperlink structure and associated t.. - Chakrabarti, Dom et al. - 1998
28: Improved algorithms for topic distillation in hyperlinked environments - Bharat, Henzinger - 1998

BibTeX entry:

Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual Web search engine. In Ashman and Thistlewaite [2], pages 107--117. Brisbane, Australia. http://citeseer.ist.psu.edu/brin98anatomy.html

@article{ brin98anatomy,
author = "Sergey Brin and Lawrence Page",
title = "The anatomy of a large-scale hypertextual {Web} search engine",
journal = "Computer Networks and ISDN Systems",
volume = "30",
number = "1--7",
pages = "107--117",
year = "1998",
url = "http://citeseer.ist.psu.edu/brin98anatomy.html" }
Citations (may not include all citations):
576 Authoritative Sources in a Hyperlinked Environment - Kleinberg - 1998
344 The PageRank Citation Ranking: Bringing Order to the Web - Page, Brin et al.
280 Managing Gigabytes: Compressing and Indexing Documents and I.. - Witten, Moffat et al. - 1994
88 The Effectiveness of GlOSS for the Text-Database Discovery P.. - Gravano, Garcia-Molina et al. - 1994
75 ParaSite: Mining Structural Information on the Web - Spertus - 1997
72 Finding What People Want: Experiences with the WebCrawler - Pinkerton - 1994
65 GENVL and WWWW: Tools for Taming the Web - McBryan - 1994
57 Queries and Computation on the Web - Abiteboul, Vianu - 1997
38 The Quest for Correct Information on the Web: Hyper Search E.. - Marchiori - 1997
21 Efficient Crawling Through URL Ordering - Cho, Garcia-Molina et al. - 1998
1 Publisher: Beacon - Bagdikian, Monopoly et al.
1 Publisher: Department of Commerce - the, REtrieval et al. - 1996



Documents on the same site (http://www-db.stanford.edu/pub/papers/):
Replicated Data Management in Mobile Environments.. - Barbará-Millá..
Extracting Semistructured Information from the Web - Hammer, Garcia-Molina, Cho, .. (1997)
U-PAI: A Universal Payment Application Interface, v 0.93 - Ketchpel, Garcia-Molina, .. (1996)

CiteSeer.IST - Copyright Penn State and NEC
