Extracting knowledge from the World Wide Web
Henzinger M, Lawrence S. 2004. Extracting knowledge from the World Wide Web. Proceedings of the National Academy of Sciences. 101(suppl_1), 5186–5191.
Download (ext.)
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC387294/
[Published Version]
Journal Article
| Published
| English
Scopus indexed
Author
Henzinger, MonikaISTA ;
Lawrence, Steve
Abstract
The World Wide Web provides a unprecedented opportunity to automatically analyze a large sample of interests and activity in the world. We discuss methods for extracting knowledge from the web by randomly sampling and analyzing hosts and pages, and by analyzing the link structure of the web and how links accumulate over time. A variety of interesting and valuable information can be extracted, such as the distribution of web pages over domains, the distribution of interest in different areas, communities related to different topics, the nature of competition in different categories of sites, and the degree of communication between different communities or countries.
Publishing Year
Date Published
2004-04-06
Journal Title
Proceedings of the National Academy of Sciences
Publisher
Proceedings of the National Academy of Sciences
Volume
101
Issue
suppl_1
Page
5186-5191
ISSN
eISSN
IST-REx-ID
Cite this
Henzinger M, Lawrence S. Extracting knowledge from the World Wide Web. Proceedings of the National Academy of Sciences. 2004;101(suppl_1):5186-5191. doi:10.1073/pnas.0307528100
Henzinger, M., & Lawrence, S. (2004). Extracting knowledge from the World Wide Web. Proceedings of the National Academy of Sciences. Proceedings of the National Academy of Sciences. https://doi.org/10.1073/pnas.0307528100
Henzinger, Monika, and Steve Lawrence. “Extracting Knowledge from the World Wide Web.” Proceedings of the National Academy of Sciences. Proceedings of the National Academy of Sciences, 2004. https://doi.org/10.1073/pnas.0307528100.
M. Henzinger and S. Lawrence, “Extracting knowledge from the World Wide Web,” Proceedings of the National Academy of Sciences, vol. 101, no. suppl_1. Proceedings of the National Academy of Sciences, pp. 5186–5191, 2004.
Henzinger M, Lawrence S. 2004. Extracting knowledge from the World Wide Web. Proceedings of the National Academy of Sciences. 101(suppl_1), 5186–5191.
Henzinger, Monika, and Steve Lawrence. “Extracting Knowledge from the World Wide Web.” Proceedings of the National Academy of Sciences, vol. 101, no. suppl_1, Proceedings of the National Academy of Sciences, 2004, pp. 5186–91, doi:10.1073/pnas.0307528100.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Open Access
Export
Marked PublicationsOpen Data ISTA Research Explorer
Sources
PMID: 14745041
PubMed | Europe PMC