Required Reading
Background Reading/References
Resources
|
|
Note: A lot of material below is on
semi-structured data, which is only one component of this course as this
is not primarily a database course (and hence we will not cover small
subset of reading on this topics).
Search Engines
- Web Search
-
General:
World Wide Web Search Technologies
- Wen-Chen Hu
-
Example Search Engine:
Sergey Brin and Lawrence Page.
The
Anatomy of a Large-Scale Hypertextual Web Search Engine
-
Usage:
Introduction to Search Engines - Kansas
City Public Library
-
Crawling Technical:
High-Performance Web Crawling,
Najork, Marc ; Heydon, Allan, HP Report.(html)
-
General Technical:
"Searching the Web," Arvind Arasu, Junghoo Cho, Hector Garcia-Molina,
and Andreas Paepcke.
ACM Transactions on Internet Technology (TOIT),
1 (1), pp. 2-43, August 2001
-
Ranking:
Google: Page Rank,
Teoma: Subject-Specific Popularity
-
Example
Simple Web Crawler with
Code
- MetaSearch:
Technical
Tutorial (Weyi Meng),
- Specialized Search
-
"Web Mining : A Bird's Eye view" Tutorial at
WISE 2002.
Web Directories and
Categorization
Enterprise and B2B
Portals
Component technologies
Top
Background Reading Material
Books:
Semi-structured Data, XML, RDF & related
Technology
Readings on metadata
(content and applications)
Tutorials/Talks/Resources:
-
Search Engines
- Crawler
- Indexing
- Web Services
Tim
Finin,
An Overview and Underview
of the Semantic Web, October 2002
Other reading:
Related Courses:
Robert Meersman and Amit Sheth, SIGMOD Record Special Issue on "Semantic
Web, Database Management and Information Systems,"
December 2002.
Michael
Denny's Survey of
Ontology
Editing software
Russel Letson,
Taxonomies put Content in Context,
Transform, December 2001
Ontology in a Nutshell (cache), Fabien
Gandon's
2d Knowledge Management Summer School (INRIA).
Source
Taxonomies, thesauri, ontologies, and other systems
of knowledge organization
Terms defined
Ontology and Information Systems by Barry Smith
What is Ontology at
Ontolog.org
Magkanaraki et al
Ontology
Storage and Querying
Industry reports with substance:
Top
Where can you go to find more information:
Top
Top |