Semantic Integration: The Hawkeye Approach
Phillip C.-Y. Sheu
University of California, Irvine, California, USA
Search for more papers by this authorHeather Yu
Search for more papers by this authorC. V. Ramamoorthy
Search for more papers by this authorArvind K. Joshi
Search for more papers by this authorLotfi A. Zadeh
Search for more papers by this authorSummary
This chapter approaches to semantic integration and then present the Hawkeye knowledge base, in which it loaded more than 166 million facts from a diverse set of real - world data sources. In order to support Hawkeye, the chapter extended our DLDB knowledge base system with additional reasoning capabilities. DLDB is a system that, given sufficient OWL descriptions, can answer queries that span heterogeneous data sources. This chapter use the Hawkeye knowledge base to demonstrate realistic integration queries in e - government and academic scenarios. It presents a system that uses a declarative approach based on ontologies to integrate Web sources and provides a uniform view of the Web. The chapter focus on two main objectives. First, to evaluates the new reasoning capabilities of DLDB. Second, to demonstrate the ability to answer realistic queries in the semantic Web from distributed and heterogeneous data sources.
Controlled Vocabulary Terms
deductive databases; semantic Web
REFERENCES
- O. Etzioni and D. Weld, A softbot-based interface to the internet, Commun. ACM, 37 (7): 72–76, 1994.
- S. Soderland, Learning to extract text-based information from the world wide web, in KDD, 1997, pp. 251–254.
- G. Wiederhold, Mediators in the architecture of future information systems, IEEE Computer, 25 (3): 38–49, 1992.
-
Y. Arens, C. A. Knoblock, and W.-M. Shen, Query reformulation for dynamic information integration, J. Intell. Inform. Syst. Special Issue Intell. Inform. Integrat., 6 (2/3): 99–130, 1996.
10.1007/BF00122124 Google Scholar
-
T. Berners-Lee, J. Hendler, and O. Lassila, The Semantic Web, Sci. Am., May 2001.
10.1038/scientificamerican0501-34 Google Scholar
- M. K. Smith, C. Welty, and D. L. McGuinness, OWL web ontology language guide. Recommendation, available: http://www.w3.org/TR/owl-guide/, February 2004.
- F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P. F. Patel-Schneider (Eds.), The Description Logic Handbook: Theory, Implementation, and Applications, Cambridge University Press, Cambridge, MA, 2003.
- N. Choi, I.-Y. Song, and H. Han, A survey on ontology mapping, SIGMOD Rec., 35 (3): 34–41, 2006.
- M.-C. Wu and A. P. Buchmann, Research issues in data warehousing, in Datenbanksysteme in Buro, Technik und Wissenschaft, 1997, pp. 61–82.
-
A. Halevy, M. Franklin, and D. Maier, Principles of dataspace systems, in PODS'06: Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, ACM Press, New York, 2006, pp. 1–9.
10.1145/1142351.1142352 Google Scholar
- Z. Pan, A. Qasem, and J. Heflin, An investigation into the feasibility of the semantic web, in Proc. of Twenty First National Conference on Artificial Intelligence (AAAI 2006), 2006.
- Y. Papakonstantinou, H. Garcia-Molina, and J. Ullman, A query translation scheme for rapid implementation of wrappers, in Proceedings of the Conference on Deductive and Object-Oriented Databases (DOOD), Singapore, 1995.
- M. Roth and P. Schwarz, Don't scrap it, wrap it! A wrapper architecture for legacy data sources, in Proceedings of 23rd International Conference on Very Large Data Bases, 1997.
-
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. D. Ullman, V. Vassalos, and J. Widom, The TSIMMIS approach to mediation: Data models and languages, J. Intell. Inform. Syst., 8 (2): 117–132, 1997.
10.1023/A:1008683107812 Google Scholar
- C. A. Knoblock, S. Minton, J. L. Ambite, N. Ashish, P. J. Modi, I. Muslea, A. G. Philpot, and S. Tejada, Modeling web sources for information integration, in Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI - 98), AAAI/MIT Press, Menlo Park, CA, 1998, pp. 211–218.
- M. R. Genesereth, A. M. Keller, and O. Duschka, Infomaster: An information integration system, in Proceedings of 1997 ACM SIGMOD Conference, May 1997.
- N. Kushmerick, D. Weld, and R. Doorenbos, Wrapper induction for information extraction, in Proc. of Fifteenth International Joint Conference on Artificial Intelligence, Morgan Kaufmann, San Francisco, 1997, pp. 729–735.
- N. Ashish and C. Knoblock, Semi-automatic wrapper generation for internet information sources, in Proceedings of the Second IFCIS Conference on Cooperative Information Systems (CoopIS), Charleston, SC, 1997.
- A. Y. Levy, A. Rajaraman, and J. J. Ordille, Querying heterogeneous information sources using source descriptions, in 22nd International Conference on Very Large Data Bases, Bombay, September 1996.
- M. Friedman, A. Levy, and T. Millstein, Navigational plans for data integration, in Proceedings of the 16th National Conference on Artificial Intelligence, Springer - Verlag, Orlando, FL, 1999, pp. 67–73.
- C. Collet, M. Huhns, and W. Shen, Resource integration using a large knowledge base in Carnot, IEEE Computer, 24 (12), 1991.
- A. Farquhar, A. Dappert, R. Fikes, and W. Pratt, Integrating information sources using context logic, in Proceedings of AAAI Spring Symposium on Information Gathering from Distributed, Heterogeneous Environments, AAAI Press, Menlo Park, CA, pp. 47–51, 1995; Also available Technical Report KSL-95-12, KSL.
- V. Kashyap and A. Sheth, Semantic Similarities Between Objects in Multiple Databases, Morgan Kaufmann, San Francisco, 1999.
- J. Heflin and Z. Pan, A model theoretic semantics for ontology versioning, in Proc. of the 3rd International Semantic Web Conference, 2004, pp. 62–76.
- I. Horrocks and S. Tessaris, A conjunctive query language for description logic aboxes, in AAAI/IAAI, 2000, pp. 399–404.
- Z. Pan and J. Heflin, DLDB: Extending relational databases to support semantic web queries, in Proc. of the Workshop on Practical and Scaleable Semantic Web Systms, ISWC, 2003.
- V. Christophides, G. Karvounarakis, A. Magkanaraki, D. Plexousakis, V, Tannen, The ICS-FORTH Semantic Web Integration Middleware (SWIM). IEEE Data Eng. Bull. 26 (4): 11–18, 2003.
- Y. Theoharis, V. Christophides, and G. Karvounarakis, Benchmarking database representations of rdf/s stores, in Proc. of the 4th International Semantic Web Conference, 2005.
- E. Prud'hommeaux and A. Seaborne, SPARQL query language for rdf, Technical Report, available: http://www.w3.org/TR/rdf-sparql-query/, October 2006.
- G. Dong, L. Libkin, J. Su, and L. Wong, Maintaining transitive closure of graphs in SQL, Int. J. Inform. Technol., 5, 1999.
- H. F. Korth, E. Levy, and A. Silberschatz, A formal approach to recovery by compensating transactions, in Proceedings of the Sixteenth International Conference on Very Large Databases, Morgan Kaufmann, San Francisco, 1990, pp. 95–106.
-
P. Reuther and B. Walter, Survey on test collections and techniques for personal name matching, Int. J. Metadata Semantics Ontol., 1 (2): 89–99, 2006.
10.1504/IJMSO.2006.011006 Google Scholar
- K. Chang, B. He, C. Li, and Z. Zhang, Structured databases on the web: Observations and implications, SIGMOD Rec., 33 (3): 61–70, 2004.
- I. Tatarinov, Z. Ives, J. Madhavan, A. Halevy, D. Suciu, N. Dalvi, X. L. Dong, Y. Kadiyska, G. Miklau, and P. Mork, The Piazza peer data management project, in SIGMOD Rec., 2003.
- V. Christophides et al., The ics-forth semantic web integration middleware (swim), in Proceedings of the First International Workshop on Semantic Web and Databases 2003 (SWDB 2003), 2003, pp. 381–393.
- L. Ding, T. Finin, A. Joshi, Y. Peng, R. Pan, and P. Reddivari, Search on the semantic web, IEEE Computer, 10 (38): 62–69, 2005.
- P. Mika, Flink: Semantic web technology for the extraction and analysis of social networks, J. Web Semantics, 3 (2), 2005.
- M. Huhns, N. Jacobs, T. Ksiezyk, W. M. Shen, M. Singh, and P. Canata, Enterprise information modeling and model integration in carnot, in Proc. of the International Conference on Intelligent and Cooperative Information Systems, 1993, pp. 12–14.
- R. Guha, Tap: Towards the semantic web, Demo on World Wide Web 2002 Conference, available: http://tap.stanford.edu/www2002.ppt, 2002.
- J. Broekstra, M. Ehrig, P. Haase, F. Harmelen, M. Menken, P. Mika, B. Schnizler, and R. Siebes, Bibster — A semantics-based bibliographic peer-to-peer system, in Proceedings of the International Semantic Web Conference (ISWC2004), 2004.
-
M. C. Schraefel, N. R. Shadbolt, N. Gibbins, S. Harris, and H. Glaser, CS AKTive space: Representing computer science in the semantic web, in WWW'04: Proceedings of the 13th International Conference on World Wide Web, ACM Press, New York, 2004, pp. 384–392.
10.1145/988672.988724 Google Scholar
-
P. Bouquet, F. Giunchiglia, F. van Harmelen, L. Serafini, and H. Stuckenschmidt, C-OWL: Contextualizing ontologies, in Proc. of the 2003 Intl. Semantic Web Conf. (ISWC 2003), LNCS 2870, Springer, 2003, pp. 164–179.
10.1007/978-3-540-39718-2_11 Google Scholar
- U. Hustadt, B. Motik, and U. Sattler, Reducing shiq description logic to disjunctive datalog programs, in Proc. of the 9th International Conference on Knowledge Representation and Reasoning, 2004, pp. 152–162.