“闭关”阅读三
(续上)这个领域如果不阅读国外文献那就简直是瞎子。以下是关于“互操作”的英文文献(不包括metasearch和OAI)。
Id:00877487.pdf
Title:语义Web:XML和RDF的作用
Title:The Semantic Web: The Roles of XML and RDF
Creator:
STEFAN DECKER AND SERGEY MELNIK
Stanford University
FRANK VAN HARMELEN, DIETER FENSEL, AND MICHEL KLEIN
Vrije Universiteit Amsterdam
JEEN BROEKSTRA
Aidministrator Nederland B.V.
MICHAEL ERDMANN
University of Karlsruhe
IAN HORROCKS
University of Manchester
Source:IEEE INTERNET COMPUTING 1089-7801/00/$10.00 ©2000 IEEE http://computer.org/internet/ SEPTEMBER • OCTOBER 2000
Abstract:XML and RDF are the current standards for establishing semantic interoperability on the Web, but XML addresses only document structure. RDF better facilitates interoperation because it provides a data model that can be extended to address sophisticated ontology representation techniques.
Comment:介绍语义Web及其组成的经典之作,深入浅出,虽然发表于2000年,仍不失未很好的参考资料。
Tag:语义Web XML RDF Ontology
Id:00996702.pdf
Title:语义Web与WebGIS的互操作
Title:Semantic and Interoperable WebGIS
Creator:Yi Shanzhen, Zhou Lizhu, Xing Chunxiao, Liu Qilun, Zhang Yong
Date:2002
Tag:Ontology WebGIS XML RDF
Id:01006300.pdf
Title:OILing the Way to Machine Understandable Bioinformatics Resources
Creator:Robert Stevens, Carole Goble, Ian Horrocks, and Sean Bechhofer
Abstract:The complex questions and analyses posed by biologists, as well as the diverse data resources they develop, require the fusion of evidence from different, independently developed, and heterogeneous resources. The web, as an enabler for interoperability, has been an excellent mechanism for data publication and transportation. Successful exchange and integration of information, however, depends on a shared language for communication (a terminology) and a shared understanding of what the data means (an ontology). Without this kind of understanding, semantic heterogeneity remains a problem for both humans and machines. One means of dealing with heterogeneity in bioinformatics resources is through terminology founded upon an ontology. Bioinformatics resources tend to be rich in human readable and understandable annotation, with each resource using its own terminology. These resources are machine readable, but not machine understandable. Ontologies have a role in increasing this machine understanding, reducing the semantic heterogeneity between resources and thus promoting the flexible and reliable interoperation of bioinformatics resources. This paper describes a solution derived from the semantic web [a machine understandable world-wide web (WWW)], the ontology inference layer (OIL), as a solution for semantic bioinformatics resources. The nature of the heterogeneity problems are presented along with a description of how metadata from domain ontologies can be used to alleviate this problem. A companion paper in this issue gives an example of the development of a bio-ontology using OIL.
Tag:Bioinformatics Ontology
Id:01020330.pdf
Title:Web中语义内容的管理
Title:Managing Semantic Content for the Web
Creator:Amit Sheth, Clemens Bertram,David Avant, Brian Hammond, Krysztof Kochut, and Yashodhan Warke
Affiliation:Voquette and the University of Georgia
Source:IEEE INTERNET COMPUTING JULY • AUGUST 2002
Comment:一篇综述性的深度报道,与Id:00877487.pdf性质相同,深入浅出,值得仔细阅读。
Tag:语义web
Id:01046976.pdf
Title:学习和建立Web本体的整合方法
Title:Integrated Approach to Web Ontology Learning and Engineering
Creator:Michele Missikoff IASI-CNR Roberto Navigli Paola Velardi University of Rome
Abstract:The authors have built a software environment that supports the construction and assessment of a domain ontology for intelligent information integration within a virtual user community.
Source:Computer November 2002
Comment:专栏文章
Id:01174824.pdf
Title:固定收入保险业(?)的语义互操作:一个基于Web的动态信息集成知识表示结构
Title:Semantic Interoperability in the Fixed Income Securities Industry: A Knowledge Representation Architecture for Dynamic Integration of Web-Based Information
Abstract:We examine a knowledge representation architecture to support context interchange mediation. For autonomous receivers and sources sharing a common subject domain, the mediator’s reasoning engine can devise query plans integrating multiple sources and resolving semantic heterogeneity. Receiver applications obtain the data they need in the form they need it without imposing changes on sources. The KR architecture includes: 1) data models for each source and receiver, 2) subject domain ontologies, containing abstract subject matter conceptualizations that would be known to experienced practitioners in the industry, and 3) context models for each source and receiver that explain how each source or receiver data model implements the abstract concepts from a subject domain ontology. Examples drawn from the fixed income securities industry illustrate problems and solutions enabled by the proposed architecture.
Source:Proceedings of the 36th Hawaii International Conference on System Sciences
Id:01234773.pdf
Title:基于本体的普适计算
Title:Ontology-Enabled Pervasive Computing Applications
Creator:
Ryusuke Masuoka and Yannis Labrou, Fujitsu Laboratories of America
Bijan Parsia and Evren Sirin, MIND Lab, University of Maryland
Source:IEEE INTELLIGENT SYSTEMS 2003
Comments:本文属于IEEE Intelligent Systems “Smentic Web”栏目的专栏文章,这个栏目由马里兰大学的James Hendler主持,内容精深。
Id:01241177.pdf
Title:Semantic Web Complex Ontology Mapping
Title:语义Web复杂本体映射
Creator:Nuno Silva, Jiao Rocha
Source:Proceedings of IEEE WI03
Comment:此类研究非常多,这篇2003年的会议论文,好坏无法判断。
Id:01254491.pdf
Title:语义Web服务的翻译和映射的各种实现
Title:The Many Faces of Mapping and Translation for Semantic Web Services
Creator:Mark H. Burstein
Abstract:Semantic web services hold the promise of greatly increasing interoperability among software agents and web services by enabling content-based (as opposed to format-based) automated service discovery and interaction. However, as different services may well use different, only partly compatible ontologies to describe their capabilities, some amount of ontology mapping or translation will be required during the various stages of service discovery and utilization. In this paper, we reexamine some of the processing assumptions that were made in the development of semantic web service models like DAML-S in order to uncover the very different roles of semantic translation in the subprocesses of service discovery, service process model interpretation, task negotiation, service invocation and response interpretation. We present examples and arguments showing how several different styles of translation will be required based on the informational context and perspectives of agents in each of these stages
Source:Proceedings of IEEE WISE03
Comment:本篇是较好的关于SWS的总结。作者在这个领域较为资深。
Id:01333037.pdf
Title:采用陌生本体的语义Web服务的动态激活
Title:Dynamic Invocation of Semantic Web Services That Use Unfamiliar Ontologies
Creator:Mark H. Burstein
Abstract:The Semantic Web allows different ontologies to be used for describing different Semantic Web Services. Invoking an unfamiliar service can require several types of translation, each of which may be done by different agents.
Source:IEEE INTELLIGENT SYSTEMS JULY/AUGUST 2004
Comment:IEEE Intelligent Systems专栏文章
Id:01333572.pdf
Title:Semantic Interoperability of Field-based Thematic Geographic Information
Creator:Toni Navarrete etc.
Abstract:A model for semantic interoperability among a repository of geographic datasets from different providers is proposed in this article. Specifically, this approach focuses on qualified field-based geographic information. Our solution is based on an ontology describing geographic themes. We present a method to build up this ontology from the data schemas of each dataset in the repository. We have developed a tool, which is currently being evaluated by users, to support this process.
Id:01336318.pdf
Title:Approximate Reasoning and Semantic Web Services
Creator:Marek Reformat etc.
Abstract:The initiative of representing Web Services in a machine-understandable way creates a new era for software agent interoperability. A recent introduction of a concept of the Semantic Web makes it possible to automatically locate, discover, composite, and execute the services. A user agent works on hehalf of its owner, and knows user’s personal preferences. As there might be many service providers on the web, finding the best one which matches user’s needs is critical for a suitable performance of the agent. In the paper, fuzziness and approximate reasoning methodology are applied within the Semantic Web environment. The proposed approach aims at providing capability to mimic human behavior
in the case of a multi-criteria decision making process. Ontology with fuzziness is used to represent human needs and preferences. This ontology contains also information about different acceptance levels which user may have depending on responses obtained from different service providers. A prototype Semantic Web Service representing hotel reservation is built using the approach proposed.
Id:01348153.pdf
Title:语义Web的导航模型:一种基于本体的方法
Title:Navigational Modeling and the Semantic Web. An Ontology based Approach.
Creator:Victoria Torres, Joan Fons, Vicente Pelechano and Oscar Pastor
Abstract:Current Web Engineering methods develop “closed” web applications. This fact makes difficult the integration and the interoperability of different web applications. Semantic web languages provide an appropriate framework to achieve these non-functional requirements. Ontologies are proliferating to enable interoperability between Internet-connected applications. This work takes advantage from the navigational model enriching the web implementations with all the knowledge gathered during the modeling and design process. Our approach provides a semantic representation of web applications in the form of a navigational ontology that can be queried through the use of a semantic query language.
Comment:值得一读
Id:01423484.pdf
Title:基于OWL的语义互操作方法
Title:OWL-Based Approach for Semantic Interoperability
Creator:Seksun Suwanmanee, Djamal Benslimane, Philippe Thiran
Abstract:The number of web-based information systems has been increasing since Internet became the global open network accessible for all. The recent Semantic Web that provides supplementary meaningful information (meta-data) about Web resources facilitates automatic processing of machines and interoperability between different systems. In this paper, we focus on an integration of heterogeneous data sources in the Semantic Web context using a semantic mediation approach based on ontology. We use the ontology description language OWL to formalize ontologies of different resources and to describe their relations and correspondences allowing the semantic interoperability between them. We propose an architecture adopting mediator-wrapper approach for a mediator based on OWL. Some illustrations of semantic mediation using OWL are also presented in the paper.
Source:Proceedings of the 19th International Conference on Advanced Information Networking and Applications (AINA’05)
Comment:值得一读
Id:01425253.pdf
Title:Semantic Grid - Interoperability Solution for Construction VO?
Creator:Žiga Turk, Matevž Dolenc, Vlado Stankovski and Etiel Petrinja
Source:Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC’05)
Tag:虚拟组织 VirtualOrganization 语义网格
Comment:a short paper
Id:01432705.pdf
Title:应用自动服务组合的局域本体的实现
Title:Experimentation with Local Consensus Ontologies with Implications for Automated Service Composition
Creator:Andrew B. Williams, Member, IEEE, Anand Padmanabhan, Student Member, IEEE, and
M. Brian Blake, Senior Member, IEEE
Abstract:Agent technologies represent a promising approach for the integration of interorganizational capabilities across distributed, networked environments. However, knowledge sharing interoperability problems can arise when agents incorporating differing ontologies try to synchronize their internal information. Moreover, in practice, agents may not have a common or global consensus ontology that will facilitate knowledge sharing and integration of functional capabilities. We propose a method to enable agents to develop a local consensus ontology during operation time as needed. By identifying similarities in the ontologies of their peer agents, a set of agents can discover new concepts/relations and integrate them into a local consensus ontology on demand. We evaluate this method, both syntactically and semantically, when forming local consensus ontologies with and without the use of a lexical database. We also report on the effects when several factors, such as the similarity measure, the relation search level depth, and the merge order, are varied. Finally, experimenting in the domain of agent-supported Web service composition, we demonstrate how our method allows us to successfully autonomously form service-oriented local consensus ontologies.
Source:IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 17, NO. 7, JULY 2005
Id:04391.AshishNaveen.Slides.ppt
Id:04391.AshishNaveen1.Paper!.pdf
Title:NASA and The Semantic Web
Creator:Naveen Ashish
Id:04391.BursteinMark.Slides.ppt
Title:Relationship between Semantic Web Services and Semantic Interoperability
Creator:Mark H. Burstein
Format:ppt演示文件
Id:04391.ChristophidesVassilis.Paper!.pdf
Id:04391.ChristophidesVassilis.Slides.pdf
Title:Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)
Id:04391.DennoPeter.Slides.ppt
Title:Standards: How far can they take us?
Creator:Peter Denno,Jerome Euzenat
Format:ppt演示文件
Comment:关于语义Web的各类标准,值得一读
Id:04391.DoerrMartin1.Slides1.ppt
Title:Mapping Typology
Creator:Doerr Martin
Format:pdf演示文件
Comment:关于映射类型的简要介绍
Id:04391.DoerrMartin.ExtAbstract!.pdf
Id:04391.DoerrMartin.Slides.ppt
Title:模式异构的本体解决一例
Title:The CIDOC CRM, an Ontological Approach to Schema Heterogeneity
Creator:Doerr Martin
Id:04391.FininTimothy1.Paper.pdf
Title:Swoogle: A Semantic Web Search and Metadata Engine
Creator:Li Ding, Tim Finin, Anupam Joshi, Yun Peng, R. Scott Cost, Joel Sachs, Rong Pan, Pavan Reddivari, Vishal Doshi
Abstract:Swoogle is a crawler-based indexing and retrieval system for the Semantic Web, i.e., for Web documents in RDF or OWL. It extracts metadata for each discovered document, and computes relations between documents. Discovered documents are also indexed by an information retrieval system which can use either character N-Gram or URIrefs as keywords to ¯nd relevant documents and to compute the similarity among a set of documents. One of the interesting properties we compute is rank, a measure of the importance of a Semantic Web document.
Comment:语义Web的搜索引擎体系架构模型,反映了许多问题和与传统搜索引擎的不同之处,值得参考
Id:04391.FininTimothy2.Slides.ppt
Title:Web Services for Semantic Interoperability and Integration
Creator:Tim Finin
Format:ppt演示文件
Date:20 September 2004
Comment:简洁扼要务实,作者就职于马里兰大学,James Hendler同事
Id:04391.GiunchigliaFausto.Slides.ppt
Id:04391.GiunchigliaFausto1.Paper!.pdf
Title:Semantic Matching
Creator:Fausto Giunchiglia
Date:September 2004
Comment:文章和演示都有,算法设计,比较技术性,思想可以借鉴参考。
Id:04391.GruningerMichael.Slides.pdf
Id:04391.GruningerMichael3.Paper!.pdf
Title:PSL: An Industrially Motivated and Rigorously Formal Approach to Semantic Integration
Creator:Michael Gruninger, Chris Menzel
Comment:语义整合的算法实现
Id:04391.KentRobert.Paper!.pdf
Id:04391.KentRobert.Slides.ppt
Title:Semantic Integration in the Information Flow Framework
Creator:Robert E. Kent
Abstract:The Information Flow Framework (IFF) [1] is a descriptive category metatheory currently under development, which is being offered as the structural aspect of the Standard Upper Ontology (SUO). The architecture of the IFF is composed of metalevels, namespaces and meta-ontologies. The main application of the IFF is institutional: the notion of institutions and their morphisms are being axiomatized in the upper metalevels of the IFF, and the lower metalevel of the IFF has axiomatized various institutions in which semantic integration has a natural expression as the colimit of theories. Some of the ideas used in this paper first appeared in papers by Joseph Goguen [2] and the author [3], and discussions on the SUO email list. See also the companion paper [4].
Comment:a must-read paper
Id:04391.MenzelChris1.Paper!.pdf
Title:Basic Semantic Integration
Creator:Christopher Menzel
Abstract:The use of highly abstract mathematical frameworks is essential for building the sort of theoretical foundation for semantic integration needed to bring it to the level of a genuine engineering discipline. At the same time, much of the work that has been done by means of these frameworks assumes a certain amount of background knowledge in mathematics that a lot of people working in ontology, even at a fairly high theoretical level, lack. The major purpose of this short paper is provide a (comparatively) simple model of semantic integration that remains within the friendlier confines of first-order languages and their usual classical semantics and logic.
Id:04391.MossakowskiTill.Paper!.pdf
Id:04391.MossakowskiTill.Slides.pdf
Title:Heterogeneous Theories and the Heterogeneous Tool Set
Creator:Till Mossakowski,
Date:22.09.2004
Comment:异构理论?是不是很吸引人?
Id:04391.SchorlemmerMarco.ExtAbstract!.pdf
Id:04391.SchorlemmerMarco.Slides.pdf
Title:论语义互操作和整合的数学基础
Title:On the Mathematical Foundations of Semantic Interoperability and Integration
Creator:Marco Schorlemmer
Comment:一篇牛文,就是写得太简单了
Id:04391.ShethAmit1.Paper!.pdf
Id:04391.ShethAmit.Slides.pdf
Title:From Semantic Search & Integration to Analytics
Creator:Amit Sheth
Abstract: Semantics is seen as the key ingredient in the next phase of the Web infrastructure as well as the next generation of enterprise content management. Ontology is the centerpiece of the most prevalent semantic technologies and provides the basis of representing, acquiring, and utilizing knowledge. With the availability of several commercial products and many research tools, specifications and increasing adoption of Semantic Web standards such as RDF for metadata and OWL for ontology representation, ontology-driven techniques and systems have already enabled a new generation of industry strength semantic applications. In particular, Semagix’s Freedom has powered applications in leading verticals such as, financial services, government & intelligence, pharmaceuticals, and media & entertainment. In this paper, we portray some of the requirements of high-end enterprise applications requiring search to integration, and more advanced analytical capabilities, discuss the enterprise scale capabilities expected of a semantic technology, and how Semagix has put an ontology-driven approach to use.
Comment:演示文件的题名不同,但内容大体一致
Id:04391.SintekMichael.Slides.pdf
Title:Using TRIPLE Views for Semantic Interoperability and Integration
Creator:Michael Sintek
Id:04391.SintekMichael1.Paper.pdf
Title:本体映射问题的归一和整合
Title:Generating and Integrating Evidence for Ontology Mappings
Creator:Ludger van Elst and Malte Kiesel
Abstract:For more than a decade, ontologies have been proposed as a means to enable sharing and reuse of knowledge. While originally relatively narrow information landscapes have been in mind (e.g., knowledge sharing between a few expert systems) the application areas proposed nowadays (e.g., organizational knowledge management or the Semantic Web) are rather broad and open. From abstract considerations about the distributed nature of knowledge as well as from observation of actual (human) ontology negotiation processes it seems clear that globally agreed-upon conceptualizations are probably not obtainable. Therefore, ontology matching and mapping procedures play an essential role on more open information landscapes.
In this paper, we present a framework that collects and integrates heuristic evidence for ontology mappings, allows a knowledge engineer to browse a space of (assessed) mapping candidates in order to select adequate candidates and then leverage them to a level of formal statements for ontology merging. A simple example session shows the intended handling of the prototype and demonstrates strengths and weaknesses of particular sources of matching evidence.
Id:04391.SintekMichael2.Paper!.pdf
Title:利用TRIPLE视图对Web资源进行语义查询
Title:Querying Semantic Web Resources Using TRIPLE Views
Creator:Zoltan Miklos1, Gustaf Neumann1, Uwe Zdun, and Michael Sintek
Abstract. Resources on the Semantic Web are described by metadata related to some formal or informal ontology. It is a common situation that a casual user does not know domain ontology in detail. This makes it difcult to formulate queries in this ontology to ¯nd the relevant resources. Users consider the resources in their speci¯c context, so the most straight-forward solution is to formulate queries in an ontology that corresponds to a user-speci¯c view. We present an approach based on multiple views, expressed in simple ontologies. This allows a user to query heterogeneous data repositories in terms of multiple, relatively simple view ontologies. We present how ontology developers can de¯ne such views on ontologies and the corresponding mapping rules. These ontologies are represented in Semantic Web ontology languages, like RDFS, DAML+OIL or OWL. We present our approach with examples from the e-learning domain us- ing the Semantic Web query and transformation language TRIPLE.
Id:04391.SoergelDagobert.ExtAbstract.ppt
Id:04391.SoergelDagobert1.Other.pdf
Title:Semantic mapping/integration tools
Creator:Dagobert Soergel
Comment:东西太简单,就几条大纲,启发思路而已。
Id:04391.StuckenschmidtHeiner1.Paper!.pdf
Title:Ontology Alignment: An annotated Bibliography
Creator:Natasha Noy1, Heiner Stuckenschmidt
Id:04391.StuckenschmidtHeiner1.Slides.ppt
Title:Ontologies: Mapping, Translation, Merging
Creator:Natasha Noy1, Heiner Stuckenschmidt
Comment:关于本体处理的文献汇总和指南,非常好的整理。
Id:04391.StummeGerd.Paper!.pdf
Id:04391.StummeGerd.Slides.pdf
Title:Ontology Merging with Formal Concept Analysis
Creator:Gerd Stumme
Id:04391.SWM.Other.pdf
Comment:Semantic Web会议日常安排
Id:04391.SWM1.Paper.doc
Comment:会议目的分类,以及与会代表的各种观点综述。
Id:04391.SWM4.Paper!.pdf
Title:Semantic Interoperability and Integration, Dagstuhl Seminar 04391, September 19-24, 2004
Executive Summary
Creator:Y. Kalfoglou, M. Schorlemmer, M. Uschold, A. Sheth, and S. Staab
Comment:极好的会议观点综述,参加者都是这个领域的大家,专注于语义互操作与整合的主题。五星级资料!!!
Id:04391.SWM5.ExtAbstract!.pdf
Title:Architectures for Semantic Integration
Creator:Michael Uschold, Michael Gruninger
Abstract:Introduction: One of the goals of this workshop was to begin to lay the foundations for a comprehensive framework for understanding and classifying different problems, approaches and techniques in the field of semantic integration and interoperability. Another way to look at this, is to create a map of the field. In this short paper, we give an example of what a region on such a map might look like. We consider the area of architectures for semantic integration. We followed the following steps:
1. Identify various approaches, e.g. by conducting a literature search
2. Identify the similarities and differences between the different approaches
3. Identify specific issues, or dimensions of variation that are the basis for characterizing the above differences. These will be used to classify the different approaches
4. Identify key questions for each dimension
Comment:must read
Id:04391.SWM6.ExtAbstract!.pdf
Title:Infrastructure for Semantic Interoperability and Integration: Breakout Discussion Summary
Creator:Mark Burstein, Mike Uschold
Id:04391.SWM7.ExtAbstract!.pdf
Title:Representation of Semantic Mappings: Results from the Breakout Session
Creator:Heiner Stuckenschmidt, Mike Uschold
Id:04391.UscholdMike1.Slides.ppt
Title:Semantic Interoperability and Integration
Creator:Mike Uschold
Affiliation:Boeing
Format:ppt演示文件
Comment:主持会议演示稿
Id:04391.VassalosVasilis1.ExtAbstract.pdf
Id:04391.VassalosVasilis1.Other!.pdf
Id:04391.VassalosVasilis2.ExtAbstract!.pdf
Title:Data and Web Services Integration: Where are the semantics?
Creator:Vasilis Vassalos
Abstract:We summarize the discussions of the breakout session on Problem Sharing, held during the Dagstuhl workshop on Semantic Integration and Interoperability. The breakout session brought together people from di®erent communities (databases, AI planning, formal logic, knowledge representation) to share exciting and challenging problems in each community in a common language
Comments:三个文档中有两个是演示,一篇加长文摘(论文),讨论一个主题。
Id:127fileto.pdf
Title:Issues on Interoperability and Integration of Heterogeneous Geographical Data
Creator:RENATO FILETO
Abstract:The interoperability of information systems has been pursued for a long time by researchers and practitioners. It involves the exchange of information among different systems, and requires agreement on formats and application domain concepts. Interoperability may also encompass commonality of user interaction and system behavior. One of the interoperability problems which has been investigated for over 20 years on different contexts is that of integration of heterogeneous data. However, even this fundamental and old problem is very hard to solve. The data integration problem may be considered from many perspectives and on increasing levels of complexity or abstraction. This paper reviews the literature about interoperability in general, and integration of heterogeneous geographical data in particular, presents several facets of the data integration problem, and some approaches to deal with them.
Comment:似曾相似的题目,地理信息的互操作似乎大有文章可做,但是与一般信息的互操作又有较大不同。是否可以借鉴?如何借鉴?还是就作为独立领域应用?
Id:baldonado97metadata.pdf
Id:baldonado97stanford.pdf
Title:Metadata for Digital libraries: Architecture and Design Rationale
Creator:Michelle Baldonado, Chen Chuan K. Chang, Luis Gravano
Comment:这是一片早期的数字图书馆元数据结构设计文章,与K/W结构一样,属于比较经典的数字图书馆技术文章,以前还看不太懂,虽然没有大用,但是对于数图技术架构的把握还是很有必要的。Cf.: Id:original-articles.pdf
Id:contentCreation.pdf
Title:Annotation of Heterogeneous Database Content for the Semantic Web
Creator:Eero Hyvonen, Mirva Salminen, and Miikka Junnila
Abstract:This paper discusses the problem of annotating semantically interlinked data that is distributed in heterogeneous databases. The proposed solution is a semi-automatic process that enables annotation of database contents with shared ontologies with little adaptation and human intervention. A technical solution to the problem based on semantic web technologies is proposed and its demonstrational implementation is discussed. The process has been applied in creating the content for the semantic portal MUSEUMFINLAND, a deployed Semantic Web application.
Id:Cruz_Position.pdf
Title:Ontology Alignment for the Semantic Integration of Heterogeneous Geospatial Data Sets
Creator:Isabel F. Cruz
Comment:a short paper
Id:cruz-ideas2004.pdf
Title:An Ontology-based Framework for XML Semantic Integration
Creator:Isabel F. Cruz, Huiyong Xiao, Feihong Hsu
Abstract:XML is becoming the standard for data interchange on the web. However, XML and its schema languages do not express semantics but rather structure, such as nesting information. Therefore, semantically equivalent documents often present different document structures. In this paper, we provide an ontology-based framework that aims to make two XML documents interoperate at the semantic level while retaining their nesting structure. In our global-as-view approach, we generate an RDF ontology for each of the participating XML documents, which preserves the nesting structure of the document. An RDF global ontology is the result of merging the individual ontologies. The global ontology unifies the query access and establishes semantic connections among the underlying individual databases. We consider two types of queries: those posed on the global ontology and those that are posed to any of the XML documents, in a P2P fashion. The former type of query is processed using query translation from an RDF query to an XML query. The latter type of query entails bidirectional query processing: the translation from an XML query to an RDF query followed by the translation from an RDF query to an XML query. To ensure the correctness of the answer to the query in the latter case, we introduce the concept of reversibility of the query translation.
Id:flexible-interoperability-in-a.pdf
Title:Flexible Interoperability in a Federated Digital Library of Theses and Dissertations
Creator:Marcos André Gonçalves, Robert K. France, Edward A. Fox, Eberhard R. Hilf†, Michael Hohlfeld, Kerstin Zimmermann†, Thomas Severiens
Abstract:Federated digital libraries are composed of autonomous, possibly heterogeneous information services distributed across the Internet. Federation provides users with a seamless, integrated view of the collected information. We are creating a federated system for the Networked Digital Library of Theses and Dissertations (NDLTD), an international consortium of universities, libraries, and other supporting institutions focused on electronic theses and dissertations (ETDs). The NDLTD allows its members minimal restrictions and maximal autonomy, so federating requires dealing flexibly with differences among ontologies, data formats, and finding aids involving several thousand ETDs in four formats and two languages. Our solution involves adapting MARIAN, an object-oriented digital library system, to serve as mediation middleware for the federated NDLTD. Components of the solution include: 1) the use of several harvesting techniques; 2) an architecture based on object-oriented ontologies of searchers and metadata; 3) diversity within the harvested data joined to a single collection view for the user; and 4) an integrated framework for addressing such issues as data quality, information compression, and flexible search. The system can handle very large dynamic collections. It can add new sites and adapt to changes in existing sites. MARIAN’s modular architecture and powerful and flexible data model work together to build an effective integrated solution within a simple uniform framework.
Comment:数字图书馆经典论文选读里面也要包括着一篇。
Id:fox97networked.pdf
Title:Networked Digital Library of Theses and Dissertations: A Framework for East-West Collaboration
Creator:Edward A. Fox
Comment:DLI的项目介绍,参加98年香港“首届亚洲数字图书馆”会议的论文。内容老了点,如果不是想了解数字图书馆历史的,可以不看。
Id:geog5563_lec5.pdf
Title:Semantics and Ontology
Format:pdf演示文件
Comment:一堂课的课件,作者佚名,2002年的内容,值得了解。
Id:gertz.pdf
Title:Achieving Semantic Interoperability Through Controlled Annotations: Position Paper
Creator:Michael Gertz
Comment:似乎是主题词表规范控制方法,只有3页内容。
Id:godby-dc2003.pdf
Title:Two Paths to Interoperable Metadata
Creator:Carol Jean Godby, Devon Smith, and Eric Childress
Abstract:This paper describes a prototype for a Web service that translates between pairs of metadata schemas. Despite a current trend toward encoding in XML and XSLT, we present arguments for a design that features a more distinct separation of syntax from semantics. The result is a system that auomates routine processes, has a well-defined place for human input, and achieves a clean separation of the document data model, the document translations, and the machinery of the application.
Comment:DCMI对于元数据互操作的想法,DC2003会议文献。DC的元数据登记注册体系就是按照这套思路做的,但是设想的功能还有许多没实现。A must read paper.
Id:ijcis-on.pdf
Title:An Ontology for Semantic Integration of Life Science Web Databases
Creator:Zina Ben Miled, Yue W. Webster, Yang Liu
Abstract:The incompatibilities among complex data formats and various schema used by biological databases that house these data are becoming a bottleneck in biological research. For example, biological data format varies from simple words (e.g., gene name), numbers (e.g., molecular weight) to sequence strings (e.g., nucleic acid sequence), to even more complex data formats such as taxonomy trees. Some information is embedded in narrative text, such as expert comments and publications. Some other information is expressed as graphs or images (e.g., pathways networks). The confederation of heterogeneous web databases has become a crucial issue in today’s biological research. In other words, interoperability has to be archieved among the biological web databases and the heterogeneity of the web databases has to be resolved. This paper presents a biological ontology, BAO, and discusses its advantages in supporting the semantic integration of biological web databases are discussed.
Comment:语义技术的又一个应用大户:生命科学领域(前一个是地理信息领域)。本文写得很综合,值得参考。
Id:integrating-bibliographical-data-from.pdf
Title:Integrating Bibliographical Data from Heterogeneous Digital Libraries ?
Creator:Eike Schallehn, Martin Endig, and Kai-Uwe Sattler
Abstract:The integration of bibliographical data today is considered one of the most important tasks in the area of digital libraries. Various available sources of bibliographical information vary widely in terms of data representation and access interfaces. To overcome this heterogeneity during the last years attempts were made to apply methods developed for information system integration, like federated databases and mediators. In this paper we describe our approach using the loosely coupled federated system FRAQL. Furthermore, we present a generic
adapter that can be used in highly distributed scenarios which uses XML and related technology for transfer and homogenization of data. As an application scenario we describe global citation linking for integrated digital libraries.
Comment:这篇大概2000年或01年的论文,写得还是相当不错的,前不久给图书馆杂志审稿,有一篇同样内容的文章,作者显然没有看过本文,水平起码倒退15年。
Id:knowledge-management-for-database.pdf
Title:Knowledge Management for Database Interoperability
Creator:Naphtali D. Rishe1, Rukshan I. Athauda,, Jun Yuan1, Shu-Ching Chen
Abstract:The availability of multiple heterogeneous, autonomous, distributed data sources containing related information has created a need for integrated access to these information systems. Heterogeneous/multi-database systems address this issue when the component data sources are database systems. Resolution of heterogeneities for integrated access requires discovering and managing certain types of knowledge/facts. A generally accepted methodology or approach for managing this knowledge and information is lacking in research and industry. In this paper, we provide a framework for managing knowledge for interoperable access to heterogeneous database systems. The
framework uses knowledge bases at the integration and component sites. Sample schemas of these knowledge bases are presented. A multi-database prototype system utilizing the techniques presented in this paper is being developed.
Comment:也是相当早的一篇研究论文,大约2000或01年,好像陈树新有参与。只有4页。
Id:lima01digital.pdf
Title:Digital Library Services Supporting Information Integration over the Web
Creator:Tarcisio Lima1,, Amit Sheth, Naveen Ashish, Mukesh Guntamadugu, Sriram Lakshminarayan, Narayanan Palsena, and Dilpreet Singh
Abstract:Our research and development activities in digital libraries raised relevant features in supporting Web information integration. Underlain by an in house multi-agent based architecture, the main achievements so far have been prototyped as services: (a) various semantic interoperability niches, by the use of inter-ontological relationships built onto iscapes (a means of specifying information requests using embedded context sensitive information); (b) integrated access to information, by automating metabase (a database of metadata) creation; (c) a framework for creating iscapes and metadata modeling; and (d) information processing, by query planning and cost modeling of Web sources. A real-world application scenario illustrates how geographical and environmental Web-based information systems can benefit from appropriating these facilities.
Comment:8 papers, well organized, should-read paper
Id:marian-flexible-interoperability-for.pdf
Title:MARIAN: Flexible Interoperability for Federated Digital Libraries
Creator:Marcos André Gonçalves, Robert K. France and Edward A. Fox
Abstract:Federated digital libraries are composed of distributed, autonomous, and often heterogeneous information services but provide users with a transparent, integrated view of collected information. In this paper we discuss a federated system for the Networked Digital Library of Theses and Dissertations (NDLTD), an international consortium of universities, libraries, and other supporting institutions focused on electronic theses and dissertations (ETDs). Federation requires dealing flexibly with differences among systems, ontologies, and data formats while respecting information sources’ autonomy. Our solution involves adapting the object-oriented digital library system MARIAN to serve as mediation middleware for the federated NDLTD collection. Components of the solution include: 1) the use and integration of several harvesting techniques; 2) an architecture based on objectoriented ontologies of search modules and metadata; 3) reconciliation of diversity within the harvested data joined to a single collection view for the user; and 4) an integrated framework for addressing such questions as data quality, flexible and efficient search, and scalability.
Comment:MARIAN是Virginia Tech(Ed. Fox)成就NDLTD的一个项目,后来OAI和Open DL(5S理论)的基础,看过多次,都没有记住。
Id:OCL_OWL.pdf
Title:OWL and OCL for Semantic Integration
Creator:Yuxiao Zhao, Uwe Assmann and Kristian Sandahl
Abstract:OCL (Object Constraint Language) is an expression language to specify constraints and to refine UML diagrams to make them understandable for a computer. It is an important language in Model-Driven Architecture
(MDA). OWL (Web Ontology Language) is an ontology language for semantic Web. So what is the relationship between OCL and OWL? Can they complement each other for semantic integration? We emphasize three points on OWL and OCL. First, OWL and OCL have more differences than similarities and some complementary roles exist. Second, for semantic integration OWL can be used for modeling public and shared knowledge as widely known whereas OCL can be used for modeling private constraints. Third, OCL can be used a basis for building Ontology Query Language (OQL).
Id:original-articles.pdf
Title:The Stanford Digital Library metadata architecture
Creator:Michelle Baldonado, Chen-Chuan K. Chang, Luis Gravano, Andreas Paepcke
Abstract:The overall goal of the Stanford Digital Library project is to provide an infrastructure that
a.ords interoperability among heterogeneous, autonomous digital library services. These services include both search services and remotely usable information processing facilities. In this paper, we survey and categorize the metadata required for a diverse set of Stanford Digital Library services that we have built. We then propose an extensible metadata architecture that meets these requirements. Our metadata architecture ®ts into our established infrastructure and promotes interoperability among existing and de-facto metadata standards. Several pieces of this architecture are implemented; others are under construction. The architecture includes attribute model proxies, attribute model translation services, metadata information facilities for search services, and local metadata repositories. In presenting and discussing the pieces of the architecture, we show how they address our motivating requirements. Together, these components provide, exchange, and describe metadata for information objects and metadata for information services. We also consider how our architecture relates to prior, relevant work on these two types of metadata.
Comment:这也是一篇早期的数字图书馆元数据方案,斯坦福大学。cf. :Id:baldonado97stanford.pdf
Id:paepcke00search.pdf
Title:Search Middleware and the Simple Digital Library Interoperability Protocol
Source:D-Lib Magazine March 2000 Volume 6 Number 3
Comment:介绍数字图书馆简单互操作协议(SDLIP)的,已经有详细的中文资料,可以不看。
Id:park_J_2005.pdf
Title:Semantic Interoperability across Digital Image Collections: a Pilot Study on Metadata Mapping
Creator:Jung-ran Park
Abstract:The goal of this project is evaluation of the current status of semantic mapping between cataloger-defined field names and Dublin Core metadata elements across digital image collections and identification of the most frequently occurring incorrect and null mappings. A pilot study has been conducted comparing and analyzing 20 digital image metadata templates and 659 metadata item records.
Comment:很实用的目标,大约发表于2004年。
Id:roscheisen97stanford.pdf
Title:The Stanford InfoBus and Its Service Layers Augmenting the Internet with Higher-Level Information Management Protocols
Creator:Martin Röscheisen, Michelle Baldonado, Kevin Chang, Luis Gravano, Steven Ketchpel, Andreas Paepcke
Abstract:The Stanford InfoBus is a prototype infrastructure developed as part of the Stanford Digital Libraries Project to extend the current Internet protocols with a suite of higherlevel information management protocols. This paper surveys the five service layers provided by the Stanford InfoBus: protocols for managing items and collections (DLIOP), metadata (SMA), search (STARTS), payment (UPAI), and rights and obligations (FIRM).
Comment:介绍InfoBus的论文,有中文版了。
Id:semantic_tech.pdf
Comment:一张项目介绍表格,关于地震信息系统的互操作架构。
Id:the-ndltd-and-issues.pdf
Title:The NDLTD and Issues of Long Term Preservation and Archiving: IT’S ABOUT TIME!
Creator:Gail McMillan
Date:May 22, 2003
Comment:作者参加ECDL2003的演讲。NDLTD应该算美国DLI项目中比较成功的,Gail算是去欧洲交流经验。
Id:wache01ontologybased.pdf
Title:Ontology-Based Integration of Information —A Survey of Existing Approaches
Creator:H.Wache, T. V¨ogele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann and S. Hubner
Abstract:We review the use on ontologies for the integration of heterogeneous information sources. Based on an in-depth evaluation of existing approaches to this problem we discuss how ontologies are used to support the integration task. We evaluate and compare the languages used to represent the ontologies and the use of mappings between ontologies as well as to connect ontologies with information sources. We also enquire into ontology engineering methods and tools used to develop ontologies for information integration. Based on the results of our analysis we summarize the state-of-the-art in ontology-based information integration and name areas of further research activities.
Comment:也是一篇综述,必读。
Id:WP-Tanko.pdf
Title:A Framework for Semantic Integration of eLearning Services
Creator:Tanko Ishaya
Abstract:The primary characteristic of the Semantic web is shared understanding, a fundamental challenge now facing e-Learning community. The aim of this paper is to highlight the role of ontologies for enabling the semantic interoperability of learning services and to propose a conceptual framework for integrating e-Learning services using ontologies.
Comment:3页
Id:WS105FiratA.pdf
Title:Multi-dimensional Ontology Views via Contexts in the ECOIN Semantic Interoperability Framework
Creator:Aykut Firat, Stuart Madnick, Frank Manola
Abstract:This paper describes the coupling of contexts and ontologies for semantic integration in the ECOIN semantic interoperability framework. Ontological terms in ECOIN correspond to multiple related meanings in different contexts. Each ontology includes a context model that describes how a generic ontological term can be modified according to contextual choices to acquire specialized meanings. Although the basic ECOIN concepts have been presented in the past, this paper is the first to show how ECOIN addresses the case of “single-ontology with multiple contexts” with an example of semantic integration using our new prototype implementation.
Id:ZieglerICSNW2004.pdf
Title:User-Specific Semantic Integration of Heterogeneous Data: The SIRUP Approach
Creator:Patrick Ziegler, Klaus R. Dittrich






