Archive: March 6, 2006

Resource Catalog

Directory of folders holding digital library materials

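Over the past few days I have organized only four folders (listed on the right). I will not organize the remaining material in detail; I will skim it and post whatever proves valuable.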

“Retreat” Reading, Part 4

Materials related to “Research on Semantic Interoperability of Digital Libraries”: the architecture part.

Id: 0502028.pdf
Title:aDORe: a modular, standards-based Digital Object Repository
Creator:Herbert Van de Sompel, Jeroen Bekaert, Xiaoming Liu, Luda Balakireva, Thorsten Schwander
Abstract:This paper describes the aDORe repository architecture, designed and implemented for ingesting, storing, and accessing a vast collection of Digital Objects at the Research Library of the Los Alamos National Laboratory. The aDORe architecture is highly modular and standards-based. In the architecture, the MPEG-21 Digital Item Declaration Language is used as the XML-based format to represent Digital Objects that can consist of multiple datastreams as Open Archival Information System Archival Information Packages (OAIS AIPs). Through an ingestion process, these OAIS AIPs are stored in a multitude of autonomous repositories. A Repository Index keeps track of the creation and location of all the autonomous repositories, whereas an Identifier Locator registers in which autonomous repository a given Digital Object or OAIS AIP resides. A front-end to the complete environment – the OAI-PMH Federator – is introduced for requesting OAIS Dissemination Information Packages (OAIS DIPs). These OAIS DIPs can be the stored OAIS AIPs themselves, or transformations thereof. This front-end allows OAI-PMH harvesters to recurrently and selectively collect batches of OAIS DIPs from aDORe, and hence to create multiple, parallel services using the collected objects. Another front-end – the OpenURL Resolver – is introduced for requesting OAIS Result Sets. An OAIS Result Set is a dissemination of an individual Digital Object or of its constituent datastreams. Both front-ends make use of an MPEG-21 Digital Item Processing Engine to apply services to OAIS AIPs, Digital Objects, or constituent datastreams that were specified in a dissemination request.
Tag:digital_library_architecture OAI OAIS aDORe
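Comment:A very interesting project, led by Sompel, the inventor of OpenURL, with Xiaoming Liu participating. It covers the various DL elements of DLI. Besides being a typical digital library built on Digital Objects (DOs), it is a fully modular architecture that strictly follows standards: the model conforms to OAIS, data is encoded in the fully XML-based MPEG-21 Digital Item Declaration Language, stream-based access is supported, the repositories are autonomous and communicate via OAI-PMH, and OpenURL resolution is supported as well. Five-star material.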
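
To make the two lookup components in the abstract concrete, here is a minimal Python sketch; it is not aDORe's actual code. It assumes the Repository Index and the Identifier Locator can be modelled as plain dictionaries. The repository names, base URLs, object identifier, and the `didl` metadata prefix are all invented for illustration; only the `GetRecord` verb and its `identifier` and `metadataPrefix` arguments are taken from the OAI-PMH protocol.

```python
from urllib.parse import urlencode

# Hypothetical Repository Index: autonomous repository id -> OAI-PMH base URL.
REPOSITORY_INDEX = {
    "repo-2005-01": "http://example.org/adore/repo-2005-01/oai",
    "repo-2005-02": "http://example.org/adore/repo-2005-02/oai",
}

# Hypothetical Identifier Locator: object identifier -> repository id.
IDENTIFIER_LOCATOR = {
    "info:example/object/42": "repo-2005-02",
}


def locate(identifier: str) -> str:
    """Return the base URL of the repository that holds the given object."""
    repo_id = IDENTIFIER_LOCATOR[identifier]   # which repository holds it?
    return REPOSITORY_INDEX[repo_id]           # where does that repository live?


def get_record_url(identifier: str, metadata_prefix: str = "didl") -> str:
    """Build an OAI-PMH GetRecord request for one object (prefix is assumed)."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{locate(identifier)}?{query}"
```

Calling `get_record_url("info:example/object/42")` first resolves the repository and then builds the request, which mirrors the division of labour the abstract describes between the two registries.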

Id: DLF Service Framework for Digital Libraries.doc
Title:DLF Service Framework for Digital Libraries
Creator:Lorcan Dempsey and Brian Lavoie
Tag:Digital_Library Service
Comment:A discussion of standardizing digital library services. Helpful for understanding the “digital library” in formalized terms.

Id: E6.ppt
Title:Ontology Based Semantic Metadata for Digital library
Creator:YANG Hongshan
Affiliation:Donghua University
Tag:digital_library Ontology metadata
Comment:A discussion of applying metadata and ontologies to digital libraries.

Id: Finding Hidden Semantics behind Reference Linkages an Ontological Approach for Scientific Digital Libraries.pdf
Title:Finding Hidden Semantics behind Reference Linkages : an Ontological Approach for Scientific Digital Libraries
Creator:Peixiang Zhao, Ming Zhang, Dongqing Yang, and Shiwei Tang
Abstract:The contents and topologies of inter-document linkages, such as citations and references among scientific literature, have received increasing research interests in recent years. Some technologies have been fully studied and utilized upon this meaningful information to improve the organization, analysis and evaluation of scientific digital libraries. In this paper, we present a CiteSeer-like system to access scientific papers in computer science discipline by reference linking technique. Moreover, implicit semantics behind reference indices are mined and organized to improve accessibility of scientific papers. In order to model scientific literature and their interlinked relationships, we develop a domain-specific ontology to analyze contents and citation anchor context of scientific papers. Compared with abstract of a specific paper written by authors themselves, we introduce an automatic summary generation algorithm to create objective descriptions from other scholars’ perspectives based on the ontology. Semantic queries can also be asked to discover interesting patterns in scientific libraries in order to provide a comprehensive and meaningful guidance for users.
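Comment:The digital library research of the team led by Professors Shiwei Tang and Dongqing Yang at Peking University has consistently been first-rate in China.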

Id: grimoires.ppt
Title:The GRIMOIRES Service Registry
Creator:Weijian Fang
Affiliation:School of Electronics and Computer Science University of Southampton
Tag:registry UDDI SOA

Id: HPL-2005-189.pdf
Title:An assessment of RDF/OWL modelling
Creator:Dave Reynolds, Carol Thompson, Jishnu Mukerji, Derek Coleman

Id: input to workshop.pdf
Title:BACKGROUND INFORMATION ON SELECTED CURRENT RESOURCE DISCOVERY SERVICES
Abstract:On 20 October 2005, the Research Information Network organised a small workshop aimed at securing a common understanding of the current state of key resource discovery services, and of priorities and plans for their further development both in the short and the longer term. The meeting was attended by representatives from the British Library, CURL, EDINA, JISC, MIMAS and UKOLN, as well as RIN. The timing of the event also provided an opportunity to feed into the Government’s e-Infrastructure strategy process.

Id:jisc-resource-discovery-landscape.pdf
Title:The JISC Resource Discovery Landscape:A personal reflection on the JISC Information Environment and related activities
Creator:Andy Powell
Date:May, 2005
Comments:Explains very clearly the main resource discovery technologies that libraries use today.

Id: jisc-ie-soa.pdf
Title:A ‘service oriented’ view of the JISC Information Environment
Creator:Andy Powell
Date:November, 2005

Id:ie-google.pdf
Title:The JISC Information Environment and Google
Creator:Andy Powell
Date:November, 2004

Id: ScholOnto-IJoDL-2000.pdf
Title:ScholOnto: An Ontology-Based Digital Library Server for Research Documents and Discourse
Creator:SIMON BUCKINGHAM SHUM, ENRICO MOTTA, JOHN DOMINGUE
Abstract:The internet is rapidly becoming the first place for researchers to publish documents, but at present they receive little support in searching, tracking, analyzing or debating concepts in a literature from scholarly perspectives. This paper describes the design rationale and implementation of ScholOnto, an ontology-based digital library server to support scholarly interpretation and discourse. It enables researchers to describe and debate via a semantic network the contributions a document makes, and its relationship to the literature. The paper discusses the computational services that an ontology-based server supports, alternative user interfaces to support interaction with a large semantic network, usability issues associated with knowledge formalization, new work practices that could emerge, and related work.

Id: identifier\xri中文.ppt
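Title:Introduction to XRI: An Overview of Extensible Resource Identifiers
Creator:OASIS XRI Technical Committee
Comment:Identifiers are one of the most important issues in an architecture, but because they are comparatively a by-product of “functional design” and their implementation depends on the architecture, they are often relegated to a subordinate, secondary position. This XRI material is kept for reference; I do not yet know whether it has any connection with Weibel's current research topic.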

Id: identifier\2005-42.pdf
Title:Persistent Identification of Electronic Documents and the Future of Footnotes*
Creator:Susan Lyons
Abstract:Both the accuracy of scholarly footnotes and the long-term access to digital publications are threatened by link rot. Ms. Lyons discusses one possible solution: widespread use of a system for persistent identification of electronic documents.
Tag:Identifier URI URL

Id: iesr-cni.ppt
Title:Introduction to the IESR
Id: MIMAS\use.ppt
Title:Using IESR
Id:contribute.ppt
Title:Creating and Updating IESR Descriptions
Id:apps-niso-20050920.ppt
Title:Using a Registry to Disclose and Discover Resources for Metasearching
Id:contentmanager.ppt
Title:The IESR Content Manager
Creator:Ann Apps
Id: contribute-demo.ppt
Title:Data editing interface: demonstration
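Abstract:
Introduces the four kinds of content that the IESR system describes:
Collections of resources
Informational Services that provide access
Agents: Owners / Administrators
Transactional Services
Comment:Operating guides for the IESR system. It is a practical integration system at the collection level, with the rudimentary form of a distributed digital library; many of its practices are worth studying and borrowing.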
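
The four kinds of description listed above could be modelled roughly as follows. This is only an illustrative Python sketch and not the IESR schema: the class names, field names, and the `collections_with` helper are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    """An owner or administrator of a collection or service."""
    name: str
    role: str  # assumed values: "owner" or "administrator"


@dataclass
class Service:
    """A service giving access to a collection."""
    title: str
    kind: str  # assumed values: "informational" or "transactional"
    access_url: str


@dataclass
class Collection:
    """A collection of resources, linked to its agent and services."""
    title: str
    owner: Agent
    services: List[Service] = field(default_factory=list)


def collections_with(collections: List[Collection], kind: str) -> List[str]:
    """Return titles of collections reachable through a service of this kind."""
    return [c.title for c in collections
            if any(s.kind == kind for s in c.services)]
```

A metasearch client could use a lookup such as `collections_with(registry, "informational")` to decide which collections it is able to query.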

Id: _DAPD_PaperMain.pdf
Title:Enhancing ebXML Registries to Make them OWL Aware
Creator:
ASUMAN DOGAC asuman@srdc.metu.edu.tr
YILDIRAY KABAK yildiray@srdc.metu.edu.tr
GOKCE B. LALECI banu@srdc.metu.edu.tr
CARL MATTOCKS carlmattocks@checkmi.com
JEFF POLLOCK jeff.pollock@networkinference.com
Abstract:In this paper, we address how ebXML registry semantics support can be further enhanced to make it OWL aware. There are basically three ways of achieving this: The first one is mapping OWL constructs to ebXML registry information model constructs without modifying the registry architecture and implementation. In this way, the semantic explicitly stored in the registry can be retrieved through querying; yet, the application program must contain additional code to process this semantics. The second approach is additionally providing predefined stored procedures in the registry for processing the OWL constructs. We believe that this approach is quite powerful to associate semantics with registry objects: it becomes possible to retrieve knowledge through queries and the enhancements to the registry are generic. The capabilities provided move the semantics support beyond what is currently available in ebXML registries and it does so by using a standard ontology language. The third approach is changing the ebXML registry to support OWL with full reasoning capabilities. However, this approach requires considerable changes in the registry architecture.
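Comment:The application environment of ebXML is arguably far more complex than that of digital libraries; the architecture and methods presented in this paper are worth borrowing from.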

Id: ITM-TR2004-en-V1.pdf
Title:ITM Metadata Repository
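Comment:This is in effect a company's promotional brochure, but it does reveal some architectural ideas. As corporate advertising, of course, it mixes fact with hype and exaggerates.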

Id: McGuinness_IJCAIws_2003.pdf
Title:Registry-Based Support for Information Integration
Creator:Deborah L. McGuinness and Paulo Pinheiro da Silva
Affiliation:Knowledge Systems Laboratory, Stanford University
Abstract:In order for agents and humans to leverage the growing wealth of heterogeneous information and services on the web, increasingly, they need to understand the information that is delivered to them. In the simplest case, an agent or human is retrieving “look-up” information and would benefit from having access to provenance information concerning recency, source authoritativeness, etc. In more complicated situations where information is manipulated before it is returned as an answer, agents and humans would benefit from understanding the derivations and assumptions used. When services are involved, users and agents also would benefit from understanding what actions could be or were executed on the user’s behalf. In this paper, we introduce a strategy for registering information sources and question answering systems providing support for implementing distributed and cooperative web services. In this paper, we describe the inference web infrastructure that supports explanations in distributed environments such as the web and describe the elements of its registry.

Id: MWSDI-ICWS04-final.doc
Id: MWSDI-ICWS04-final.pdf
Title:Discovery of Web Services in a Federated Registry Environment
Creator:Kaarthik Sivashanmugam, Kunal Verma, Amit Sheth
Abstract:The potential of a large scale growth of private and semi-private registries is creating the need for an infrastructure which can support discovery and publication over a group of autonomous registries. Recent versions of UDDI have made changes to accommodate interactions between distributed registries. In this paper, we discuss METEOR-S Web Service Discovery Infrastructure, which provides an ontology based infrastructure to provide access to registries divided based on business domains and grouped into federations. We also discuss how Web service discovery is carried out within a federation.

Comment:This paper's application environment is P2P, so it does not fit my topic well.

Id: model&process\ 04-EMISA.pdf
Title:A Comparison of XML Interchange Formats for Business Process Modelling
Creator:Jan Mendling, Gustaf Neumann, and Markus Nuttgens
Abstract:This paper addresses heterogeneity of business process metamodels and related interchange formats. The different approaches towards interchange format design and effects of interchange format specification are presented first. In particular completeness is identified as an important design criterion for interchange formats. Afterwards the superset of metamodel concepts is extracted from 15 currently available XML-based specifications for business process modelling. Furthermore, these concepts are used as a framework for comparing the completeness of 15 specifications.
Comment:The part worth reading is the comparative analysis of the various XML languages used for information modelling, interchange, and process encoding.

Id: model&process\ 1568938370.pdf
Title:Ontology-based Service Discovery in P2P Networks
Creator:Daniel Elenius, Magnus Ingmarsson
Abstract:The ubiquitous computing vision is to make knowledge and services easily available in our everyday environments. A wide range of devices, applications and services can be interconnected to provide intelligent and automatic systems that make our lives more enjoyable and our workplaces more efficient. Interaction typically is to be between peers rather than clients and servers. In this context, the JXTA peer-to-peer infrastructure, designed for interoperability, platform independence and ubiquity, is a suitable foundation to build future computer systems on. Peers need ways to effortlessly discover, consume and provide services, and to take advantage of new services as they become available in a dynamically changing network. However, JXTA does not currently handle this service-discovery problem. In this paper, we examine several service-discovery architectures, to see whether they can be adapted to JXTA. We conclude that none of them adequately support the flexibility and expressiveness that ubiquitous computing requires. We therefore argue that Web Ontology Language (OWL) and OWL Services (OWL-S) ontologies should be used to express detailed semantic information about services, devices and other service-discovery concepts. This kind of approach allows peers to reason about service offerings and achieve intelligent service discovery by using an inference engine. We present an experimental implementation of this ontological approach to service-discovery, called Oden (Ontology-based Discovery-Enabled Network).
Comment:Covers P2P service discovery.

Id: model&process\ AH-Book-2003-PreFinal.pdf
Title:SCARCE: an Adaptive Hypermedia Environment Based on Virtual Documents and Semantic Web
Creator:Serge Garlatti, Sébastien Iksal, Philippe Tanguy
Abstract:Flexible hypermedia or more precisely virtual documents can lead to methods facilitating web service design and maintenance. Indeed, they have the ability to compose on the fly real documents on user demand. We have designed a flexible environment for adaptive virtual documents based on a semantic web approach. In this environment, selection, filtering and organization are managed and specified at semantic level. These three declarative specifications which are composition engine parameters, are called a generic document. The filtering specification is defined by semantic properties associated with the generic document. These properties have the following roles: define how to evaluate the selected resources for classifying them in different equivalence classes according to a user model, determine how to choose an adaptive navigation technique according to the current user.
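Comment:In effect this is about the virtual composition of microcontent. It relates to the micro-structure of digital libraries, but is more relevant to Web 2.0.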

Id: model&process\ Burcea_etal.pdf
Title:I know what you mean: semantic issues in Internet-scale publish/subscribe systems
Creator:Ioana Burcea, Milenko Petrovic, and Hans-Arno Jacobsen
Abstract: In recent years, the amount of information on the Internet has increased exponentially developing great interest in selective information dissemination systems. The publish/subscribe paradigm is particularly suited for designing systems for routing information and requests according to their content throughout wide-area network of brokers. Current publish/subscribe systems use limited syntax content-based routing. Since publishers and subscribers are anonymous and decoupled in time, space and location, often over wide-area network boundaries, they do not necessarily speak the same language or use the same data and language format. Consequently, adding semantics to current publish/subscribe systems is important. In this paper we identify and examine the issues in developing semantic-aware content-based routing for publish/subscribe broker networks.
Comment:A good title. The content builds on a traditional publish/subscribe architecture, which Web 2.0 seems to be overturning. Only the part on how to add/annotate semantic information (computing semantics) is worth consulting.
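
The mismatch the abstract describes, where publisher and subscriber use different vocabularies, can be illustrated with a small Python sketch. This is not the paper's routing algorithm. It assumes the simplest possible "semantics": a hand-written synonym table (`CANONICAL`, hypothetical) that maps attribute names to one canonical term before ordinary content-based matching.

```python
# Hypothetical synonym table: attribute name -> canonical attribute name.
CANONICAL = {
    "author": "author",
    "creator": "author",
    "writer": "author",
    "subject": "subject",
    "topic": "subject",
}


def normalize(message: dict) -> dict:
    """Rewrite attribute names to their canonical form; unknown names pass through."""
    return {CANONICAL.get(name, name): value for name, value in message.items()}


def matches(subscription: dict, publication: dict) -> bool:
    """True if every constraint in the subscription is met by the publication."""
    sub = normalize(subscription)
    pub = normalize(publication)
    return all(pub.get(name) == value for name, value in sub.items())
```

Without `normalize`, a subscription on `author` would never match a publication that says `creator`, which is the purely syntactic behaviour the abstract criticizes.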

Id: model&process\ Campos.ppt
Title:An Architecture for Managing Distributed Scientific Resources
Creator:Maria Cláudia Cavalcanti
Date:24th – 26th July 2002
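Format:PPT presentation
Comment:I did not fully understand it. A good problem, good ideas, and a good architecture, but the implementation still seems rather closed. A different conceptual system?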

Id: model&process\ ContextIntegration.pdf
Title:Context-based Portlet Integration
Creator:Torsten Priebe
Date:26th August 2004
Abstract:Portals have become the de facto standard for Web application delivery. In particular, enterprise portals make an important contribution to enabling enterprise knowledge management by providing users with a consolidated, personalized user interface that allows efficient access to various types of (structured and unstructured) information. Today’s portal systems allow combining so-called portlets with access to different information sources and applications side by side on a single portal webpage. However, there is only little interaction between those portlets. If inter-portlet communication capabilities are offered, they require extensive individual programming. This paper presents a generic approach for communicating the user context (revealing the user’s information need) among portlets, utilizing Semantic Web technologies. For example, the context of an OLAP portlet, which provides access to structured data stored in a data warehouse, can be used by a search portlet in order to automatically provide the user with related intranet articles or documents found in the organization’s document management system. In order to describe the approach independently of a specific portal server product, we base our proposal on extensions to the Java Portlet Specification and WSRP
Comment:Portlets can indeed be regarded as a component of Web 2.0 technology. The paper is mainly about portlet integration, uses Web services, and describes the approaches of large companies such as SAP and IBM.

Id: model&process\ CSCS14_knowledge.pdf
Title:KNOWLEDGE-BASED E-LEARNING ON THE SEMANTIC WEB
Creator:Stefan Trausan-Matu, Valentin Cristea, Octavian Udrea
Abstract: The SINTEC environment for learning through the web is presented. Intelligent, knowledge-based tutoring facilities are provided, together with more classical learning management and collaborative learning facilities. The usage of ontologies and metadata, that use standard annotations languages based on XML and RDF enables the extension of the facilities of SINTEC for the future semantic web.
Comment:Rather brief. As before, my interest is the implementation architecture, chiefly how “semantics” is implemented.

Id: model&process\ D22ORDIv1.0.pdf
Title:An Ontology Representation and Data Integration (ORDI) Framework
Creator:Atanas Kiryakov, Damyan Ognyanov, Vesselin Kirov
Date:July 19th, 2004
Comment:A technical report, worth reading.

Id: model&process\ DavidFrankelSoftwareIndustrialization.pdf
Title:Software Industrialization A Perspective on MDA
Creator:David Frankel Consulting
Format:PDF presentation
Comment:An introduction to MDA.

Id: model&process\ doan02learning.pdf
Title:Learning to Map between Ontologies on the Semantic Web
Creator:AnHai Doan, Jayant Madhavan, Pedro Domingos, and Alon Halevy
Affiliation:University of Washington, Seattle, WA, USA
ABSTRACT:Ontologies play a prominent role on the Semantic Web. They make possible the widespread publication of machine understandable data, opening myriad opportunities for automated information processing. However, because of the Semantic Web’s distributed nature, data on it will inevitably come from many different ontologies. Information processing across ontologies is not possible without knowing the semantic mappings between their elements….
Comment:Covers ontology mapping; it seems this should be filed under the interoperability folder.

Id: model&process\ ehrig03ontologyfocused.pdf
Title:Ontology-Focused Crawling of Web Documents
Creator:Marc Ehrig, Alexander Maedche
Abstract:The Web, the largest unstructured database of the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed nature of the World Wide Web it is difficult to use it as a tool for information and knowledge management. Therefore, users doing the difficult task of exploring the Web have to be supported by intelligent means. This paper proposes an approach for document discovery building on a comprehensive framework for ontology-focused crawling of Web documents. Our framework includes means for using a complex ontology and associated instance elements. It defines several relevance computation strategies and provides an empirical evaluation which has shown promising results.
Comment:How are ontologies used to crawl Web information, and how is it indexed and used once crawled? Another Swoogle?

Id: model&process\ essay2003-haoding.pdf
Title:Challenges in Building Semantic Interoperable Digital Library System
Creator:Hao Ding
Abstract:After a decade of research and development, digital libraries are becoming operational systems and services. This paper briefly summarizes some of the challenges in building such library services. In building the Semantic Web enabled digital library system, the interoperability and scalability will be definitely the most important problem to be solved. To make clear the present status, I concisely make a survey on the functions of the interoperability-related protocols and standards in digital library field, such as z39.50, OAI, etc. In this paper, I divided the interoperability into syntactic level and semantic level. The semantic interoperability is the most complex set of challenges in the arena of building Semantic Web enabled digital library system. Finally, I describe the key challenges we are now faced and also present my initial research approaches.
Comment:A good choice of topic; the conclusions are a bit thin, but it is worth consulting.

Id: model&process\ esws2004.pdf
Title:Learning to Harvest Information for the Semantic Web
Creator:Fabio Ciravegna, Sam Chapman, Alexiei Dingli, and Yorick Wilks
Abstract:In this paper we describe a methodology for harvesting information from large distributed repositories (e.g. large Web sites) with minimum user intervention. The methodology is based on a combination of information extraction, information integration and machine learning techniques. Learning is seeded by extracting information from structured sources (e.g. databases and digital libraries) or a user-defined lexicon. Retrieved information is then used to partially annotate documents. Annotated documents are used to bootstrap learning for simple Information Extraction (IE) methodologies, which in turn will produce more annotation to annotate more documents that will be used to train more complex IE engines and so on. In this paper we describe the methodology and its implementation in the Armadillo system, compare it with the current state of the art, and describe the details of an implemented application. Finally we draw some conclusions and highlight some challenges and future work.

Id: model&process\ HCISWWA-2003.pdf
Title:Semantic Web Services for Smart Devices in a “Global Understanding Environment”
Creator:Vagan Terziyan
Abstract:Various Web resources and services are usually assumed to be used and accessed by human users (current Web) or by software agents on behalf of human users (emerging Semantic Web). However industry emerges also a new group of “users”, which are smart industrial devices, robots or any other objects, which can be adapted to the (Semantic) Web environment. They would need special services for e.g. online condition monitoring, information provisioning, remote diagnostics, maintenance support, etc. The goal of this paper is to specify main requirements to Web services that automatically follow up and predict the performance and maintenance needs of field devices. Semantic Web enabled services form a Service Network based on internal and external service platforms and OntoShell software. Concepts of a “Global Understanding Environment” and a “mobile service component” suppose that any component can be adapted to Semantic Web environment and executed at any platform from the Service Network, including service requestor side. This allows delivering not only a service results but also a service itself. Mobile service component within an OntoShell (agent) can move to a field device’s local environment (embedded agent platform) and perform its activities locally. Service components improve their performance through online learning and communication with other components. Heterogeneous service components’ discovery is based on semantic P2P search.
Comment:Although this topic necessarily involves “semantics”, the paper does not discuss semantics specifically.

Id: model&process\ hess-iswc04.pdf
Title:ASSAM: A Tool for Semi-automatically Annotating Semantic Web Services
Creator:Andreas Heß, Eddie Johnston, and Nicholas Kushmerick
Abstract:The semantic Web Services vision requires that each service be annotated with semantic metadata. Manually creating such metadata is tedious and error-prone, and many software engineers, accustomed to tools that automatically generate WSDL, might not want to invest the additional effort. We therefore propose ASSAM, a tool that assists a user in creating semantic metadata for Web Services. ASSAM is intended for service consumers who want to integrate a number of services and therefore must annotate them according to some shared ontology. ASSAM is also relevant for service producers who have deployed a Web Service and want to make it compatible with an existing ontology. ASSAM’s capabilities to automatically create semantic metadata are supported by two machine learning algorithms. First, we have developed an iterative relational classification algorithm for semantically classifying Web Services, their operations, and input and output messages. Second, to aggregate the data returned by multiple semantically related Web Services, we have developed a schema mapping algorithm that is based on an ensemble of string distance metrics.
Comment:A fairly realistic approach: semi-automatic annotation of semantic information.
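
The abstract mentions schema mapping based on “an ensemble of string distance metrics” without naming the metrics. As a rough illustration only, the Python sketch below averages two similarity measures that I chose myself (a character-level ratio from the standard `difflib` module and a Jaccard overlap of name tokens); it is not ASSAM's algorithm, and the field names in the usage note are invented.

```python
import re
from difflib import SequenceMatcher


def tokens(name: str) -> set:
    """Split a field name such as 'zipCode' or 'zip_code' into lowercase tokens."""
    parts = re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", name)
    return {p.lower() for p in parts}


def char_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] from difflib."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def token_similarity(a: str, b: str) -> float:
    """Jaccard overlap of the token sets of the two names."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def ensemble_similarity(a: str, b: str) -> float:
    """Unweighted mean of the individual metrics (the weighting is an assumption)."""
    return (char_similarity(a, b) + token_similarity(a, b)) / 2


def best_match(field_name: str, candidates: list) -> str:
    """Pick the candidate field most similar to field_name."""
    return max(candidates, key=lambda c: ensemble_similarity(field_name, c))
```

For example, `best_match("zip_code", ["zipCode", "cityName", "phone"])` selects `zipCode`, because the two names share all their tokens and nearly all their characters.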

Id: model&process\ HT04WE_Garofalakis.pdf
Title:Web Service Discovery Mechanisms: Looking for a Needle in a Haystack?
Creator:John Garofalakis, Yannis Panagis, Evangelos Sakkopoulos, and Athanasios Tsakalidis
Abstract:The introduction of software development via Web Services has been the most significant web engineering paradigm, in the last years. The widely acknowledged importance of the Web Services’ concept lies in the fact that they provide a platform independent answer to the software component development question. Equally important are the mechanisms that allow for Web Service discovery, especially as the latter has turn to an arduous task. This paper reviews the latest methods, architectures, models and concerns that have arisen in the Web Service Discovery area.
Comment:So far I have not found a good survey article on SWS that explains the state of research on automatic discovery of semantic services.

Id: model&process\ i_040913ErichNeuhold.doc
Title:The Role of Context for Information Mediation in Digital Libraries
Creator:Erich Neuhold, Claudia Niederée, Avaré Stewart, Ingo Frommholz, and Bhaskar Mehta
Abstract:Mediating between available information objects and individual information needs is a central issue within the functionality of a digital library. In the simplest case this is an information request answered by a search engine based on an analysis of information objects within the digital library’s information collection. However, neither the information access activity nor the information objects within the collection are isolated entities. They are both equipped with a multifaceted context. The invited talk, which is summarized by this paper, analyzes this context and discusses complementing approaches to make such context explicit and to use it for refining the mediation process within digital libraries.
Comment:Looks like a good paper.

Id: model&process\ I05-2 376.pdf
Title:Interoperability through integrating Semantic Web Technology, Web Services, and Workflow Modeling
Creator:John Krogstie, Csaba Veres, Guttorm Sindre
Abstract:A number of technologies are mentioned under the rubric of “The Semantic Web”, but good overviews of these technologies with an eye toward practical applications are scarce. Moreover, much of the early focus in this field has been on the development of representation languages for static conceptual information, while there has been less emphasis on how to make semantic web applications practically useful in the context of knowledge work. To achieve this, a better coupling is needed between ontology, service descriptions and workflow modeling. This paper reviews all the basic technologies involved in this, and outlines what can be achieved by merging them in the context of real world workflow descriptions.
Comment:An excellent survey paper, five stars!

Id: model&process\ ICWS2004_final.pdf
Title:Dynamic Data Integration using Web Services
Creator:Fujun Zhu, Mark Turner, Ioannis Kotsiopoulos, Keith Bennett, Michelle Russell, David Budgen, Pearl Brereton, John Keane, Paul Layzell and Michael Rigby
Abstract:We address the problem of large scale data integration, where the data sources are unknown at design time, are from autonomous organisations, and may evolve. Experiments are described involving a demonstrator system in the field of health services data integration within the UK. Current web services technology has been used extensively and largely successfully in these distributed prototype systems. The work shows that web services provide a good infrastructure layer, but integration demands a higher level “broker” architectural layer; the paper identifies eight specific requirements for such an architecture that have emerged from the experiments, derived from an analysis of shortcomings which are collectively due to the static nature of the initial prototype. The way in which these are being met in the current version in order to achieve a more dynamic integration is described.
Key words: Data integration, Service-Based Software, Web services, Service-Oriented Architecture, Access Control, Change Management, Semantic
Comment:No time to read it yet; a good paper!

Id: model&process\ iksal_full.pdf
Title:Adaptive Web Information Systems: Architecture and Methodology for Reusing Content
Creator:Sébastien Iksal, Serge Garlatti
Abstract:Nowadays, adaptive web information systems use partially the Web to provide different kinds of content, navigation tools and layouts according to user needs. We focus on AWIS for which users share a common knowledge to work together. For us, AWIS design is an intensive knowledge driven process. We propose the methodology and architecture used in the flexible composition engine called SCARCE. The paper presents the key issues for reusing the content in the methodology: interoperability and W3C standards, consistency of the delivered document and the distinct specification and management of AWIS components. The main benefits of this approach are: i) a generic AWIS architecture which is reusable in different contexts, ii) this architecture is tuned to the explicit knowledge of communities and provide a method for AWIS design. Keywords. Adaptive Web Information Systems, Composition Engine, Semantic Web, Metadata, User Model.

Id: model&process\ isws2004_WSPDS.pdf
Title:WSPDS: Web Services Peer-to-peer Discovery Service
Creator:Farnoush Banaei-Kashani, Ching-Chien Chen, and Cyrus Shahabi
Abstract:The Web Services infrastructure is a distributed computing environment for service-sharing. In this environment, resource discovery is required as a primitive functionality for users to be able to locate the services, the shared resources. A discovery service with centralized architecture, such as UDDI, restricts the scalability of this environment as it grows to the scales comparable with the size of the web itself. In addition, current extensively used web service standards (e.g. UDDI, WSDL), do not support discovery at a semantic level. In this paper, we introduce WSPDS (Web Services Peer-to-peer Discovery Service), a fully decentralized and interoperable discovery service with semantic-level matching capability. (Appealing!) We believe the peer-to-peer architecture of the semantic-enabled WSPDS not only satisfies the design requirements for efficient and accurate discovery in distributed environments, but also is compatible with the nature of the Web Services environment as a self-organized federations of peer service-providers without any particular sponsor.
Keywords:Web Services discovery, Peer-to-peer discovery, Ontology, Semantic matching

Id: model&process\ jasis-paper.pdf
Title:Collection Metadata Solutions for Digital Library Applications
Creator:Linda L. Hill, Greg Janee, Ron Dolin, James Frew, and Mary Larsgaard
Source:JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE. 50(13):1169–1181, 1999
Abstract:Within a digital library, collections may range from an ad hoc set of objects that serve a temporary purpose to established library collections intended to persist through time. The objects in these collections vary widely, from library and data center holdings to pointers to real-world objects, such as geographic places, and the various metadata schemas that describe them. The key to integrated use of such a variety of collections in a digital library is collection metadata that represents the inherent and contextual characteristics of a collection. The Alexandria Digital Library (ADL) Project has designed and implemented collection metadata for several purposes: in XML form, the collection metadata “registers” the collection with the user interface client; in HTML form, it is used for user documentation; eventually, it will be used to describe the collection to network search agents; and it is used for internal collection management, including mapping the object metadata attributes to the common search parameters of the system.
Comment:An early article on describing resource collections in digital libraries. Its content may no longer be novel, but it helps explain why digital libraries need high-level (collection-level) interoperability, along with the difficulties and possible approaches.

Id: model&process\ Kerschberg_Knowledge_Sifter.pdf
Id: model&process\ Kerschberg_NASA_IST.ppt
Title:Knowledge Sifter: Ontology-Driven Search over Heterogeneous Databases
Creator:L. Kerschberg, M. Chowdhury, A. Damiano, H. Jeong, S. Mitchell, J. Si, and S. Smith
Abstract:Knowledge Sifter is a scaleable agent-based system that supports access to heterogeneous information sources such as the Web, open-source repositories, XML databases and the emerging Semantic Web. User query specification is supported by a user agent that accesses multiple ontologies using an integrated conceptual model expressed in the Web Ontology Language (OWL). A collection of cooperating agents supports interactive query specification and refinement, query decomposition, query processing, as well as result ranking and presentation. The Knowledge Sifter architecture is general and modular so that ontologies and information sources can be easily incorporated. A proof-of-concept implementation shows how Knowledge Sifter can search geo-spatial ontology services such as the USGS Geographic Names Information System (GNIS) and Princeton University’s WordNet as well as image databases including Lycos and TerraServer. Each Agent is implemented as a Web Service and the external sources are also accessed via Web Service Technology.
Comment:The architecture and design are very thoroughly thought out, but it is still only a lab prototype; good material for writing papers.

Id: model&process\ khoussainov03optimising.pdf
Title:Optimising Performance of Competing Search Engines in Heterogeneous Web Environments
Creator:Rinat Khoussainov, Nicholas Kushmerick
Abstract:Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-specific search engines provide search services, and metasearchers distribute user’s queries to only the most suitable search engines. Previous research has explored the performance of such environments from the user’s perspective (e.g., improved quality of search results). We focus instead on performance from the search service provider’s point of view (e.g., income from queries processed vs. resources used to answer them).
We analyse a scenario in which individual search engines compete for queries by indexing documents for which they think users are likely to query. We show that naive strategies (e.g., blindly indexing lots of popular documents) are ineffective, because a rational search engine’s indexing decisions should depend on the (unknown) decisions of its opponents.
We propose the COUGAR algorithm that specialized search engines can use to decide which documents to index on each particular topic. COUGAR is based on a game-theoretic analysis of heterogeneous search environments, and uses reinforcement learning techniques to exploit the sub-optimal behaviour of its competitors. Our evaluation of COUGAR against a variety of opponents based on queries submitted to 47 existing search engines demonstrates the feasibility of our approach.
Comment:A fairly fine-grained problem, including the algorithm. Worth knowing about, nothing more.

Id: model&process\ lewis-osullivan-wp-v2.pdf
Title:Semantically Driven Service Interoperability for Pervasive Computing
Creator:Declan O’Sullivan, Dave Lewis
Date:28th May 2003
Abstract:The common vision of pervasive computing environments requires a very large range of devices and software components to interoperate seamlessly. From the assumption that these devices and associated software permeate the fabric of everyday life, a massive increase looms in the number of software developers deploying functionality into pervasive computing environments. This poses a very large interoperability problem for which solutions reliant solely on interoperability standards will not scale. An interoperability problem of a similar scale is presented by the desire for a Semantic Web supporting autonomous machine communication over the WWW. Here, solutions based on service-oriented architectures and ontologies are being actively researched, and we examine how such an approach could be used to address pervasive computing’s interoperability problem. The paper outlines the potential role that semantic techniques offer in solving some key challenges, including candidate service discovery, intelligent matching, service adaptation and service composition. In particular the paper addresses the resulting requirement of semantic interoperability outlining initial results in dynamic gateway generation. In addition the paper proposes a roadmap identifying the different scenarios in which semantic techniques will contribute to the engineering and operation of pervasive computing systems.
Keywords:pervasive computing, service composition, semantic interoperability, topic maps, DAML-S.

Id: model&process\ marseilles271103.pdf
Title:SEWASIE: a semantic search engine
Creator:Sonia Bergamaschi
Format:PDF presentation slides
Date:27 November, 2003
Comment:A Semantic Web search engine is one solution for the era of semantic search.

Id: model&process\ p342-janee.pdf
Title:The ADEPT Digital Library Architecture
Creator:Greg Janee, James Frew
Comment:A well-known DLI1 project, but the material is dated.

Id:model&process\ Paper_30.pdf
Title:A Metadata Model Based on the Concept of Structured Digital Object(SDO) and Its Application in Digital Libraries -From Concept to Prototype System
Creator:Ying LI, Hidehiro ISHIZUKA
Abstract:Metadata is data about data. It is considered an ideal and a very useful solution in describing/managing resources on the Internet. Especially, in the digital library field, metadata plays an important role for integrating digital resources and offering information service. However, from users standpoint, information service offered nowadays by any digital library is unable to satisfy their diversified needs, such as, knowledge information, individualization information, reusable and sharable information. A metadata model based on a new concept is required. As an extension of our previous research issue, we proposed the concept of Structured Digital Object (SDO is an abbreviation). SDOs are used for reorganizing/ restructuring existing digital resources, because SDO set is structured data about data, this paper call it metadata. Using the metadata model based-concept of SDO, restructure various existing resources in existing digital libraries, form the so-called “Global Digital Library “. The Global Digital Library can adopt Web Services for information services. It can solve not only interoperability problems among heterogeneous resources, heterogeneous systems and operating systems, but also can meet individual user need to different granularities information. We also used Topic Maps to associate SDOs with information resources that can be located in existing digital libraries or the global digital library. Furthermore, because all the metadata is described by XML, achieve reusing, sharing information. In the paper, we give some demonstrations to approve the points described above.
Keywords: Structured Digital Object (SDO), Metadata Model, Global Digital Library, Web Services, Topic Maps.

Id: model&process\ paper1.pdf
Title:A Semantic Web Approach for Adaptive Hypermedia
Creator:Serge Garlatti and Sébastien Iksal
Abstract:Adaptation/personalization is one of the main issues for web services. Adaptive web applications have the ability to deal with different users’ needs for enhancing usability and comprehension and for dealing with large repositories. Indeed, adaptive web services – also often called Adaptive Hypermedia Systems - can provide different kinds of information, different layouts and different navigation tools according to users’ needs. We propose an open-ended adaptive hypermedia environment which is based on the virtual document and semantic web approaches and which is able to manage adaptive techniques at knowledge level. The aim is to simplify the creation and the management of adaptive web services by using ontologies and semantic properties for adaptation. Indeed, they are declarative parameters for computing on the fly services. Indeed, the specification of the adaptive mechanisms is defined by semantic properties associated to a hypermedia document by an author. These properties have the following roles: define how to evaluate the links/content for grouping them together in different classes according to a user model, determine how to manage these classes for each adaptive technique and assign user stereotypes to adaptive techniques. Then, an author can determine the relevant adaptive techniques for a given user group

Id: model&process\ paper8.pdf
Title:Exposing Cross-Domain Resources for Researchers and Learners
Creator:Ann Apps, Ross MacIntyre, Leigh Morris
Abstract:MIMAS is a national UK data centre which provides networked access to resources to support learning and research across a wide range of disciplines. There was no consistent way of discovering information within this cross-domain, heterogeneous collection of resources, some of which are access restricted. Further these resources must provide the interoperable interfaces required within the UK higher and further education ‘information environment’. To address both of these problems, consistent, high quality metadata records for the MIMAS services and collections have been created, based on Dublin Core, XML and standard classification schemes. The XML metadata repository, or ‘metadatabase’, provides World Wide Web, Z39.50 and Open Archives Initiative interfaces. In addition, a collection level database has been created with records based on the RSLP Collection Level Description schema. The MIMAS Metadatabase, which is freely available, provides a single point of access into the disparate, cross-domain MIMAS datasets and services. Keywords. Metadata, Dublin Core, cross-domain, collection level description, subject classification.

Id: model&process\ SAM_2004_two_pager_Myers.pdf
Title:Scientific Annotation Middleware
Creator:Jim Myers (Lead Investigator)
Abstract:The Scientific Annotation Middleware (SAM) system (http://www.scidac.org/SAM) provides significant advances in research documentation and data provenance tracking to support the effective management and coordination of complex, collaborative, cross-disciplinary, compute-intensive research, such as that enabled through the Scientific Discovery through Advanced Computing (SciDAC) initiative. The SAM system presents researchers, applications, LIMS, electronic notebooks, and software agents with a layered set of components and services that provide successively more specialized capabilities for the creation and management of metadata, the definition of semantic relationships among data objects (e.g. provenance), and the development of electronic research records.

Id: model&process\ sca.paper.rtf
Title:Runtime Classification of Agent Services
Creator:Peter Weinstein and William P. Birmingham
Abstract:The Service Classifier Agent maintains a dynamic ontology of agent capabilities. To advertise their services, agents define concepts at runtime. These concepts are automatically classified with description logic. Agents requesting services can select the best available to meet their needs, using queries that exploit rich knowledge about services and their relations to other services.
Runtime classification of agent services encourages the development of agents to provide new services. New agents may be utilized immediately upon joining a society, without requiring modification or even notification of existing agents.

Id:SemanticWebServices-SinanAral.pdf
Title:Automating Orchestration: Bridges toward Semantic Web Services
Creator:Sinan Aral
Abstract:Web Services are lauded as the next wave of enterprise computing and an extensible, flexible and reusable solution to current enterprise integration dilemmas. However, important advances in standards development, orchestration, automation and security must be realized before Web Services can contribute extensively to enterprise integration efforts. Automated and coordinated Web Services development requires a move toward Semantic Web Services designed to combine the intelligent aspects of the Semantic Web with the reusable, component based architecture underpinning Web Services. This paper attempts to survey recent technical and strategic developments in both the Web Services space and the Semantic Web to highlight areas of possible functional coordination between the two. Developments in the Web Services standards stack are presented and explored with a concentration on where Web Service standards around orchestration such as WSFL, BPEL and WSCI might be combined with automation efforts such as RDF and the DAML family of semantic markup languages. Process ontologies, rule based agent environments, and ontology bridging are suggested as possible avenues through which automation and orchestration might be integrated. The strategic landscape of the Web Services space is then reviewed to assess possible directions for future standards development and coordination with developments in the Semantic Web. The purpose of the paper is to give researchers in the Web Services domain and the Semantic Web domain a common roadmap for interaction and coordination and a current survey of developments on either side.
Keywords:Web Services, Semantic Web, Semantic Web Services, Orchestration, Automation, WSFL, BPEL, WSCI, DAML, DAML + OIL, DAML-S.

Id:TR2004-14.pdf
Title:Requirements Engineering and the Semantic Web: Part II. Representation, Management, and Validation of Requirements and System-Level Architectures
Creator:Vimal Mayank, Natalya Kositsyna and Mark Austin

Id:vol4_VDevedzic.pdf
Title:Web Intelligence and AIED
Creator:Vladan DEVEDŽIĆ
Abstract:This paper surveys important aspects of Web Intelligence (WI) in the context of AIED research. WI explores the fundamental roles as well as practical impacts of Artificial Intelligence (AI) and advanced Information Technology (IT) on the next generation of Web-related products, systems, services, and activities. As a direction for scientific research and development, WI can be extremely beneficial for the field of AIED. Some of the key components of WI have already attracted AIED researchers for quite some time – ontologies, adaptivity and personalization, and agents. The paper covers these issues only very briefly. It focuses more on other issues in WI, such as intelligent Web services, semantic markup, and Web mining, and proposes how to use them as the basis for tackling new and challenging research problems in AIED.
Keywords:Web intelligence, ontologies, Semantic Web, educational Web services, pedagogical agents.

Id:wism2004-8.ppt
Title:Modelling Data-Intensive Web Sites with OntoWeaver
Creator:Yuangui Lei, Enrico Motta, John Domingue
Format:PPT presentation slides

Id:www11.pdf
Title:Manageable Approaches to the Semantic Web
Creator:Philippe Martin & Peter Eklund
Abstract:The Semantic Web is usually envisaged as a collection of Web accessible RDF documents that re-use RDF schemas. These schemas are expected to be most often independently designed and hence not sharing many categories. We are unconvinced that this approach is viable because the lack of semantic relationships between the categories will most often make it impossible for future Web search engines to semantically compare RDF statements, and hence use them for logical inferencing or even permit their retrieval. We believe a first requirement for a viable Semantic Web is to permit knowledge providers to use common vocabulary and representation means. This implies: (i) lexical, structural and ontological conventions; (ii) a high-level expressive notation guiding the knowledge representation process and restricting the ways things can be expressed (rather than what can be expressed); (iii) a rich ontology of knowledge representation primitives (or library of complementary ontologies); and (iv) a large ontology for natural language that knowledge providers can use and specialize to describe their domains. We have collected, complemented and integrated such conventions, notations and ontologies, and introduce them in this article. Such a set (not necessarily ours) needs to be recommended by the W3C (who else?) in order to be used and thus permit knowledge sharing. Another step (not involving W3C intervention) is the development of large-scale knowledge base (KB) servers allowing users to retrieve, re-use, complement, annotate and be guided by other users’ knowledge. An implementation of such a server, WebKB-2 (www.webkb.org), is described in this article. From an external viewpoint, WebKB-2 can be exploited as a large virtual document in RDF or other export formats. Mirroring techniques between KB servers can also be used; in this architecture no unique server is relied upon and the server where a Web user publishes information would be of no concern.
Thus, this more centralized approach to the Semantic Web maintains the advantages of the expected “highly decentralized” approach while solving its problems.
Keywords: Semantic Web, Ontology, Ontology Server, Cooperation, Knowledge Representation/Retrieval/Engineering/Sharing/Re-use.

Id:WWW_FromProceedings.pdf
Title:Foundations for Service Ontologies: Aligning OWL-S to DOLCE
Creator:Peter Mika, Daniel Oberle
Abstract:Clarity in semantics and a rich formalization of this semantics are important requirements for ontologies designed to be deployed in large-scale, open, distributed systems such as the envisioned Semantic Web. This is especially important for the description of Web Services, which should enable complex tasks involving multiple agents. As one of the first initiatives of the Semantic Web community for describing Web Services, OWL-S attracts a lot of interest even though it is still under development. We identify problematic aspects of OWL-S and suggest enhancements through alignment to a foundational ontology. Another contribution of our work is the Core Ontology of Services that tries to fill the epistemological gap between the foundational ontology and OWL-S. It can be reused to align other Web Service description languages as well. Finally, we demonstrate the applicability of our work by aligning OWL-S’ standard example called CongoBuy.

Id:ZieglerICSNW2004.pdf
Title:User-Specific Semantic Integration of Heterogeneous Data: The SIRUP Approach
Creator:Patrick Ziegler, Klaus R. Dittrich
