Semantic Web and E-Tourism (information science)

Introduction

Offering tourist services on the Internet has become a great business over the past few years. Heung (2003) revealed that approximately 30% of travelers use the Internet for reservation or purchase of travel products or services.

Classic sites of tourist agencies enable users to view and search for certain destinations and topic and pay for vacation packages. At a higher level of sophistication are tourism Web portals, which integrate the offers of many tourist agencies and enable searching from one point on the Web. Still, when using this kind of systems one is forced to spend a lot of time analyzing Web content with destinations that match his/her wishes. This problem is identified by Hepp, Siorpaes and Bachlechner (2006) as the “needle in the haystack” problem.

Applying artificial intelligence (AI) techniques in E-tour-ism could help resolve this problem by providing:

1. Data that are semantically enriched, structured, and thus represented in a machine readable form;

2. Easy integration of tourist sources from different applications;

3. Personalization of sites: the content can be created according to the user profile;

4. Improved system interactivity.

As an example of usingAI in e-tourism, we present Travel Guides—a prototype system that offers tourists complete information about numerous destinations. They can search destinations by using several criteria (e.g., accommodation type, food service, budget, activities during vacation, and user interests: sports, shopping, clubbing, art, museum, monuments, etc.). He/She can also read about the weather forecast and events in the destination.


In a way, Travel Guides complements traditional information systems of tourist agencies. These systems require a lot of maintenance effort in order to keep the huge amount of data about tourist destinations up-to-date.

Travel Guides is created to minimize the user’s input and his/her need to filter information. It shows how usage of se-mantically enriched data in a machine readable form can

• Increase interoperability in the area of tourism,

• Decrease maintenance efforts of tourist agents, and

• Offer tourists a better service.

Nowadays, there arejust a few e-tourism systems that use AI techniques. We briefly discuss them in the next section. In this article, we explain why it would be good to use such techniques and how Travel Guides does it. Specifically, using

Semantic Web technologies in the area of tourism can improve already existing systems (which are mostly available online) that do not use Semantic Web techniques yet. Likewise, the Semantic Web approach can help decrease the maintenance efforts required for existing e-tourism systems and ease the process of searching for vacation packages.

Travel Guides was initially developed as a large-scale expert system. Over time, it has evolved into a modern Semantic Web application.

background

According to Aichholzer, Spitzenberger and Winkler (2003), e-tourism comprises electronic services which include:

• Information services (e.g., destination, hotel information);

• Communication services (e.g., discussion forum);

• Transaction services (e.g., booking).

Transaction services are offered at many places on the Web, such as Expedia, Travelocity, and so forth. These Websites include some of the information services, but for complete details about certain destination (e.g., activities, climate, monuments, and events) one must search for other sources. Some Websites even help in planning the whole itinerary (e.g., HomeAndAbroad). Apparently, there is an “information gap” between these online services, and no interoperability. Semantic Web technologies can be used to overcome this problem and thus increase the quality of e-tourism.

Cunningham (2002) presents GATE (General Architecture for Text Engineering) as an infrastructure for developing and deploying software components that process human language. It can annotate documents and recognize concepts such as: locations, persons, organizations and dates. GATE can annotate documents with respect to a particular ontology. Some of the recently built Knowledge Management Platforms, like KIM (Popov, Kiryakov, Ognyanoff, Manov, & Kirilov, 2004), use GATE for information extraction and retrieval.

Similar to other AI technologies, Semantic Web is not frequently used in real-time tourism applications. Integrating AI tools into mainstream applications can result in benefits to both sides (Djuric, Devedzic & Gasevic, 2007). Standard organizations like the Internet Engineering Task Force and the World Wide Web Consortium (W3C) are making major efforts at developing languages for sharing meaning (Shadbolt, Berners-Lee & Hall, 2006). Speaking at the WWW2006 conference in Edinburgh in May 2006, W3C director Tim Berners-Lee pointed out that Semantic Web has all the standards and technologies it needs to succeed and that it was time for Web developers and content producers to start using semantic languages in addition to HTML (Bennett, 2006).

Cardoso (2006a) addresses the lack of standards in the tourism domain: the prices for tourism activities are expressed in different currencies; the time units also do not follow the standards. He argues that use of Semantic Web and ontologies could overcome this problem. In Cardoso (2006b) he describes the ontology developed to achieve integration and interoperability through the use of a shared vocabulary and meanings for terms with respect to other terms in the area of tourism. His system creates vacation packages dynamically using previously annotated data in respect to the ontology. This is performed with a service that builds itinerary by combining user preferences with flights, car rentals, hotel, and activities in a single price. Similar to this, Jakkilinki, Georgievski and Sharda (2007) presents a tool for tour planning that is intelligent in the meaning that it generates travel plans by matching user preferences and available tourist offers from different travel agents in respect to the ontology which enables reasoning.

To use Semantic Web in e-tourism, two approaches could be applied. One is to make applications from scratch, based on the existing standards. The other one is to enrich already existing content with annotations based on ontology. The first approach is not cost-effective for tourist agencies. Although the second approach sounds more reasonable, it seems that there are not enough data in the domain of tourism on the Web. Hepp et al. (2006) made a research in this field using a sample of 100 accommodations in Austria. Their results showed that neither some of the hotels had their Web pages, nor the biggest Austrian portal for e-tourism (Tiscover) had any information about them. An additional problem they noticed was the incompleteness of the details such as the availability of the accommodation and the prices.

Many e-tourism portals store their data internally, and not on the Web. This means that even a perfect annotation of the Web content would not be sufficient enough; hence, it is limited to persistently published information (Hepp, 2006). To exceed this problem Stojanovic, Stojanovic and Volz (2002) developed a mapping mechanism for migrating relational database schemas into ontologies in order to form the conceptual backbone for metadata annotations which are automatically created from the database. A better approach would be to use Semantic Web services, for example, Web Service Modeling Ontology-WSMO (Roman et al., 2005) or OWL-based Web service ontology – OWL-S (Smith & Alesso, 2005).

Dell’erba, Fodor, Hopken, and Werthner (2005) present the Harmonize project that integrates Semantic Web technologies and merge tourist electronic markets using ontology as a mediator. Their ontology was taken over by the E-tourism Working Group (2004) at Digital Enterprise Research Institute (DERI). This group plans to develop an advanced e-tourism Semantic Web portal, which will connect the customers and virtual travel agents. This portal could be of importance to the travel industry in Austria, whereas for the rest of the world it could be an example of using Semantic Web technologies in a real business system.

In 2001, the industry tried to address the interoperability issue by forming a consortium called the Open Travel Alliance. OTA is producing XML specifications (schemas) for messages to be exchanged between the trading partners, for example, availability checking, booking, rental, reservation, query services, insurance, etc. The precondition for this improvement method to succeed is that each travel agent’s application can produce and consume OTA-compliant messages.

Dogac et al. (2004) present the SATINE project as a peer-to-peer network that enables peers to deploy their se-mantically-enriched travel Web services and allows others to discover these services semantically.

In addition to attempts to semantically enrich tourism sources on the Internet, it has been very popular to develop Location Based Tourism Systems (LBTS). LBTS are computerized systems that depend on an automated location of a target which either deliver or collect information. Currently, LBTS applications are being used by mobile phones, iPods, and PDAs (Hawking et al., 2005). LBTS provide search for hotels and ATM machines near by the user’s current location and additional information when the user visits a city for the first time. An example of such a system is “Mobility Agent” (Edwards, Blythe, Scott & Weihong-Guo, 2006). This system delivers Internet-based travel and tourism-related services through fixed and mobile devices. Intelligent agent technology (Devedzic, 2003) was used to provide European visitors the dynamic, mobile, personalized, location-based information and services, especially related to travel in complex urban environments. Kanellopoulos and Kotsiantis (2006, p. 86) predict that “in a collaborative travel environment, agents will be essential in addressing the issues of security, negotiation, personalisation and Web Service procurement.”

TRAVEL GUIDES

Travel Guides is a Semantic Web portal in the area of tourism. Designed with the idea to gather all vacation packages from different tourist agencies, this system is built to help searching for a “perfect” vacation package. It is also personalized, so that its content is adapted to the user regarding his/her interests and activities. It is an example of using Semantic Web to increase the interoperability, provide better interaction, and intelligent reasoning in the domain of tourism.

Requirements for an E-Tourism Intelligent Portal

To make a tourism portal capable of intelligent reasoning, it is necessary to:

• Build some initial knowledge in the system,

• Introduce user profiles, and

• Maintain the knowledge automatically during the user’s interaction with the system.

That way, the knowledge collected in a machine-readable form reduces the user’s input and improves search.

In addition, the portal should be able to gather useful Web content and extract information of potential interest to the user.

In order to produce knowledge in a machine-readable form, it is necessary to develop and use a set of ontologies to represent all important concepts and their relations. Ontologies also enable knowledge sharing and reuse between applications, and development of intelligent Web services by using Semantic Web technologies and tools.

In case of search-intensive applications like tourism portals, ontologies bring up an important feature of semantic search. Unlike keyword-based syntactic search, ontology-supported semantic search returns better-quality information, because of the possibility to find the pages that contain semantically similar albeit possibly syntactically different text. A related concept is that of semantic interoperability. It is best understood by contrasting it to syntactic interoperability: syntactic interoperability is about parsing the data, while semantic interoperability means using ontologies to define mappings between data and known concepts. For example, a tourist agency may use the term “day trip” on its Web site, whereas another one may use “1-day excursion.” Ontology from the domain of tourism may be used to make the equivalence mapping between the two terms, thus enabling the portal to treat the related pieces of information equally.

User Profiles

Personalization implies adapting content to a particular user and enhances the interactivity of the system. User profiles are based on the data that the system has about the user. These data are provided in two ways:

1. The user is willing to fill the forms where he/she inputs data about him/herself: gender, birth date, social data, address, profession, education, languages, interests and activities, budget, or visited destinations (cities, countries).

2. The system collects data about the user’s interests and preferences while the user is visiting the portal and reading about or searching for vacation packages. Each time the user clicks on some of the vacation package details, the system stores his/her action in a database.

With the user profile, the system can adapt its look- and-feel and, more importantly, its content to the user. For example, the results of search for a destination (based on the requested criteria) can be additionally filtered according to the user profile, Figure 1. Each destination is identified by its latitude and longitude. When the system is aware of the user’s location, the search results could be sorted by destinations that are physically closer to the user.

Whenever possible, the user profile extension and maintenance is performed dynamically by the system itself without the user’s explicit intervention, using specific heuristics. There is a need to “observe” the user constantly and to use a built-in reasoner to infer the user’s preferences and intentions from the observations, that is, to create and maintain a valid user profile based on his/her behavior.

Figure 1. An adaptive portal using the user’s profile

 An adaptive portal using the user's profile

The Portal Architecture

Figure 2 depicts the architecture of the Travel Guides portal for tourism management. The User interface component collects data from the user and sends them to the Controller. Controller manages the requests from the interface and fires appropriate actions.

The User Manager takes care of the user data. It uses the User DAO (Data Access Object) to store or fetch the needed data from/to a database. These data are user details that are not subject to frequent change and are not important for determining the user profile: the username, password, first name, last name, address, birth date, phone and e-mail. The User Profile Expert is aware of the User ontology and also of the User profile knowledge base that contains instances of classes and relations from the User ontology.

The Travel Manager is responsible for fetching, storing and updating the data related to vacation packages. Storing and retrieving data from the database is performed by the Vacation Package DAO. The data stored in the database are those that are subject to frequent changes and are not important in the process of reasoning: start date, end date, prices (accommodation price, food service price, transport price), benefits, discounts, and documents that contain textual descriptions with details about the vacation packages. Some of these data are used in search queries as constraints. Actually, retrieving a “perfect” vacation package is performed in two steps:

1. Matching the user’s wishes to certain destinations—the user profile is matched with certain types of destinations. This is performed by the Travel Offer Expert and the World Expert.

2. The list of destinations retrieved in the first step is filtered using the constraints the user may want to set (e.g., the start/end dates of the vacation). The filtering is done by the Vacation Package DAO.

The Travel Manager aggregates the Tourist Offer Expert and World Expert components, which include inference engines. These inference engines are aware of the Travel and World ontology, respectively. Their main role is to validate the content in the knowledge bases.

The System Scheduler is running daily at scheduled times. It takes data about the type of destinations that the user has visited and updates his/her user profile. Those data are fetched from the logged user’s activities in the relational database, semantically processed and stored in a temporal knowledge base. After the temporal knowledge base is validated, it is merged with an already existing user-profile knowledge base. This validation is performed by an inference engine built inside the User Profile Expert.

A similar process is performed to update tourist offers in the knowledge base. This happens when a travel agent updates data about new vacation packages, as well as when the portal administrator updates the World knowledge base (see Figure 2).

The relations and restrictions stored in Travel Guides ontologies represent the core of this system. Ontologies allow machine-supported travel-related data interpretation and integration (Kanellopoulos, Kotsiantis, & Pintelas, 2006).

The World ontology contains concepts and relations from the real world: geographical terms, locations with coordinates, land types, time and date, time zone, currency, languages, and all other terms that are expressing some concepts that are in a way related to tourism or tourists, but not to vacation packages that could be offered by tourist agencies. Because Travel Guides is intended to support semantic annotation, indexing, and retrieval of documents, this ontology is also meant to contain the general concepts necessary for expressing semantic annotation, indexing, and retrieval.

The User ontology is meant to contain data about the users-the travelers who visit the Travel Guides portal. This ontology describes user interests and activities, age groups, favorite travel companies, and other data related to user profiles. Each user can have one or more user profiles, depending on his/her behavior while visiting the portal, and also depending on the data he/she has provided to the system.

Figure 2. The architecture of the Travel Guides portal for tourism management

The architecture of the Travel Guides portal for tourism management

The Travel (Tourism) ontology includes all terms being specific to types of vacations, traveler types, and vacation packages offered in tourist agencies and being important to travelers, like the type of accommodation, food service type, transport service, type of room in a hotel, and the like. It is an ontology that makes an indirect connection between users and destinations.

The ontologies are implemented in OWL (Antoniou & Harmelen, 2004) and developed using the Protege tool (Hor-ridge, Knublauch, Rector, Stevens, & Wroe, 2004).

Travel Guides is designed so that all users contribute to the creation and updating of the knowledge base. Each group is contributing to the knowledge base in its own way:

• End users (tourists) feed the system with their personal data, which then get analyzed by the system in order to create/update user profiles. The system also uses logged data about each user’s activities (mouse clicks) when updating the user’s profile.

• Tourist agents are creating vacation packages and similar offers in tourist agencies. They feed and update the knowledge base with new information about destinations, arrangements, excursions, etc. To do this, they fill a form about a destination which includes fields like name, accommodation, hotel name, parking provided by the hotel, swimming pool inside the hotel, activities, transport service, etc.

• Portal administrators—they mediate the knowledge base updates with destinations not covered by the tourist agencies. The idea is that tourist agents can then use those updates as the basis for creating new vacation packages and other tourist offers. To alleviate the creation, extensions, and maintenance of this part of the knowledge base, Travel Guides exploits Word-Net (Fellbaum, 1998), an open-source knowledge base that describes a number of concepts and terms. Some of these were copied to the Travel Guides knowledge base to avoid manual entrance of static (permanent) data about various destinations. A special tool is developed for copying instances of WordNet classes (concepts) and relations to instances of classes and relations of the Travel Guides World ontology.

future trends

There is room for future improvements in several directions:

• The use of the Semantic Web services.

• The system is designed to possibly include information about all places in the world and all vacation packages. To achieve this, it is necessary to feed it and save a great amount of data. The system’s prototype described here includes a limited collection of vacation packages, which must be extended and updated.

• Simplification of the document annotation process.

Currently, it requires the knowledge and understanding of the GATE.

• Finally, Travel Guides supports only management of hotel accommodation. This should be expanded with hostels, campgrounds, and private apartments.

conclusion

Artificial intelligence can play a significant role in improving e-tourism portals. The use of Semantic Web technologies can increase interoperability in the area of tourism, although the agreement of all involved parties about using the standards is essential. Without this agreement, the only useful thing that could be achieved with annotation process would be to analyze sites with content about countries, cities, beaches, and so forth. Most of the data regarding vacation packages are internally stored in the tourism portals, and hence only the usage of intelligent Web services would help their retrieval and representation in a machine readable form. Standardization of the way all tourist services are representing the data would speed up the process of their integration. This would ease searching for tourist deals from one place. The integration of geographical data would also decrease efforts of tourist agents who are responsible to feed the system and keep data up-to-date. If these data would be centralized in a repository available to tourist agents, this would significantly decrease the maintenance efforts.

The Travel Guides system is built to solve the problem of distributed tourist sources and to help users find a “perfect” vacation package quickly. The system includes intelligent components that perform reasoning and have some built-in heuristics. It is based on a number of ontologies, which support the process of adding semantics to data. The data are semantically enriched, which means that they can be understood by machines. The system is also personalized in order to adapt its content to each user. The more a user visits the portal, the more the system “knows” about him/her.

As this system is built using the latest Java technologies for building Web applications widely used in existing online tourism information systems nowadays, this prototype system is to show that such mainstream systems can benefit from integrating the Semantic Web components.

KEY TERMS

Dynamic Packaging: The combination of different travel components, bundled and priced in real time, in response to the requests of the consumer or booking agent.

Intelligent Agents: Software elements which help the user find information of specific interest to him/her without their explicit assistance.

Intelligent Reasoning: The act of using reason to derive a conclusion from certain premises using a given methodology.

Location Based Services: Services that provide context-sensitive information based on the mobile user’s location.

Ontology: A controlled vocabulary that describes obj ects and the relations between them in a formal way, and has a grammar for using the vocabulary terms to express something meaningful within a specified domain of interest.

OWL-Based Web Service Ontology (OWL-S): An ontology which supplies Web service providers with a core set of constructs for describing the properties and capabilities of their Web services in unambiguous, computer-inter-pretable form.

Semantic Web Services: Self-contained, self-describing, semantically marked-up software resources that can be published, discovered, composed and executed across the Web in a task-driven semiautomatic way.

Web Service Modeling Ontology (WSMO): A data model that provides the conceptual underpinning and a formal language for semantically describing all relevant aspects of Web services in order to facilitate the automation of discovering, combining and invoking electronic services over the Web.

Web Portal: A Web site or service that offers a broad array of resources and services, such as e-mail, forums, search engines, and online shopping malls.

Next post:

Previous post: