Semantic MediaWiki in applied life science and industry: building an Enterprise Encyclopaedia - Open Source Software in Life Science Research

Biomedical Engineering Reference

In-Depth Information

Semantic MediaWiki and Linked Data Triple Store

working in parallel

Figure 16.7

16.3.13 Incremental indexing of enterprise

search

As a fi rst application, we created an XML connector from KnowIt

to our enterprise search engine. SPARQL queries to the wiki can

now be used to control which content should be indexed. Integration

with existing search engines is important as it allows content of the

wiki to be seamlessly available to users unfamiliar with the wiki. For

instance, this allows scientists to search for data sources that

contain terms such as 'gene sequencing' and 'proteins' and see a

resulting list of all relevant data sources internal and external to the

enterprise.

Although this application is possible using the basic query mechanism

from Semantic MediaWiki, SPARQL queries make it possible to

formulate elaborate fi lters on pages to update, and index them based on

modifi cation date or other criteria. Eventually, this mechanism will allow

us to fi lter out pages that should not be indexed, and will provide the

search engine complete semantic annotations when RDF content is

available. This allows a reduction of the impact of crawling on the

server, better use of network bandwidth and better control of what to

index [19].

Search WWH ::

Custom Search

Home