Semantic Systems. The Power of AI and Knowledge Graphs

Maribel Acosta · Philippe Cudré-Mauroux · Maria Maleshkova · Tassilo Pellegrini · Harald Sack · York Sure-Vetter (Eds.) LNCS 11702 15th International Conference, SEMANTiCS 2019 Karlsruhe, Germany, September 9–12, 2019 Proceedings Semantic Systems The Power of AI and Knowledge Graphs Lecture Notes in Computer Science 11702 Founding Editors Gerhard Goos Karlsruhe Institute of Technology, Karlsruhe, Germany Juris Hartmanis Cornell University, Ithaca, NY, USA Editorial Board Members Elisa Bertino Purdue University, West Lafayette, IN, USA Wen Gao Peking University, Beijing, China Bernhard Steffen TU Dortmund University, Dortmund, Germany Gerhard Woeginger RWTH Aachen, Aachen, Germany Moti Yung Columbia University, New York, NY, USA More information about this series at http://www.springer.com/series/7409 Maribel Acosta • Philippe Cudr é -Mauroux • Maria Maleshkova • Tassilo Pellegrini • Harald Sack • York Sure-Vetter (Eds.) Semantic Systems The Power of AI and Knowledge Graphs 15th International Conference, SEMANTiCS 2019 Karlsruhe, Germany, September 9 – 12, 2019 Proceedings Editors Maribel Acosta Karlsruhe Institute of Technology Karlsruhe, Germany Philippe Cudr é -Mauroux University of Fribourg Fribourg, Switzerland Maria Maleshkova University of Bonn Bonn, Germany Tassilo Pellegrini St. P ö lten University of Applied Science St. P ö lten, Austria Harald Sack FIZ Karlsruhe – Leibniz Institute for Information Infrastructure Eggenstein-Leopoldshafen, Germany York Sure-Vetter Karlsruhe Institute of Technology Karlsruhe, Germany ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notes in Computer Science ISBN 978-3-030-33219-8 ISBN 978-3-030-33220-4 (eBook) https://doi.org/10.1007/978-3-030-33220-4 LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI The Editor(s) (if applicable) and The Author(s) 2019. This book is an open access publication. Open Access This book is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this book are included in the book's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the book's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a speci fi c statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional af fi liations. This Springer imprint is published by the registered company Springer Nature Switzerland AG The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland Preface SEMANTiCS 2019 took place during September 9 – 12, 2019, in Karlsruhe, Germany. SEMANTiCS offers a forum for the exchange of latest scienti fi c results in semantic systems and complements these topics with new research challenges in areas like data science, machine learning, logic programming, content engineering, social computing, Semantic Web, and many more. This year was the 15th edition of the SEMANTiCS conference series, which has developed into an internationally visible and professional academic event. Participants learn from top researchers and industry experts about emerging trends and topics in the wide area of semantic computing. The SEMANTiCS community is highly diverse ; attendees have responsibilities in interlinking areas such as arti fi cial intelligence, knowledge discovery and management, big data analytics, e-commerce, enterprise search, technical documentation, document management, business intelligence, and enterprise vocabulary management. This year the SEMANTiCS conference ’ s subtitle was “ The Power of AI and Knowledge Graphs, ” and especially welcomed submissions to the following hot topics: – Web Semantics and Linked (Open) Data – Enterprise Knowledge Graphs, Graph Data Management, and Deep Semantics – Machine Learning and Deep Learning Techniques – Semantic Information Management and Knowledge Integration – Terminology, Thesaurus, and Ontology Management – Data Mining and Knowledge Discovery – Reasoning, Rules, and Policies – Natural Language Processing – Data Quality Management and Assurance – Explainable Arti fi cial Intelligence – Semantics in Data Science – Semantics in Blockchain and Distributed Ledger Technologies – Trust, Data Privacy, and Security with Semantic Technologies – Economics of Data, Data Services, and Data Ecosystems We additionally issued calls for two special tracks: – Digital Humanities and Cultural Heritage – LegalTech Following the great success of SEMANTiCS 2018 in Vienna, we received 88 submissions. In order to properly provide high-quality reviews to these submissions, we set up a Program Committee (PC) comprising of 111 members to help us select the papers with the highest impact and scienti fi c merit. For each submission, at least three reviews were written independently from the assigned reviewers in a single-blind review process (author names are visible to reviewers, but reviewers stay anonymous). After all reviews were submitted, the PC chairs compared the reviews and discussed discrepancies and different opinions with the reviewers to facilitate a meta-review and suggest a recommendation to accept or reject the paper. Overall, we accepted 20 full papers and 8 short papers from the 88 submissions which resulted in a full paper acceptance rate of 23%. The program of SEMANTiCS 2019 was structured as follows. In the main conference, the contributors of full papers including posters and industry talks gave their presentations in thematically grouped sessions. These presentations covered a broad palette on current trends and developments in semantic technologies. To support the knowledge transfer between the academic and industrial communities, scienti fi c papers and industry papers were grouped according to the following thematic sessions: – Semantic Information Management – Knowledge Discovery and Semantic Search – Knowledge Graphs – Knowledge Extraction – Natural Language Processing – Thesaurus and Ontology Management – Linked Data and Data Integration – Distributed Ledger Technologies – Smart Connectivity and Interlinking – Special Track: LegalTech – Special Track: Digital Humanities and Cultural Heritage – Special Track: Knowledge Organization and Application for Complex Industry Settings The Posters and Demos Track provided an opportunity to present late-breaking research results, smaller contributions, and innovative work in progress. 29 original submissions and 2 re-submissions from the research track were accepted to this track, selected with a peer-reviewing process from a total of 47 poster and demo submissions. The reviewing committee, which included 88 members, provided at least three reviews per submission. The accepted works have been published within the CEUR Workshop Proceedings series. Besides the scienti fi c track of the conference, a call for industry presentations was launched, which resulted in 47 submissions of which 37 were accepted for presentation in the industry track. Additionally, an exhibition took place where organizations presented their semantics-based products and services. Deliberate long breaks, in a well-suited venue, took place throughout the conference and social events provided excellent opportunities for networking with people interested in semantics-related topics from different disciplines and parts of the world. We are grateful to our keynote and invited speakers for sharing their ideas about the future development of knowledge management, new media, and semantic technologies with our attendees: vi Preface Keynote Speakers: – Michael J. Sullivan (Oracle): “ Hybrid Knowledge Management Architectures ” – Michel Dumontier (Maastricht University): “ Accelerating Biomedical Discovery with an Internet of FAIR Data and Services ” – Andy Boyd and Brendan Nielsen (Shell): “ High-grading Business Decisions through Semantic Technology ” – Valentina Presutti (Consiglio Nazionale delle Ricerche): “ Looking for Common Sense in the Semantic Web ” – Katja Hose (Aalborg University): “ Querying the Web of Data ” Invited Speakers: – Andreas Harth (Fraunhofer Institute): “ From Representing Knowledge to Representing Behaviour ” – Christian Dirschl (Wolters Kluwer): “ LegalTech – To whom it may concern ” Many thanks also go to all authors who submitted papers and of course to the PC who provided careful reviews in a quick turnaround time. Special thanks go to Christian Dirschl (Wolters Kluwer Germany) and Andreas Blumauer (Semantic Web Company) who organized all industry related activities. We also would like to thank Thomas Thurner and Martin Kaltenb ö ck from the Semantic Web Company for providing the organizational infrastructure and taking care of all the operational tasks. Additionally, we would also like to thank our local organization team Stefan Summesberger, Viviene Vetter, and Julia Holze, as well as all those helpful hands that are too many to name for supporting this year ’ s conference and turning it into a success. We would also like to thank our sponsors (i.a.o.): – Premium Sponsors: eccenca, PoolParty, FIZ Karlsruhe, and CAS – Gold Sponsors: Semiodesk, metaphacts, and i-views – Silver Sponsors: Siemens, Ontotext, Franz Inc., Allegrograph, Enterprise Knowl- edge, Deloitte, and HP Motion Content – Bronze and Research: CID, Fraunhofer IAIS, Bosch, inovex, Oracle, Pr ê t- à -LLOD, STI Innsbruck, GNOSS, Klarso, Ontopic, and SICK Special thanks also go to the partners of the conference who are: University of Basel, BID - Bibliothek & Information International, Cefriel, Connected Data London, Consiglo Nazionale delle Ricerche, Cyberforum, DBpedia, eccenca, FIZ Karlsruhe, GFWM, IBM, KIT - Karlsruhe Institute of Technology, TIB, University of Paderborn, University of Fribourg, Springer LNCS, Wolters Kluwer, and WU Vienna. Preface vii We hope that SEMANTiCS 2019 will provide you with new inspirations for your research and with opportunities for partnerships with other research groups, academic, and industrial participants. September 2019 Maribel Acosta Philippe Cudr é -Mauroux Maria Maleshkova Tassilo Pellegrini Harald Sack York Sure-Vetter viii Preface Organization Chairs Conference Chairs Harald Sack FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany York Sure Vetter Karlsruhe Institute of Technology, Germany Tassilo Pellegrini St. P ö lten University of Applied Sciences, Austria Research and Innovation Chairs Maribel Acosta Karlsruhe Institute of Technology, Germany Philippe Cudr é -Mauroux Universit é de Fribourg, Switzerland Special Track Chairs Sabrina Kirrane Institute for Information Business of WU Wien, Austria Victor de Boer Vrije Universiteit Amsterdam, The Netherlands Industry and Use Case Chairs Christian Dirschl Wolters Kluwer Germany, Germany Andreas Blumauer Semantic Web Company, Austria Poster and Demo Track Chairs Mehwish Alam FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany Ricardo Usbeck Paderborn University, Germany Workshop and Satellite Events Chairs Anna Lisa Gentile IBM Almaden Research Center, USA Irene Celino Cerfriel, Politecnico di Milano, Italy Proceedings Chairs Maria Maleshkova University of Bonn, Germany Tassilo Pellegrini St. P ö lten University of Applied Sciences, Austria Promotion Chairs Thomas Thurner Semantic Web Company, Austria Julia Holze AKSW, InfAI, Leipzig University, Germany Stefan Summesberger plantsome communication, Austria Local Chairs Thomas Thurner Semantic Web Company, Austria Vivien Vetter FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany Sponsoring Chair Stefan Summesberger plantsome communication, Austria Permanent Advisory Board S ö ren Auer Fraunhofer Institute for Intelligent Analysis and Information Systems, Germany Andreas Blumauer Semantic Web Company, Austria Tobias B ü rger BMW Group, Germany Christian Dirschl Wolters Kluwer Germany, Germany Victor de Boer Vrije Universiteit Amsterdam, The Netherlands Anna Fensel Semantic Technology Institute (STI) Innsbruck, Austria Dieter Fensel Semantic Technology Institute (STI) Innsbruck, Austria Mike Heininger GfWM Austria, Austria Sebastian Hellmann Institute of Applied Informatics e.V. at the University of Leipzig, Germany Ute John GfWM Germany, WissensWertSch ö pfung, Germany Martin Kaltenb ö ck Semantic Web Company, Austria Elmar Kiesling TU Wien, Austria Tassilo Pellegrini St. P ö lten University of Applied Sciences, Austria Axel Polleres Institute for Information Business of WU Wien, Austria Felix Sasaki DFKI, W3C Fellow, Germany Harald Sack FIZ Karlsruhe – Leibniz Institute for Information Infrastructure and Karlsruhe Institute of Technology (KIT), Germany Program Committee - Research and Innovation Track and Special Tracks Harith Alani The Open University Vito Walter Anelli Politecnico di Bari Luigi Asprino University of Bologna, STLab (ISTC-CNR) S ö ren Auer TIB, University of Hannover x Organization Chairs Nathalie Aussenac-Gilles IRIT, CNRS Sebastian Bader Fraunhofer-Institut f ü r Intelligente Analyse- und Informationssysteme IAIS Stefan Bischof Siemens AG Ö sterreich Carlos Bobed everis, NTT Data Loris Bozzato Fondazione Bruno Kessler Carlos Buil-Aranda Universidad T é cnica Federico Santa Mar í a Paul Buitelaar Insight Centre for Data Analytics, National University of Ireland Galway Irene Celino Ceriel Davide Ceolin Vrije Universiteit Amsterdam Pierre-Antoine Champin Liris, Universit é Claude Bernard Lyon1 Vinay Chaudhri SRI International, USA Ioannis Chrysakis FORTH-ICS, Greece Ioana-Georgiana Ciuciu Babes-Bolyai University Oscar Corcho Universidad Polit é cnica de Madrid Gianluca Correndo University of Southampton Enrico Daga The Open University Ben De Meester Ghent University Elena Demidova L3S Research Center Sylvie Despres Laboratoire d ’ Informatique M é dicale et de BIOinformatique (LIM&BIO) Chiara Di Francescomarino Fondazione Bruno Kessler-Irst Stefan Dietze GESIS - Leibniz Institute for the Social Sciences Anastasia Dimou Ghent University Jens D ö rpinghaus Fraunhofer Mauro Dragoni Fondazione Bruno Kessler-Irst Anca Dumitrache Vrije Universiteit Amsterdam J é r ô me Euzenat Inria, University of Grenoble Alpes Victoria Eyharabide STIH Laboratory, Sorbonne University Michael F ä rber University of Freiburg Catherine Faron Zucker Universit é Nice Sophia Antipolis Said Fathalla University of Bonn Ingo Feinerer University of Applied Sciences Wiener Neustadt Javier D. Fern á ndez Vienna University of Economics and Business Agata Filipowska Poznan University of Economics Nuno Freire INESC-ID Roberto Garcia Universitat de Lleida Ra ú l Garc í a-Castro Universidad Polit é cnica de Madrid Daniel Garijo Information Sciences Institute Annalisa Gentile IBM Jose Manuel Gomez-Perez ExpertSystem Michael Granitzer University of Passau Alasdair Gray Heriot-Watt University Paul Groth University of Amsterdam Peter Haase metaphacts Organization Chairs xi Benjamin Heitmann RWTH Aachen University Lars Heling Karlsruhe Institute of Technology Eelco Herder Radboud University Pieter Heyvaert IDLab Ghent University – imec, Belgium Rinke Hoekstra University of Amsterdam Geert-Jan Houben Delft University of Technology Zhisheng Huang Vrije Universiteit Amsterdam Shimaa Ibrahim Bonn University Marc Jacobs Fraunhofer Tobias K ä fer Karlsruhe Institute of Technology Lucie-Aim é e Kaffee University of Southampton Elias K ä rle STI-Innsbruck Tomi Kauppinen Aalto University School of Science Dimitris Kontokostas University of Leipzig Efstratios Kontopoulos Information Technologies Institute, Centre for Research & Technology – Hellas, Greece Tobias Kuhn Vrije Universiteit Amsterdam Christoph Lange Fraunhofer FIT, Germany Maxime Lefran ç ois MINES Saint-Etienne Isaac Lera UIB Steffen Lohmann Fraunhofer Vanessa Lopez IBM Vincent Lully Sorbonne Universit é , France Nicole Merkle FZI Forschungszentrum Informatik am KIT Lyndon Nixon MODUL Technology GmbH Leo Obrst MITRE Jan Oevermann University of Bremen, German Research Center for Arti fi cial Intelligence (DFKI) Harshvardhan Jitendra Pandit ADAPT, Trinity College Dublin Heiko Paulheim University of Mannheim Catia Pesquita LaSIGE, Universidade de Lisboa Jasmin Pielorz Austrian Institute of Technology J ę drzej Potoniec Poznan University of Technology C é dric Pruski Luxembourg Institute of Science and Technology Filip Radulovic S é page in Paris, France Alessandro Raganato University of Helsinki Artem Revenko Semantic Web Company GmbH Giuseppe Rizzo LINKS Foundation Oscar Rodr í guez Rocha Inria Anisa Rula University of Milano-Bicocca Marta Sabou Vienna University of Technology Vadim Savenkov Vienna University of Economics and Business (WU) Stefan Schlobach Vrije Universiteit Amsterdam Pavel Shvaiko Informatica Trentina Ruben Taelman Ghent University – imec xii Organization Chairs Sanju Tiwari Ontology Engineering Group Konstantin Todorov LIRMM, University of Montpellier Riccardo Tommasini Politecnico di Milano J ü rgen Umbrich Vienna University of Economy and Business (WU) Victoria Uren Aston University Mathias Uslar OFFIS Herbert Van De Sompel Data Archiving Networked Services Frank Van Harmelen Vrije Universiteit Amsterdam Maria Esther Vidal Universidad Simon Bolivar Joerg Waitelonis yovisto GmbH Shenghui Wang OCLC Research Ziqi Zhang Shef fi eld University Additional Reviewers Wazed Ali TIB Imran Asif Heriot Watt University Javad Chamanara L3S Andrea Cimmino Arriaga Universidad de Sevilla Diego Collarana IAIS Fraunhofer Mirette Elias University of Bonn Simon Gottschalk L3S Prashant Khare The Open University Allard Oelen TIB Nicolas Tempelmeier L3S Organization Chairs xiii Contents Web Semantics and Linked (Open) Data Usage of Semantic Web in Austrian Regional Tourism Organizations . . . . . . 3 Christina Lohvynenko and Dietmar Nedbal Test-Driven Approach Towards GDPR Compliance. . . . . . . . . . . . . . . . . . . 19 Harshvardhan J. Pandit, Declan O ’ Sullivan, and Dave Lewis Linked Data Supported Content Analysis for Sociology . . . . . . . . . . . . . . . . 34 Tabea Tietz and Harald Sack LinkedSaeima: A Linked Open Dataset of Latvia ’ s Parliamentary Debates . . . 50 Uldis Boj ā rs, Roberts Dar ģ is, Uldis Lavrinovi č s, and P ē teris Paikens MusicKG: Representations of Sound and Music in the Middle Ages as Linked Open Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Victoria Eyharabide, Vincent Lully, and Florentin Morel Machine Learning and Deep Learning Techniques Improving NLU Training over Linked Data with Placeholder Concepts . . . . . 67 Tobias Schmitt, Cedric Kulbach, and York Sure-Vetter Using Weak Supervision to Identify Long-Tail Entities for Knowledge Base Completion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Yaser Oulabi and Christian Bizer Semantic Information Management and Knowledge Integration Evaluating Generalized Path Queries by Integrating Algebraic Path Problem Solving with Graph Pattern Matching . . . . . . . . . . . . . . . . . . . . . . 101 Abhisha Bhattacharyya, Ilya Baldin, Yufeng Xin, and Kemafor Anyanwu Building a Conference Recommender System Based on SciGraph and WikiCFP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 Andreea Iana, Steffen Jung, Philipp Naeser, Aliaksandr Birukou, Sven Hertling, and Heiko Paulheim V4Ann: Representation and Interlinking of Atom-Based Annotations of Digital Content . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124 Georgios Meditskos, Stefanos Vrochidis, and Ioannis Kompatsiaris RSP-QL H : Enabling Statement-Level Annotations in RDF Streams . . . . . . . . 140 Robin Keskis ä rkk ä , Eva Blomqvist, Leili Lind, and Olaf Hartig Terminology, Thesaurus and Ontology Management The Semantic Asset Administration Shell . . . . . . . . . . . . . . . . . . . . . . . . . . 159 Sebastian R. Bader and Maria Maleshkova Taxonomy Extraction for Customer Service Knowledge Base Construction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175 Bianca Pereira, Cecile Robin, Tobias Daudert, John P. McCrae, Pranab Mohanty, and Paul Buitelaar An Ontology Alignment Approach Combining Word Embedding and the Radius Measure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 Molka Tounsi Dhouib, Catherine Faron Zucker, and Andrea G. B. Tettamanzi Ontology Design Rules Based on Comparability via Particular Relations . . . . 198 Philippe A. Martin, Olivier Corby, and Catherine Faron Zucker From Monolingual to Multilingual Ontologies: The Role of Cross-Lingual Ontology Enrichment . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 Shimaa Ibrahim, Said Fathalla, Hamed Shariat Yazdi, Jens Lehmann, and Hajira Jabeen MELT - Matching EvaLuation Toolkit. . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 Sven Hertling, Jan Portisch, and Heiko Paulheim Data Mining and Knowledge Discovery Interaction Network Analysis Using Semantic Similarity Based on Translation Embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249 Awais Manzoor Bajwa, Diego Collarana, and Maria-Esther Vidal CACAO: Conditional Spread Activation for Keyword Factual Query Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256 Edgard Marx, Gustavo Correa Publio, and Thomas Riechert Fine-Grained Named Entity Recognition in Legal Documents . . . . . . . . . . . . 272 Elena Leitner, Georg Rehm, and Julian Moreno-Schneider Extracting Literal Assertions for DBpedia from Wikipedia Abstracts . . . . . . . 288 Florian Schrage, Nicolas Heist, and Heiko Paulheim xvi Contents Towards a Scalable Semantic-Based Distributed Approach for SPARQL Query Evaluation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295 Gezim Sejdiu, Damien Graux, Imran Khan, Ioanna Lytra, Hajira Jabeen, and Jens Lehmann Automatic Facet Generation and Selection over Knowledge Graphs. . . . . . . . 310 Leila Feddoul, Sirko Schindler, and Frank L ö f fl er Knowledge Graph Exploration: A Usability Evaluation of Query Builders for Laypeople . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 326 Emil Kuric, Javier D. Fern á ndez, and Olha Drozd QUANT - Question Answering Benchmark Curator . . . . . . . . . . . . . . . . . . 343 Ria Hari Gusmita, Rricha Jalota, Daniel Vollmers, Jan Reineke, Axel-Cyrille Ngonga Ngomo, and Ricardo Usbeck Simple-ML: Towards a Framework for Semantic Data Analytics Workflows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 359 Simon Gottschalk, Nicolas Tempelmeier, G ü nter Kniesel, Vasileios Iosifidis, Besnik Fetahu, and Elena Demidova Semantics in Blockchain and Distributed Ledger Technologies Incorporating Blockchain into RDF Store at the Lightweight Edge Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369 Anh Le-Tuan, Darshan Hingu, Manfred Hauswirth, and Danh Le-Phuoc Verifying the Integrity of Hyperlinked Information Using Linked Data and Smart Contracts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376 Christoph Braun and Tobias K ä fer Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 391 Contents xvii Web Semantics and Linked (Open) Data Usage of Semantic Web in Austrian Regional Tourism Organizations Christina Lohvynenko and Dietmar Nedbal ( & ) University of Applied Sciences Upper Austria, Wehrgrabengasse 1-3, 4400 Steyr, Austria christina-lohvynenko@gmx.at, dietmar.nedbal@fh-steyr.at Abstract. Tourism is one of the most important economic sectors in Austria. Given the high internationality degree of Austrian visitors, the websites of regional tourism organizations (RTOs) are an essential source of information. A state-of-the-art tourism website should include semantic markup for touristic topics so that search engines and other intelligent software applications can access and understand the presented data. This paper empirically studies the usage of Semantic Web formats, ontologies and topics relevant for tourism on the websites of all 137 Austrian RTOs. Results show that 59% of the RTOs use semantic markup. Most regions adhere to the recommendations of leading search engines utilizing ontologies such as Schema.org and the formats Microdata and JSON-LD. While most semantic markup incorporates basic information (e.g. navigation, addresses, corporate data), only few Austrian RTOs annotate touristic relevant topics that would contribute to unlock the full potential of the Semantic Web such as regional events, accommodations, blog posts, images or social media. Keywords: Semantic Web Regional tourism organizations Survey Austria 1 Introduction With nearly 45 million resident and non-resident guests in 2018, tourism is one of the most important Austrian economic sectors [1]. In the last years, the tourism and leisure industry contributed around 16% to the Austrian gross domestic product through direct and indirect effects [2]. Even in international comparison, the country occupies an important place among the top 20 tourism destinations [3]. The tourism regions, which are in the midst of the hierarchical organization of this industry in Austria, contribute signi fi cantly to the promotion of certain tourism destinations and to addressing a broad target group [4]. These regional tourism organizations (RTO) are also given an important role in the possible weakening of dependence on international online travel agencies (OTA), which dominate the tourism market. Given the growth of the Internet usage and due to the high internationality degree of Austrian visitors, the websites of tourism providers are becoming increasingly important. A state-of-the-art website that implements innovative web technologies is therefore essential [5, 6]. © The Author(s) 2019 M. Acosta et al. (Eds.): SEMANTiCS 2019, LNCS 11702, pp. 3 – 18, 2019. https://doi.org/10.1007/978-3-030-33220-4_1 The use of Semantic Web and Linked Data has long been a standard in website optimization and intends to make important content-bearing elements of web pages machine-readable by means of semantic markup so that access to data for search engines and other intelligent software applications is facilitated. The semantic anno- tation of structured data to a website is one of the most common search engine opti- mization practices, which is also recommended by leading search engines. Thus, it can increase the online visibility of the web page and the sales fi gures on the Internet [7 – 9]. However, the empirical analysis of the use of Semantic Web by the hotel websites in Austria has shown that the use of direct providers in contrast to OTAs is very moderate and often fl awed [6, 10]. Such a weak use of structured data in the hotel industry suggests that the Semantic Web has not yet become a standard in Austria ’ s tourism industry. With the RTOs playing an important role in the Austrian tourism, the current paper aims to elucidate the usage status of the Semantic Web among these websites. It fi rst discusses the background and related work on the use of structured data in tourism in Sect. 2. Further, the results on an empirical investigation are reported. For this purpose, the selection of the examination objects and preparation of the data for analysis are described in Sect. 3. The results of the evaluation are presented in Sect. 4, followed by a discussion (Sect. 5). Finally, Sect. 6 provides concluding remarks. 2 Background and Related Work One of the most important communication channels of a tourism organization is the website, which should adhere the current state-of-the-art. In this context it has been recognized that innovative software providing interoperability through ontologies is critical for further innovation in the tourism industry [11]. Although there has been progress in the last ten years, a recent study highlights the still current and growing importance of semantics and ontologies in tourism. The authors further state that academic research in these disciplines is still in its infancy [12]. Website owners and content managers of tourism regions face several challenges when attempting to semantically enrich data on their website. First of all the selection of the appropriate vocabulary, format and content is not a trivial task. In addition to common vocabularies independent of the domain, several domain-speci fi c ontologies for tourism have also been developed which makes it dif fi cult to select the most suitable and, at the same time, a future-proof vocabulary. The Linked Open Vocabularies project, for example, provides a central information point about well-documented vocabularies [13]. The constantly growing website lists 660 high quality vocabularies as of Feb. 2019. Measured by the number of vocabularies that reuse the vocabulary, the most popular ontologies are Dublin Core Metadata Terms (dcterms), Dublin Core Metadata Element Set (dce), Friend of a Friend vocabulary (foaf), A vocabulary for annotating vocabulary descriptions (vann), Simple Knowledge Organization System (skos), Creative Commons Rights Expression Language (cc), SemWeb Vocab Status ontology (vs) and Schema.org vocabulary (schema) [14]. The problem of common vocabularies often lies in the level of precision over domain-speci fi c ontologies. For example, until version 3.0, Schema.org lacked the ability to describe the number of 4 C. Lohvynenko and D. Nedbal