From Semi-automated to Automated Methods of Ontology Learning from Twitter Data



Alajlan, Saad ORCID: 0000-0001-7192-820X, Coenen, Frans ORCID: 0000-0003-1026-6649 and Mandya, Angrosh (2020) From Semi-automated to Automated Methods of Ontology Learning from Twitter Data.

ic3kBook_alajlan.pdf - Author Accepted Manuscript

Abstract

This paper presents four different mechanisms for ontology learning from Twitter data. The learning process involves the identification of entities and relations in a specified Twitter data set, which are then used to produce an ontology. The first two methods considered, the Stanford and GATE based ontology learning frameworks, are both semi-automated methods for identifying the relations in the desired ontology. Although the two frameworks effectively create an ontology-supported knowledge resource, they share a particular disadvantage: the time-consuming and cumbersome task of manually annotating a relation extraction training data set. As a result, two other ontology learning frameworks are proposed: one that uses regular expressions, which reduces the resource required, and one that combines Shortest Path Dependency parsing and Word Mover’s Distance to fully automate the process of creating relation extraction training data. All four are analysed and discussed in this paper.
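
As a rough illustration of the regular-expression idea mentioned in the abstract, the following minimal Python sketch pulls (subject, relation, object) triples out of tweet text. It is not the authors' implementation: the relation labels, the patterns in `PATTERNS`, and the function `extract_relations` are all hypothetical, chosen only to show how hand-written patterns could stand in for manually annotated training data.

```python
import re

# Hypothetical relation patterns (assumptions, not taken from the paper).
# Each regex exposes named groups `subj` and `obj` for the two entities.
PATTERNS = {
    "located_in": re.compile(
        r"(?P<subj>[A-Z]\w+) is (?:located )?in (?P<obj>[A-Z]\w+)"
    ),
    "works_for": re.compile(
        r"(?P<subj>[A-Z]\w+) works (?:for|at) (?P<obj>[A-Z]\w+)"
    ),
}


def extract_relations(tweet):
    """Return a list of (subject, relation, object) triples found in `tweet`."""
    triples = []
    for label, pattern in PATTERNS.items():
        # finditer yields every non-overlapping match of this pattern.
        for match in pattern.finditer(tweet):
            triples.append((match.group("subj"), label, match.group("obj")))
    return triples


if __name__ == "__main__":
    example = "Alice works at Acme and Liverpool is in England"
    for triple in extract_relations(example):
        print(triple)
```

Triples produced this way could, under these assumptions, serve as automatically labelled examples for a relation extraction model, at the cost of only covering phrasings the patterns anticipate.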

Item Type: Conference or Workshop Item (Unspecified)
Uncontrolled Keywords: Networking and Information Technology R&D (NITRD)
Depositing User: Symplectic Admin
Date Deposited: 07 Sep 2020 09:33
Last Modified: 14 Mar 2024 21:44
DOI: 10.1007/978-3-030-66196-0_10
URI: https://livrepository.liverpool.ac.uk/id/eprint/3099941