Ontology Learning from Twitter Data

Alajlan, Saad ORCID: 0000-0001-7192-820X, Coenen, Frans ORCID: 0000-0003-1026-6649, Konev, Boris ORCID: 0000-0002-6507-0494 and Mandya, Angrosh (2019) Ontology Learning from Twitter Data. In: 11th International Conference on Knowledge Engineering and Ontology Development, 2019-9-17 - 2019-9-19.

Text
keod_OntologyLearning_2019.pdf - Author Accepted Manuscript
Download (10MB) | Preview

Official URL: http://dx.doi.org/10.5220/0008067600940103

Abstract

This paper presents and compares three mechanisms for learning an ontology describing a domain of discoursed as defined in a collection of tweets. The task in part involves the identification of entities and relations in the free text data, which can then be used to produce a set of RDF triples from which an ontology can be generated. The first mechanism is therefore founded on the Stanford CoreNLP Toolkit.; in particular the Named Entity Recognition and Relation Extraction mechanisms that come with this tool kit. The second is founded on the GATE General Architecture for Text Engineering which provides an alternative mechanism for relation extraction from text. Both require a substantial amount of training data. To reduce the training data requirement the third mechanism is founded on the concept of Regular Expressions extracted from a training data “seed set”. Although the third mechanism still requires training data the amount of training data is significantly reduced without adversely affecting the quality of the ontologies generated.

Item Type:	Conference or Workshop Item (Unspecified)
Uncontrolled Keywords:	Ontology Learning, RDF, Relation Extraction, Twitter, Name Entity Recognition, Regular Expression
Depositing User:	Symplectic Admin
Date Deposited:	17 Sep 2019 08:13
Last Modified:	04 Feb 2023 17:21
DOI:	10.5220/0008067600940103
Related URLs:	Author Publisher
URI:	https://livrepository.liverpool.ac.uk/id/eprint/3054851