Retrieving similar cases for construction project risk management using Natural Language Processing techniques



Zou, Y ORCID: 0000-0001-6150-6126, Kiviniemi, A ORCID: 0000-0001-6570-0188 and Jones, SW ORCID: 0000-0002-8977-8403
(2017) Retrieving similar cases for construction project risk management using Natural Language Processing techniques. AUTOMATION IN CONSTRUCTION, 80. pp. 66-76.

This is the latest version of this item.

[img] Text
AiC accepted_SWJ.docx - Author Accepted Manuscript
Available under License : See the attached licence file.

Download (1MB)
[img] Text
Retrieving similar cases for construction project risk management .pdf - Author Accepted Manuscript
Available under License : See the attached licence file.

Download (867kB)

Abstract

Case-based reasoning (CBR) is an important approach in construction project risk management. It emphasises that previous knowledge and experience of accidents and risks are highly valuable and could contribute to avoiding similar risks in new situations. In the CBR cycle, retrieving useful information is the first and the most important step. To facilitate the CBR for practical use, some researchers and organisations have established construction accident databases and their size is growing. However, as those documents are written in everyday language using different ways of expression, how information in similar cases is retrieved quickly and accurately from the database is still a huge challenge. In order to improve the efficiency and performance of risk case retrieval, this paper proposes an approach of combining the use of two Natural Language Processing (NLP) techniques, i.e. Vector Space Model (VSM) and semantic query expansion, and outlines a framework for this Risk Case Retrieval System. A prototype system is developed using the Python programming language to support the implementation of the proposed method. Preliminary test results show that the proposed system is capable of retrieving similar cases automatically and returning, for example, the top 10 similar cases.

Item Type: Article
Uncontrolled Keywords: Risk management, Case-based reasoning (CBR), Natural Language Processing (NLP), Vector Space Model (VSM), Query expansion, Case retrieval
Depositing User: Symplectic Admin
Date Deposited: 07 Dec 2017 10:52
Last Modified: 19 Jan 2023 06:53
DOI: 10.1016/j.autcon.2017.04.003
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3010235

Available Versions of this Item