García-Constantino, Matías, Atkinson, Katie ORCID: 0000-0002-5683-4106, Bollegala, Danushka ORCID: 0000-0003-4476-7003, Chapman, Karl, Coenen, Frans ORCID: 0000-0003-1026-6649, Roberts, Claire and Robson, Katy
(2017) CLIEL. In: ICAIL '17: Sixteenth International Conference on Artificial Intelligence and Law, London.

[img] Text
CLIEL_ICAIL2017.pdf - Author Accepted Manuscript

Download (426kB)


The eectiveness of document Information Extraction (IE) is greatly aected by the structure and layout of the documents being considered. In the case of legal documents relating to commercial law, an additional challenge is the many dierent and varied formats, structures and layouts used. In this paper, we present work on a exible and scalable IE environment, the CLIEL (Commercial Law Information Extraction based on Layout) environment, for application to commercial law documentation that allows layout rules to be derived and then utilised to support IE. The proposed CLIEL environment operates using NLP (Natural Language Processing) techniques, JAPE (Java Annotation Patterns Engine) rules and some GATE (General Architecture for Text Engineering) modules. The system is fully described and evaluated using a commercial law document corpus. The results demonstrate that considering the layout is benecial for extracting data point instances from legal document collections.

Item Type: Conference or Workshop Item (Unspecified)
Depositing User: Symplectic Admin
Date Deposited: 11 May 2017 10:39
Last Modified: 19 Jan 2023 07:04
DOI: 10.1145/3086512.3086520
Related URLs: