A convolutional neural network approach to classifying urban spaces using generative tools for data augmentation



Medel-Vera, Carlos ORCID: 0000-0003-0343-4202, Vidal-Estévez, Pelayo and Mädler, Thomas ORCID: 0000-0001-5076-3362
(2024) A convolutional neural network approach to classifying urban spaces using generative tools for data augmentation. International Journal of Architectural Computing.

[img] PDF
medel-vera-et-al-2024-a-convolutional-neural-network-approach-to-classifying-urban-spaces-using-generative-tools-for.pdf - Open Access published version

Download (3MB) | Preview

Abstract

<jats:p> This article discusses an application for classifying urban spaces using convolutional neural networks (CNNs). A seed dataset was initially generated composed of 630 photographs of urban spaces from the Adobe Stock repository. This dataset was topped up with images produced by two generative artificial intelligence (AI) engines, namely, Deep Dream Generator and Midjourney, making two additional augmented datasets, each composed of 2200 images. The training process was carried out using four well-known CNNs, namely, GoogLeNet, ResNet-18, ShuffleNet, and MobileNet-v2. The results show an increase of roughly 30% in the predicting capabilities in both augmented datasets when compared to the seed dataset. Furthermore, performance metrics are generally higher when using ResNet-18 which may suggest that this CNN architecture is more applicable to urban classification projects. Finally, although both generative AI engines have similar performance, Midjourney seems to slightly outperform Deep Dream Generator as a data augmentation engine for urban spaces. </jats:p>

Item Type: Article
Divisions: Faculty of Humanities and Social Sciences > School of the Arts
Depositing User: Symplectic Admin
Date Deposited: 04 Jan 2024 16:05
Last Modified: 18 Jan 2024 08:51
DOI: 10.1177/14780771231225697
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3177714