Detecting Textual Reuse in News Stories, At Scale



Nicholls, Tom ORCID: 0000-0002-6971-8614
(2019) Detecting Textual Reuse in News Stories, At Scale. INTERNATIONAL JOURNAL OF COMMUNICATION, 13. pp. 4173-4197.

Access the full-text of this item by clicking on the Open Access link.

Abstract

Motivated by the debate around “churnalism” and online media, this article develops, evaluates, and validates a computational method for detecting shared text between different news articles, at scale, using n-gram shingling. It differentiates between newswire copy, public relations material, source-to-source copying, and common-source and incidental overlaps. I evaluate the method, quantitatively and qualitatively, and show that it can effectively handle newswire content, copying, and other forms of reuse. Substantively, I find lower levels of news agency and press release copy reuse than is suggested by previous studies, and conclude that the news agency finding is robust, but the lack of press release copy found might reflect limitations of the method and the changing practices of journalists.

Item Type: Article
Uncontrolled Keywords: computational methods, news production, churnalism, news agency, automated content analysis, online news
Divisions: Faculty of Humanities and Social Sciences > School of the Arts
Depositing User: Symplectic Admin
Date Deposited: 12 Apr 2021 08:31
Last Modified: 18 Jan 2023 22:53
Open Access URL: https://ijoc.org/index.php/ijoc/article/view/9904
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3118812