High precision but variable recall – comparing the performance of five deduplication tools
Main Article Content
Abstract
Deduplication methods for multiple database searches conducted for evidence syntheses differ in terms of time invested, accuracy, and comprehensiveness of identified duplicates. Deduplication tools can significantly contribute to a more efficient conduct of the search task in evidence syntheses. Widely-used tools for deduplication include reference management software (e.g. EndNote), built-in deduplication features in systematic review software (e.g. Covidence, Rayyan), and automated deduplication tools (e.g. Deduklick, SRA Deduplicator). Newer tools leverage machine learning algorithms crafted by information specialists, that encompass natural language normalization and rule-based approaches. We investigated five frequently used automated and semi-automated deduplication tools regarding their performance, core features and time efficiency in comparison to manual deduplication in EndNote using six datasets.
Article Details
How to Cite
1.
High precision but variable recall – comparing the performance of five deduplication tools. J Eur Assoc Health Info Libr [Internet]. 2024 Mar. 17 [cited 2024 Oct. 30];20(1):12-7. Available from: https://ojs.eahil.eu/JEAHIL/article/view/607
Issue
Section
Feature Articles
JEAHIL is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) licence, unless otherwise stated. Please read our Policies page for more information on Open Access, copyright and permissions.
How to Cite
1.
High precision but variable recall – comparing the performance of five deduplication tools. J Eur Assoc Health Info Libr [Internet]. 2024 Mar. 17 [cited 2024 Oct. 30];20(1):12-7. Available from: https://ojs.eahil.eu/JEAHIL/article/view/607