0 Datasets
0 Files
Get instant academic access to this publication’s datasets.
Yes. After verification, you can browse and download datasets at no cost. Some premium assets may require author approval.
Files are stored on encrypted storage. Access is restricted to verified users and all downloads are logged.
Yes, message the author after sign-up to request supplementary files or replication code.
Join 50,000+ researchers worldwide. Get instant access to peer-reviewed datasets, advanced analytics, and global collaboration tools.
✓ Immediate verification • ✓ Free institutional access • ✓ Global collaborationJoin our academic network to download verified datasets and collaborate with researchers worldwide.
Get Free AccessImported cases present a considerable challenge to the elimination of malaria. Traditionally, patient travel history has been used to identify imported cases, but the long-latency liver stages confound this approach in Plasmodium vivax . Molecular tools to identify and map imported cases offer a more robust approach, that can be combined with drug resistance and other surveillance markers in high-throughput, population-based genotyping frameworks. Using a machine learning approach incorporating hierarchical FST (HFST) and decision tree (DT) analysis applied to 831 P. vivax genomes from 20 countries, we identified a 28-Single Nucleotide Polymorphism (SNP) barcode with high capacity to predict the country of origin. The Matthews correlation coefficient (MCC), which provides a measure of the quality of the classifications, ranging from −1 (total disagreement) to 1 (perfect prediction), exceeded 0.9 in 15 countries in cross-validation evaluations. When combined with an existing 37-SNP P. vivax barcode, the 65-SNP panel exhibits MCC scores exceeding 0.9 in 17 countries with up to 30% missing data. As a secondary objective, several genes were identified with moderate MCC scores (median MCC range from 0.54-0.68), amenable as markers for rapid testing using low-throughput genotyping approaches. A likelihood-based classifier framework was established, that supports analysis of missing data and polyclonal infections. To facilitate investigator-lead analyses, the likelihood framework is provided as a web-based, open-access platform (vivaxGEN-geo) to support the analysis and interpretation of data produced either at the 28-SNP core or full 65-SNP barcode. These tools can be used by malaria control programs to identify the main reservoirs of infection so that resources can be focused to where they are needed most.
Hidayat Trimarsanto, Roberto Amato, Richard D. Pearson, Edwin Sutanto, Rintis Noviyanti, Leily Trianty, Jutta Marfurt, Zuleima Pava, Diego F. Echeverry, Tatiana M. Lopera-Mesa, Lidia Madeline Montenegro, Alberto Tobón-Castaño, Matthew J. Grigg, Bridget E. Barber, Timothy William, Nicholas M. Anstey, Sisay Getachew, Beyene Petros, Abraham Aseffa, Ashenafi Assefa, Ghulam Rahim Awab, Nguyen Hoang Chau, Tran Tinh Hien, Mohammad Shafiul Alam, Wasif Ali Khan, Benedikt Ley, Kamala Thriemer, Sonam Wangchuck, Yaghoob Hamedi, Ishag Adam, Yaobao Liu, Qi Gao, Kanlaya Sriprawat, Marcelo U. Ferreira, Alyssa E. Barry, Ivo Müeller, Eleanor Drury, Sónia Gonçalves, Victoria J. Simpson, Olivo Miotto, Alistair Miles, Sir Nicholas White, François Nosten, Dominic Kwiatkowski, Ric N. Price, Sarah Auburn (2019). A molecular barcode and online tool to identify and map imported infection with <i>Plasmodium vivax</i>. bioRxiv (Cold Spring Harbor Laboratory), DOI: 10.1101/776781.
Datasets shared by verified academics with rich metadata and previews.
Authors choose access levels; downloads are logged for transparency.
Students and faculty get instant access after verification.
Type
Preprint
Year
2019
Authors
46
Datasets
0
Total Files
0
Language
English
Journal
bioRxiv (Cold Spring Harbor Laboratory)
DOI
10.1101/776781
Access datasets from 50,000+ researchers worldwide with institutional verification.
Get Free Access