RDL logo
About
Aims and ScopeAdvisory Board Members
More
Who We Are?
User Guide
​
​
Sign inGet started
​
​

About
Aims and ScopeAdvisory Board Members
More
Who We Are?
User Guide

Sign inGet started
RDL logo

Verified research datasets. Instant access. Built for collaboration.

Navigation

About

Aims and Scope

Advisory Board Members

More

Who We Are?

Add Raw Data

User Guide

Legal

Privacy Policy

Terms of Service

Support

Got an issue? Email us directly.

Email: info@rawdatalibrary.netOpen Mail App
​
​

© 2025 Raw Data Library. All rights reserved.
PrivacyTerms
  1. Raw Data Library
  2. /
  3. Publications
  4. /
  5. Diagnosing migraine from genome-wide genotype data: a machine learning analysis

Verified authors • Institutional access • DOI aware
50,000+ researchers120,000+ datasets90% satisfaction
Article
en
2025

Diagnosing migraine from genome-wide genotype data: a machine learning analysis

0 Datasets

0 Files

en
2025
DOI: 10.1093/brain/awaf172

Get instant academic access to this publication’s datasets.

Create free accountHow it works

Frequently asked questions

Is access really free for academics and students?

Yes. After verification, you can browse and download datasets at no cost. Some premium assets may require author approval.

How is my data protected?

Files are stored on encrypted storage. Access is restricted to verified users and all downloads are logged.

Can I request additional materials?

Yes, message the author after sign-up to request supplementary files or replication code.

Advance your research today

Join 50,000+ researchers worldwide. Get instant access to peer-reviewed datasets, advanced analytics, and global collaboration tools.

Get free academic accessLearn more
✓ Immediate verification • ✓ Free institutional access • ✓ Global collaboration
Access Research Data

Join our academic network to download verified datasets and collaborate with researchers worldwide.

Get Free Access
Institutional SSO
Secure
This PDF is not available in different languages.
No localized PDFs are currently available.
Paul M Ridker
Paul M Ridker

Harvard University

Verified
Antonios Danelakis
Tjaša Kumelj
Bendik S Winsvold
+73 more

Abstract

Abstract Migraine has an assumed polygenic basis, but the genetic risk variants identified in genome-wide association studies only explain a proportion of the heritability. We aimed to develop machine learning models, capturing non-additive and interactive effects, to address the missing heritability. This was a cross-sectional population-based study of participants in the second and third Trøndelag Health Study. Individuals underwent genome-wide genotyping and were phenotyped based on validated modified criteria of the International Classification of Headache Disorders. Four datasets of increasing number of genetic variants were created using different thresholds of linkage disequilibrium and univariate genome-wide associated p-values. A series of machine learning and deep learning methods were optimized and evaluated. The genotype tools PLINK and LDPred2 were used for polygenic risk scoring. Models were trained on a partition of the dataset and tested in a hold-out set. The area under the receiver operating characteristics curve was used as the primary scoring metric. Classification by machine learning was statistically compared to that of polygenic risk scoring. Finally, we explored the biological functions of the variants unique to the machine learning approach. 43,197 individuals (51% women), with a mean age of 54.6 years, were included in the modelling. A light gradient boosting machine performed best for the three smallest datasets (108, 7,771 and 7,840 variants), all with hold-out test set area under curve at 0.63. A multinomial naïve Bayes model performed best in the largest dataset (140,467 variants) with a hold-out test set area under curve of 0.62. The models were statistically significantly superior to polygenic risk scoring (area under curve 0.52 to 0.59) for all the datasets (p<0.001 to p=0.02). Machine learning identified many of the same genes and pathways identified in genome-wide association studies, but also several unique pathways, mainly related to signal transduction and neurological function. Interestingly, pathways related to botulinum toxins, and pathways related to the calcitonin gene-related peptide receptor also emerged. This study suggests that migraine may follow a non-additive and interactive genetic causal structure, potentially best captured by complex machine learning models. Such structure may be concealed where the data dimensionality (high number of genetic variants) is insufficiently supported by the scale of available data, leaving a misleading impression of purely additive effects. Future machine learning models using substantially larger sample sizes could harness both the additive and the interactive effects, enhancing precision and offering deeper understanding of genetic interactions underlying migraine.

How to cite this publication

Antonios Danelakis, Tjaša Kumelj, Bendik S Winsvold, Marte‐Helene Bjørk, Parashkev Nachev, Manjit Matharu, Dominic Giles, Verneri Anttila, Ville Artto, Andrea Carmine Belin, Anna Bjornsdottir, Gyða Björnsdóttir, Dorret I Boomsma, Sigrid Børte, Mona Ameri Chalmer, Daniel I. Chasman, Bru Cormand, Ester Cuenca-León, George Davey Smith, Irene de Boer, Martin Dichgans, Tõnu Esko, Tobias Freilinger, Padhraig Gormley, Lyn R. Griffiths, Eija Hämäläinen, Thomas Hansen, Aster V. E. Harder, Heidi Hautakangas, Marjo Hiekkala, Maria Gudlaug Hrafnsdottir, M Arfan Ikram, Paul M Ridker, Risto Kajanne, Mikko Kallela, Jaakko Kaprio, Mari Kaunisto, Lisette J. A. Kogelman, Espen Saxhaug Kristoffersen, Christian Kubisch, Mitja Kurki, Tobias Kurth, Lenore J. Launer, Terho Lehtimäki, Davor Lessel, Lannie Ligthart, Sigurður H. Magnússon, Rainer Malik, Bertram Müller−Myhsok, Carrie A. M. Northover, Dale R. Nyholt, Jes Olesen, Aarno Palotie, Priit Palta, Linda M. Pedersen, Nancy L. Pedersen, Matti Pirinen, Daniëlle Posthuma, Patricia Pozo‐Rosich, Alice Pressman, Olli T. Raitakari, Caroline Ran, Gudrun R. Sigurdardottir, Hreinn Stefánsson, Hreinn Stefánsson, Ólafur Sveinsson, Gisela M. Terwindt, Thorgeir E Thorgeirsson, Anouk den Braber, Cornelia M. van Duijn, Maija Wessman, Bendik S Winsvold, John‐Anker Zwart, Erling Tronvik, Helge Langseth, Anker Stubberud (2025). Diagnosing migraine from genome-wide genotype data: a machine learning analysis. , DOI: https://doi.org/10.1093/brain/awaf172.

Related publications

Why join Raw Data Library?

Quality

Datasets shared by verified academics with rich metadata and previews.

Control

Authors choose access levels; downloads are logged for transparency.

Free for Academia

Students and faculty get instant access after verification.

Publication Details

Type

Article

Year

2025

Authors

76

Datasets

0

Total Files

0

Language

en

DOI

https://doi.org/10.1093/brain/awaf172

Join Research Community

Access datasets from 50,000+ researchers worldwide with institutional verification.

Get Free Access