Identification of significant genome-wide associations and QTL underlying variation in seed protein composition in pea (Pisum sativum L.).

gold Gold open access

Pulses are a valuable source of plant proteins for human and animal nutrition and have various industrial applications. Understanding the genetic basis for the relative abundance of different seed storage proteins is crucial for developing cultivars with improved protein quality and functional properties. In this study, we employed two complementary approaches, genome-wide association study (GWAS) and quantitative trait locus (QTL) mapping, to identify genetic loci underlying seed protein composition in pea (Pisum sativum L.). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis was used to separate the seed proteins, and their relative abundance was quantified using densitometric analysis. For GWAS, we analyzed a diverse panel of 209 accessions genotyped with an 84,691 single-nucleotide polymorphism (SNP) array and identified genetic loci significantly associated with globulins, such as convicilin, vicilin, legumins, and non-globulins, including lipoxygenase, late embryogenesis abundant protein, and annexin-like protein. Additionally, using QTL mapping with 96 recombinant inbred lines, we mapped 11 QTL, including five that overlapped with regions identified by GWAS for the same proteins. Some of the significant SNPs were within or near the genes encoding seed proteins and other genes with predicted functions in protein biosynthesis, trafficking, and modification. This comprehensive genetic mapping study serves as a foundation for future breeding efforts to improve protein quality in pea and other legumes.