A Solanum lycopersicoides reference genome facilitates insights into tomato specialized metabolism and immunity.
Wild relatives of tomato are a valuable source of natural variation in tomato breeding, as many can be hybridized to the cultivated species (Solanum lycopersicum). Several, including Solanum lycopersicoides, have been crossed to S. lycopersicum for the development of ordered introgression lines (ILs), facilitating breeding for desirable traits. Despite the utility of these wild relatives and their associated ILs, few finished genome sequences have been produced to aid genetic and genomic studies. Here we report a chromosome-scale genome assembly for S. lycopersicoides LA2951, which contains 37,938 predicted protein-coding genes. With the aid of this genome assembly, we have precisely delimited the boundaries of the S. lycopersicoides introgressions in a set of S. lycopersicum cv. VF36 × LA2951 ILs. We demonstrate the usefulness of the LA2951 genome by identifying several quantitative trait loci for phenolics and carotenoids, including underlying candidate genes, and by investigating the genome organization and immunity-associated function of the clustered Pto gene family. In addition, syntenic analysis of R2R3MYB genes sheds light on the identity of the Aubergine locus underlying anthocyanin production. The genome sequence and IL map provide valuable resources for studying fruit nutrient/quality traits, pathogen resistance, and environmental stress tolerance.

We present a new genome resource for the wild species S. lycopersicoides, which we use to shed light on the Aubergine locus responsible for anthocyanin production. We also provide IL boundary mappings, which facilitated the identification of novel carotenoid quantitative trait loci, one of which was likely driven by an uncharacterized lycopene β-cyclase whose function we demonstrate.