Supplemental Material for Chen et al., 2019 ChenXilong LiShiming ZhangDong HanMingyu JinXin ZhaoCaipin WangSongbo XingLibo MaJuanjuan AnNa 2019 Supplemental figures and tables of article "Sequencing of a wild apple (<i>Malus baccata</i>) genome unravels the differences between cultivated and wild apple species regarding disease resistance and cold tolerance"<div><p>Figure S1. Distribution of 17-mers in raw sequence data from the <i>M. baccata</i> genomes.</p><p>Figure S2. Genetic and physical maps of <i>M. baccata</i> genome.</p><p>Figure S3. Indication of relationship between genetic distance and physical distance for the <i>M. baccata</i> genome chromosomes.</p><p>The left blue bars indicate pseudomolecules, and the right green bars represent linkage groups corresponding to 17 chromosomes. The light green blocks indicate ordered regions and markers, and the dark green blocks and lines represent unordered but matched scaffolds to the genetic map.</p><p>Figure S4. Stacked bar plots of the categories of BUSCO groups searched in <i>M. baccata</i>genome and other Rosaceae plants.</p><p>Different color represents complete single copy, complete duplicated copy, fragmented and missing BUSCOs respectively. (Mb, <i>Malus baccata</i>; Md, <i>Malus domestica</i>; Pb, <i>Pyrus bretschneideri</i>; Pp, <i>Prunus persica</i>; Fv, <i>Fragaria vesca</i>)</p><p>Figure S5. The distribution of the distance between SNPs in <i>M. baccata</i> genome.</p><p>Figure S6. SNP frequency per gene in <i>M. baccata</i> genome.</p><p>Figure S7. Venn diagram of genes with homology or functional classifications by different methods.</p><p>Figure S8. Histogram of Ka/Ks values for orthologous genes from <i>M. baccata</i> and <i>M. domestica</i>.</p><p>Figure S9. Divergence time between 10 species.</p><p>Figure S10. Synteny analyses among Rosaceae species.</p><p>The numbers of nodes represent divergence time in MYA (million years ago). The red node represents known divergence time range. The polygonal star represents WGD invent according to previous studies.</p><p></p><p>Table S1 <i>M. baccata</i> genome size estimation of 17-mer</p><p>Table S2 Global statistics of <i>M. baccata</i> genome sequencing.</p><p>Table S3 Summary of the <i>M. baccata</i> genome assembly</p><p>Table S4 Statistics for each pseudo-chromosome</p><p>Table S5 Statistics for linkage groups construction</p><p>Table S6 Assessment of sequence coverage of the <i>M. baccata</i> genome assembly by <i>Malus.</i> EST sequences downloaded from NCBI.</p><p>Table S7 Assessing the gene region assembly by RNA sequencing and assembly.</p><p>Table S8 Evaluation of gene space coverage using core eukaryotic gene mapping approach (CEGMA).</p><p>Table S9 Summarized BUSCO benchmarking for <i>M. baccata</i> genome</p><p>Table S10 Annotation of heterozygosity SNP s detected in <i>M. baccata</i> genome.</p><p>Table S11 Significantly (Q value < 0.05) enriched KEGG pathways among genes with high frequencies (>3%) of SNPs in <i>M. baccata</i> genome.</p><p>Table S12 Statistics of repeats of <i>M. baccata</i> genome</p><p>Table S13 Summary and content analysis of different types of TEs in the <i>M. baccata</i>genome.</p><p>Table S14 Summary of identified TEs in sequenced Rosaceae plant</p><p>Table S15 Statistics of predicted protein-coding genes in <i>M. baccata</i> using five different methods.</p><p>Table S16 Comparison of the gene sets of <i>M. baccata</i> with those of other Rosaceae plant</p><p>Table S17 Number of genes with homology or functional classifications by different methods. Four protein databases were used for predicting gene functions.</p><p>Table S18 Summary of predicted non-coding RNAs in the <i>M. baccata</i>.</p><p>Table S19 Significantly (Q value < 0.05) enriched KEGG pathways among genes with Ka/Ks >1 from <i>M. baccata</i> and <i>M. domestica</i>.</p><p>Table S20 Summary of the annotation of genes with Ka/Ks >1 and P < 0.05 from <i>M. baccata</i> and <i>M. domestica</i>.</p><p>Table S21 Significantly enriched KEGG pathways among genes expanded in <i>M. baccata</i>genome.</p><p>Table S22 Significantly (Q value < 0.05) enriched KEGG pathways among <i>M. baccata</i>and <i>M. domestica</i> genes from expanded and contracted gene families</p><p>Table S23 Summary of synteny blocks with at least four gene pairs between <i>M. baccata</i>and the other Rosaceae genomes.</p><p>Table S24 Transcription factors (TF) present in sequenced Rosaceae plant.</p><p>Table S25 Number of NBS genes cluster in chromosome of <i>M. baccata</i> and <i>M. domestica</i>.</p></div>