学科分类
/ 1
1 个结果
  • 简介:Thelargeamountofrepeats,especiallyhighcopyrepeats,inthegenomesofhigheranimalsandplantsmakeswholegenomeassembly(WGA)quitedifficult.Inordertosolvethisproblem,wetriedtoidentifyrepeatsandmaskthempriortoassemblyevenatthestageofgenomesurvey.Itisknownthatrepeatsofdifferentcopynumberhavedifferentprobabilitiesofappearanceinshotgundata,sobasedonthisprinciple,weconstructedastatisticalmodelandinferredcriteriaformathematicallydefinedrepeats(MDRs)atdifferentshotguncoverages.Accordingtothesecriteria,wedevelopedsoftwareMDRmaskertoidentifyandmaskMDRsinshotgundata.Withrepeatsmaskedpriortoassembly,thespeedofassemblywasincreasedwithlowererrorprobability.Inaddition,clone-insertsizeaffectstheaccuracyofrepeatassemblyandscaffoldconstruction.Wealsodesignedlengthdistributionofclone-insertsusingourmodel.Inoursimulatedgenomesofhumanandrice,thelengthdistributionofrepeatsisdifferent,sotheiroptimallengthdistributionsofclone-insertswerenotthesame.Thuswithoptimallengthdistributionofclone-inserts,agivengenomecouldbeassembledbetteratlowercoverage.

  • 标签: 重复试验 嵌入式克隆 基因克隆 MDR 统计方法