Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight-中国期刊网

摘要 Withtheadventofthebigdataera,theamountsofsamplingdataandthedimensionsofdatafeaturesarerapidlygrowing.Itishighlydesiredtoenablefastandefficientclusteringofunlabeledsamplesbasedonfeaturesimilarities.Asafundamentalprimitivefordataclustering,thek-meansoperationisreceivingincreasinglymoreattentionstoday.Toachievehighperformancek-meanscomputationsonmodernmulti-core/many-coresystems,weproposeamatrix-basedfusedframeworkthatcanachievehighperformancebyconductingcomputationsonadistancematrixandatthesametimecanimprovethememoryreusethroughthefusionofthedistance-matrixcomputationandthenearestcentroidsreduction.Weimplementandoptimizetheparallelk-meansalgorithmontheSW26010many-coreprocessor,whichisthemajorhorsepowerofSunwayTaihuLight.Inparticular,wedesignataskmappingstrategyforload-balancedtaskdistribution,adatasharingschemetoreducethememoryfootprintandaregisterblockingstrategytoincreasethedatalocality.Optimizationtechniquessuchasinstructionreorderinganddoublebufferingarefurtherappliedtoimprovethesustainedperformance.Discussionsonblock-sizetuningandperformancemodelingarealsopresented.Weshowbyexperimentsonbothrandomlygeneratedandreal-worlddatasetsthatourparallelimplementationofk-meansonSW26010cansustainadouble-precisionperformanceofover348.1Gflops,whichis46.9%ofthepeakperformanceand84%ofthetheoreticalperformanceupperboundonasinglecoregroup,andcanachieveanearlyidealscalabilitytothewholeSW26010processoroffourcoregroups.Performancecomparisonswiththepreviousstate-of-the-artonbothCPUandGPUarealsoprovidedtoshowthesuperiorityofouroptimizedk-meanskernel.

1李卫军. K-means聚类算法的研究综述.计算机科学与技术,2014-08.
2余锦华;郑颖青;吴启树;林金凎;龚振彬. K-MEANS CLUSTERING FOR CLASSIFICATION OF THE NORTHWESTERN PACIFIC TROPICAL CYCLONE TRACKS.大气科学及气象学,2016-02.
3Youquan He Qianqian Zhen. Logistics Customer Segmentation Modeling on Attribute Reduction and K-Means Clustering.计算机应用技术,2013-08.
4李曼赵松林. K-means聚类算法分析应用研究.社会学,2011-03.
5admin. 基于聚类分析的K-means算法研究及应用.自动化与计算机技术,2019-03.
6郑晓霞;赵青杉;陈文杰. 基于改进K-means算法的彩色超声图像分割.教育学,2017-02.
7作者,刘硕,高一凡,程思晗. 基于K-means测距算法列车预警终端的研究.建筑技术科学,2024-04.
8姜云飞. 基于K-means聚类技术的博士招生质量研究.教育学,2018-05.
9尹玉芬. 基于K-means算法的客户分群模型构建与应用.电力系统及自动化,2016-09.
10李白燕;禹定臣. 基于K-means均值聚类的车牌定位算法研究.通信与信息系统,2013-03.

Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight

来源期刊

相关推荐

同分类资源更多

相关关键词

Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight

来源期刊

相关推荐

同分类资源 更多

相关关键词

同分类资源更多