Indexing the bit-code and distance for fast KNN search in high-dimensional spaces

Online publication date: 2007-06-16
Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (1D) transformation can break the curse of dimensionality. Based on the two techniques above, a novel high-dimensional index is proposed, called Bit-code and Distance based index (BD). BD is based on a special partitioning strategy which is optimized for high-dimensional data. By the definitions of bit code and transformation function, a high-dimensional vector can be first approximately represented and then transformed into a 1D vector, the key managed by a B+-tree. A new KNN search algorithm is also proposed that exploits the bit code and distance to prune the search space more effectively. Results of extensive experiments using both synthetic and real data demonstrated that BD outperforms the existing index structures for KNN search in high-dimensional spaces.
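The abstract does not give the definitions of the bit code or the transformation function, so the following is only a minimal sketch of the general idea under stated assumptions, not the BD index itself. It assumes a bit code with one bit per dimension (set when the coordinate is at least that of a chosen reference point), a key formed from the pair (bit code, distance to the reference point), and a sorted Python list standing in for the B+-tree. The names `bit_code`, `make_key`, `build_index` and `range_search` are hypothetical, and the search shown is a simple range query pruned with the triangle inequality rather than the paper's KNN algorithm.

```python
import bisect
import math


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def bit_code(vec, ref):
    """Assumed bit code: one bit per dimension, 1 if vec[i] >= ref[i]."""
    code = 0
    for x, r in zip(vec, ref):
        code = (code << 1) | (1 if x >= r else 0)
    return code


def make_key(vec, ref):
    """Assumed key: the bit code picks a partition, the distance orders within it.

    The tuple sorts the same way as code * c + dist would for any constant c
    larger than the maximum distance, so it behaves like a single 1D key.
    """
    return (bit_code(vec, ref), distance(vec, ref))


def build_index(points, ref):
    """Sorted list of (key, point_id); a stand-in for a B+-tree on the keys."""
    return sorted((make_key(p, ref), i) for i, p in enumerate(points))


def range_search(index, points, ref, query, radius):
    """Return ids of points within `radius` of `query`.

    Pruning: if d(p, query) <= radius, then by the triangle inequality
    |d(p, ref) - d(query, ref)| <= radius, so within each partition only
    keys whose distance part lies in [dq - radius, dq + radius] are scanned.
    """
    dq = distance(query, ref)
    lo, hi = max(0.0, dq - radius), dq + radius
    hits = []
    for code in sorted({key[0] for key, _ in index}):
        start = bisect.bisect_left(index, ((code, lo), -1))
        end = bisect.bisect_right(index, ((code, hi), len(points)))
        for _, i in index[start:end]:
            # Candidates are only approximate matches; verify with the real distance.
            if distance(points[i], query) <= radius:
                hits.append(i)
    return sorted(hits)
```

A KNN query could be built on top of such a range search by enlarging the radius until at least k verified points are found; whether BD does this, or uses the bit code for further pruning between partitions, is not stated in the abstract.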