Global-local feature attention network with reranking strategy for image caption generation

在线阅读 下载PDF 导出详情
摘要 Inthispaper,anovelframework,namedasglobal-localfeatureattentionnetworkwithrerankingstrategy(GLAN-RS),ispresentedforimagecaptioningtask.Ratherthanonlyadoptingunitaryvisualinformationintheclassicalmodels,GLAN-RSexplorestheattentionmechanismtocapturelocalconvolutionalsalientimagemaps.Furthermore,weadoptrerankingstrategytoadjustthepriorityofthecandidatecaptionsandselectthebestone.TheproposedmodelisverifiedusingtheMicrosoftCommonObjectsinContext(MSCOCO)benchmarkdatasetacrosssevenstandardevaluationmetrics.ExperimentalresultsshowthatGLAN-RSsignificantlyoutperformsthestate-of-the-artapproaches,suchasmultimodalrecurrentneuralnetwork(MRNN)andGoogleNIC,whichgetsanimprovementof20%intermsofBLEU4scoreand13pointsintermsofCIDERscore.
机构地区 不详
关键词
出版日期 2017年06月16日(中国期刊网平台首次上网日期,不代表论文的发表时间)
  • 相关文献