1、分类号密级硕士学位论文题目:聚类分析在文本挖掘中的应用与研究英文并列题目:Cluster Analysis Application andResearch of Text Mining研究生:盛华专业:计算机科学与技术研究方向:计算机软件与理论导师:张桂珠指导小组成员:学位授予日期:2016年6月答辩委员会主席:张曦煌江南大学地址:无锡市蠡湖大道1800号二○一六年六月摘要摘要Web2.0时代的到来,使得网络上的文本信息呈现出爆炸性的增长,人们在对互联网上所需信息查阅整理所花费的精力时间也越来越多,导致如何从这些海量有噪音的文本中及时准确地搜索
5、行了比较与分析。实验表明改进的聚类算法在文本挖掘应用中的聚类效果、准确性以及稳定性方面都有很大的提升。关键词:文本聚类;k-means算法;快速密度峰值搜索算法;文本挖掘IAbstractAbstractThe arrival of Web2.0 era, making the text information on the network showing explosive growth, people in the information required on the Internet to organize Now it takes m
6、ore and more energy and time, lead to information on how these massive noise from text timely and accurately search for information useful to the user is required to wait one kind of problem. In this context, the use of text clustering technology for large text information f
7、iltering and automatic archiving, and extracts the main text feature from this information, can greatly reduce the manual workload Now finishing the document, improve document retrieval efficiency is a very far-reaching significance and application prospects. By studying the
8、 density of fast peak search algorithm (CFSFDP) and it is proposed to impro