the item-set tree a data structure for data mining

the item-set tree a data structure for data mining

ID:34389117

大小:138.17 KB

页数:10页

时间:2019-03-05

the item-set tree a data structure for data mining_第1页
the item-set tree a data structure for data mining_第2页
the item-set tree a data structure for data mining_第3页
the item-set tree a data structure for data mining_第4页
the item-set tree a data structure for data mining_第5页
资源描述:

《the item-set tree a data structure for data mining》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库

1、*TheItem-SetTree:ADataStructureforDataMining121AlaaeldinHafez,JitenderDeogun,andVijayV.RaghavanAbstract.Enhancementsindatacapturingtechnologyhaveleadtoexponentialgrowthinamountsofdatabeingstoredininformationsystems.Thisgrowthinturnhasmotivatedresearchers

2、toseeknewtechniquesforextractionofknowledgeimplicitorhiddeninthedata.Inthispaper,wemotivatetheneedforanincrementaldataminingapproachbasedondatastructurecalledtheitem-settree.Themotivatedapproachisshowntobeeffectiveforsolvingproblemsrelatedtoefficiencyofh

3、andlingdataupdates,accuracyofdataminingresults,processinginputtransactions,andansweringuserqueries.Wepresentefficientalgorithmstoinserttransactionsintotheitem-settreeandtocountfrequenciesofitemsetsforqueriesaboutstrengthofassociationamongitems.Weprovetha

4、ttheexpectedcomplexityofinsertingatransactionis»O(1),andthatoffrequencycountingisO(n),wherenisthecardinalityofthedomainofitems.1IntroductionAssociationminingthatdiscoversdependenciesamongvaluesofanattributewasintroducedbyAgrawaletal.[1]andhasemergedasapr

5、ominentresearcharea.Theassociationminingproblemalsoreferredtoasthemarketbasketproblemcanbeformallydefinedasfollows.LetI={i1,i2,...,in}beasetofitemsasS={s1,s2,...,sm}beasetoftransactions,whereeachtransactionsiÎSisasetofitemsthatissiÍI.Anassociationruleden

6、otedbyXÞY,whereX,YÌIandXÇY=F,describestheexistenceofarelationshipbetweenthetwoitemsetsXandY.SeveralmeasureshavebeenintroducedtodefinethestrengthoftherelationshipbetweenitemsetsXandYsuchassupport,confidence,andinterest.Thedefinitionsofthesemeasures,fromap

7、robabilisticmodelaregivenbelow.I.Support(XÞY)=P(X,Y),orthepercentageoftransactionsinthedatabasethatcontainbothXandY.II.Confidence(XÞY)=P(X,Y)/P(X),orthepercentageoftransactionscontainingYintransactionsthosecontainX.III.Interest(XÞY)=P(X,Y)/P(X)P(Y)repres

8、entsatestofstatisticalindependence.*ThisresearchwassupportedinpartbytheU.S.DepartmentofEnergy,GrantNo.DE-FG02-97ER1220,andbytheArmyResearchOffice,GrantNo.DAAH04-96-1-0325,underDEPSCoRprogramofAdvancedResearchProjectsAgency

当前文档最多预览五页,下载文档查看全文

此文档下载收益归作者所有

当前文档最多预览五页,下载文档查看全文
温馨提示:
1. 部分包含数学公式或PPT动画的文件,查看预览时可能会显示错乱或异常,文件下载后无此问题,请放心下载。
2. 本文档由用户上传,版权归属用户,天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容,确认文档内容符合您的需求后进行下载,若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误,付费完成后未能成功下载的用户请联系客服处理。