利用Hadoop构建云计算基础教程

利用Hadoop构建云计算基础教程

ID:43753250

大小:539.42 KB

页数:67页

时间:2019-10-13

利用Hadoop构建云计算基础教程_第1页
利用Hadoop构建云计算基础教程_第2页
利用Hadoop构建云计算基础教程_第3页
利用Hadoop构建云计算基础教程_第4页
利用Hadoop构建云计算基础教程_第5页
资源描述:

《利用Hadoop构建云计算基础教程》由会员上传分享,免费在线阅读,更多相关内容在工程资料-天天文库

1、BigDalaplacetjAl***■八JTV■ti&*»丁fl\k/Jpj73•J^^—<、.4=•Home•BigData•HadoopTutorials•Cassandra•HectorAPI•RequestTutorial•AboutLABELS:HADOOP-TUTORIAL,HDFS3OCTOBER2013HadoopTutorial:Part1-WhatisHadoop?(anOverview)Hadoopisanopensourcesoftwareframeworkthatsupportsdataintensivedistributedapplicationsw

2、hichislicensedunderApachev2license.At-leastthisiswhatyouaregoingtofindasthefirstlineofdefinitiononHadoopinWikipedia.Sowhatisdataintensivedistributedapplications?WelldataintensiveisnothingbutBigData(datathathasoutgrowninsize)anddistributedapplicationsaretheapplicationsthatworksonnetworkbycommunic

3、atingandcoordinatingw让heachotherbypassingmessages.(sayusingaRPCinterprocesscommunicationorthroughMessage-Queue)HenceHadoopworksonadistributedenvironmentandisbuildtostore,handleandprocesslargeamountofdataset(inpetabytes,exabyteandmore).Nowheresinceiamsayingthathadoopstorespetabytesofdata,thisdoes

4、n'tmeanthatHadoopisadatabase.Againremember让saframeworkthathandleslargeamountofdataforprocessing.YouwillgettoknowthedifferencebetweenHadoopandDatabases(orNoSQLDatabases,wellthat'swhatwecallBigDatafsdatabases)asyougodownthelineinthecomingtutorials.HadoopwasderivedfromtheresearchpaperpublishedbyGoo

5、gleonGoogleFileSystem(GFS)andGoogle'sMapReduce・SotherearetwointegralpartsofHadoop:HadoopDistributedFileSystem(HDFS)andHadoopMapReduce.HadoopDistributedFileSystem(HDFS)HDFSisafilesystemdesignedforstoringverylargefileswithstreamingdataaccesspatterns,runningonclustersofcommodityhardware.WellLetsget

6、intothedetailsofthestatementmentionedabove:VeryLargefiles:Nowwhenwesayverylargefileswemeanherethatthesizeofthefilewillbeinarangeofgigabyte,terabyte,petabyteormaybemore.Streamingdataaccess:HDFSisbuiltaroundtheideathatthemostefficientdataprocessingpatternisawilte-oncezread-many-timespattern.Adatas

7、etistypicallygeneratedorcopiedfromsource,andthenvariousanalysesareperformedonthatdatasetovertime.Eachanalysiswillinvolvealargeproportion,ifnotall,ofthedataset,sothetimetoreadthewholedatasetismoreimportantthanthelatencyinread

当前文档最多预览五页,下载文档查看全文

此文档下载收益归作者所有

当前文档最多预览五页,下载文档查看全文
温馨提示:
1. 部分包含数学公式或PPT动画的文件,查看预览时可能会显示错乱或异常,文件下载后无此问题,请放心下载。
2. 本文档由用户上传,版权归属用户,天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容,确认文档内容符合您的需求后进行下载,若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误,付费完成后未能成功下载的用户请联系客服处理。