nosql: a distributed solution of big data

Upload: mobin-ranjbar

Post on 04-Jun-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/13/2019 NoSQL: A Distributed Solution of Big Data

    1/4

    1

    KdVQN9A Obstrbjutmo Vdnutbdk dg Jbe Oata

    Hdjbk Zakijar

    Azao ]kbvmrsbty dg Oahe`ak

    Oahe`ak, Brak

    AJV_ZAF_

    _`m rmfmkt aovakfm bk obstrbjutmo oata

    hakaemhmkt tmf`kbqums arm erdwbke, tdoay

    t`m wdrno kmmo oatajasms td jm ajnm td stdrm

    ako prdfmss jbe oata mggmftbvmny, omhako gdr

    vmry `be`-pmrgdrhakfm w`mk rmaobke ako

    wrbtbke, t`msm mggmfts mspmfbanny bk narem sfanm

    ako `be` fdkfurrmkfy appnbfatbdks, suf` as

    smarf` mkebkms `mkfm t`m traobtbdkan oatajasmnbhbts btsmng gdr suf` fdhpnmx rmqubrmhmkts,

    t`mrmgdrm varbdus typms dg kdk-rmnatbdkan

    oatajasms t`at arm fdhhdkny rmgmrrmo td as

    KdVQN oatajasms w`bf` bs ajjrmvbatbdk dg

    Kdt dkny strufturmo qumry nakeuaem.

    Cmywdros

    KdVQN, Cmy-Wanum Vtdrm, Jbe_ajnm, Oata-

    jdttnmkmfc, Fd mrmkfm.

    1. BK_ZDO]F_BDK

    _`m prdjnmhs wbt` Zmnatbdk oatajasm bs t`at bt

    nafcmo `akonbke mxpdkmktban erdwt` dg oata.

    Haky dreakbzatbdks fdnnmft vast ahdukts dg

    fustdhmr, sfbmktbgbf, sanms, ako dt`mr oata gdr

    guturm akanysbs. _raobtbdkanny, hdst dg t`msm

    dreakbzatbdks `avm stdrmo strufturmo oata bk

    rmnatbdkan oatajasms gdr sujsmqumkt affmss ako

    akanysbs. @dwmvmr, a erdwbke kuhjmr dg

    omvmndpmrs ako usmrs `avm jmeuk turkbke td

    varbdus typms dg kdk-rmnatbdkan, kdw

    grmqumktny fannmo KDVQN oatajasms S:X._`mbr

    prbhary aovaktaem dg KDVQN Oatajasm bs

    t`at, uknbcm rmnatbdkan oatajasms t`my `akonm

    ukstrufturmo oata suf` as odfuhmkts, m-habn,

    huntbhmoba ako sdfban hmoba mggbfbmktny. _`m

    fdhhdk gmaturms dg KDVQN oatajasms fak jm

    suhharbzmo as `be` sfanajbnbty ako rmnbajbnbty,

    vmry sbhpnm oata hdomn, vmry sbhpnm

    (prbhbtbvm) qumry nakeuaem, nafc dg hmf`akbsh

    gdr `akonbke ako hakaebke oata fdksbstmkfy

    ako bktmerbty fdkstrabkts habktmkakfm S1X.

    Gbeurm. 1 Vfanajnm dg oata sbzm grdh :

  • 8/13/2019 NoSQL: A Distributed Solution of Big Data

    2/4

    :

    wbt` t`m erdwbke oata kuhjmrs pnus t`mrm bs ak

    bhpdrtakt bssum dg oata sykf`rdkbzatbdk

    jmfausm a f`akem at a sbkenm oata sbtm wduno

    rmqubrm ann t`m oata sbtms td jm upoatmo ako

    `mkfm rmsunts bktd akdt`mr prdjnmh dg oata

    obstrbjutmo ndfcbke. _d avdbo t`m prdjnmh dg

    obstrbjutmo ndfcbke akdt`mr apprdprbatm

    sdnutbdk bs td s`aro t`m oata rmpdsbtdry bktdhuntbpnm oata stdraem wbt`dut rmpnbfatbke

    w`dnm oata nbcm obvbobke t`m fustdhmr ako

    prdouft oatajasm sd t`at jdt` t`m prdjnmh dg

    oata jdttnmkmfcbke ako obstrbjutmo ndfcbke bs

    sdnvmo jut a prdjnmh dg oata sykf`rdkbzatbdk

    bs mkfduktmrmo wbt` suf` oata s`arobke.

    _raobtbdkan ZOJHV mkfduktmrs a oatajasm

    sf`mha w`bf` fdksbsts dg a tdd haky idbks ako

    rmnatbdks`bps w`bf` hacms t`m oata stdraem afdhpnmx bssum. Vmt dg bkstruftbdks w`bf` arm

    kdt rmqubrmo gdr t`m appnbfatbdk arm dgtmk

    pmrgdrhmo ako `mkfm bkfrmasms t`m dvmr`maos

    dk t`m systmh, _`m systmh `avm td oman wbt`

    smvmran tajnms ako appny t`m pdssbjnm idbks bk

    t`mh w`bf` rmsunt bk oata ndfcbke ako natf`bke

    ansd gdnndwbke t`m strbft runms dg AFBO w`bf`

    stakos gdr Atdhbfbty, Fdksbstmkfy, Bsdnatbdk

    ako Ourajbnbty gurt`mr aoos up td t`m

    fdhpnmxbty t`at hacms sfanbke dg t`m oatajasma fdhpnmx tasc as w`mk oatajasm kmmo td jm

    fdksbstmkt varbdus bkstruftbdks arm tacmk farm

    dg nbcm juggmr hakaemr, `ako fdom

    dptbhbzatbdk mtf w`mrmas t`m dvmrann usmgun

    wdrc odkm bs fdksbomrajny ndw t`at bkfrmasms

    ukkmmomo fdhpnmxbtbms ako dvmr`mao.

    Gbeurm :. Oatajasm V`arobke

    6. KDVQN FA_MEDZBMV

    KDVQN fak jm jrdcmk bktd ? obggmrmkt

    fatmedrbms.

    Cmy Wanum Vtdrms Jbe _ajnm Odfuhmkt Oatajasms Erap` Oatajasms

    Maf` oatajasm bs bkobvbouanny eddo at omanbke

    wbt` sbzm ako fdhpnmxbtbms.

    6.1 Cmy Wanum Vtdrms

    Cmy vanum oata hdomn hmaks t`at a vanum

    fdrrmspdkos td a Cmy. Ant`due` t`m strufturm

    bs sbhpnmr, t`m qumry spmmo bs `be`mr t`ak

    rmnatbdkan oatajasm, suppdrts hass stdraem ako

    `be` fdkfurrmkfy, mtf., Bt prdvbomo suppdrt gdr

    qumry ako hdobgy dpmratbdks gdr oata t`rdue`

    t`m prbhary cmy S6X. Cmy vanums rmprmsmkt

    jufcmt dg oata. Gdr mxahpnm, bk fasm dg a

    s`dppbke fart hmktbdkmo bk Gbeurm 6, maf`

    s`dppbke fart arm rmprmsmktmo bk bkobvbouan

    jufcmts ako rmprmsmktmo usbke a cmy vanum

    w`bf` fduno jm usmr bo.

    _`m cmy vanums fak jm smrbanbzmo usbke mbt`mr

    iava smrbanbzatbdk dr PHN. _`bs way bs vmrygast td stdrm as bt iust wrbtms jbts td t`m obsfs.

    Vdhm dg cmy vanum stdrms avabnajnm bk harcmt

    arm Jmrcmnmy OJ, _dcyd _yrakt, Wdnomhart,

    Frassakora.

    6.: JBE_AJNM

    Vmarf` mkebkm Yvmkts omvmndp dpmk sdurfm

    obstrbjutmo oata stdraem systmh `ypmr tajnm S6X

    jy orawbke jbe tajnm.

    A Jbe_ajnm bs a nbe`t, sfattmrmo, fdkstakt

    huntbobhmksbdkan sdrtmo hap. Bkomxbke dg t`m

    hap bs odkm jy a rdw cmy, fdnuhk cmy, ako a

    tbhmstahp. Bk Jbe_ajnm, uk-bktmrprmtmo arrays

    dg jytms arm usmo as vanums. Jbe_ajnm stdrms

    strufturmo oata. Aky typm dg oata grdh tmxt td

    smrbanbzmo djimfts fak jm stdrmo jy

  • 8/13/2019 NoSQL: A Distributed Solution of Big Data

    3/4

    6

    appnbfatbdks. Bt odms kdt bhpdsm aky sbzm

    fdkstrabkt gdr maf` vanum. A tajnm bs anndwmo td

    `avm nbhbtnmss kuhjmr dg fdnuhks. Oata bs

    bkomxmo usbke rdw ako fdnuhk kahms t`at fak

    jm arjbtrary strbkes S=X.

    Eddenm omvmndpmo bts dwk jbe tajnm hdomn

    fannmo Eddenm Jbe_ajnm. Eddenm Jbe_ajnm `asjmmk omsbekmo td sfanm bktd t`m pmtajytm rakem

    afrdss `ukormos dr mvmk t`dusakos dg

    fdhputmrs, ako ansd td masm t`m aoobtbdk dg

    hdrm haf`bkms wbt`dut huf` rmfdkgbeuratbdk,

    t`mrmjy hacbke t`m gunnmst usm dg t`m rmsdurfms

    S=X.

    Eddenm Jbe_ajnm bs jubnt dk tdp dg t`m Eddenm

    Gbnm Vystmh, F`ujjy ako stdrmo bk ak

    bhhutajnm oata strufturm fannmo VV_ajnm

    w`bf` gafbnbtatms t`m stdraem dg nde ako oata

    gbnms S=X. F`ujjy bs usmo jy Jbe_ajnm td stdrm

    t`m rddt tajnmt, sf`mha omtabns, affmss fdktrdn

    nbsts, fddrobkatm ako bomktbgy tajnmt smrvmrs S=X.

    @jasm bs t`m dpmk sdurfm vmrsbdk dg Jbe_ajnm.

    @jasm mhunatms hdst dg t`m gukftbdkanbtbms

    prdvbomo jy Jbe_ajnm. Nbcm hdst kdk VQN

    oatajasm systmhs, @jasm bs wrbttmk bk Iava.

    @jasm bs ak Apaf`m dpmk sdurfm prdimft ako

    abhs td prdvbom a stdraem systmh sbhbnar tdJbe_ajnm bk t`m @aoddp obstrbjutmo fdhputbke

    mkvbrdkhmkt. @aoddp Obstrbjutmo Gbnm Vystmh

    (@OGV) bs a obstrbjutmo gbnm systmh strufturm

    gdr dpmratbke dk fdhhdk `arowarm strufturms

    (fdhhdobty fdhputmrs) f`araftmrbzmo jy ndw

    fdst bhpnmhmktatbdk.

    6.6 ODF]HMK_ OA_AJAVM

    Odfuhmkt oatajasm bs kdt fdkfmrkmo ajdut

    `be` pmrgdrhakfm rmao ako wrbtm fdkfurrmkt,

    jut rat`mr td mksurm t`at jbe oata stdraem ako

    eddo qumry pmrgdrhakfm S6X. _ypbfan

    odfuhmkt oatajasm arm Hdked OJ, Fduf`

    OJ.

    Fduf` OJ bs dkm hdst pdpunar odfuhmkt

    oatajasm w`bf` bs gnmxbjnm, gaunt-tdnmrakt

    oatajasm, w`bf` suppdrts oata gdrhat fannmo

    IVDK. Bt prdvboms ZMV_-stynm A\B td mksurm

    oata fdksbstmkfy, Fduf` OJ fdhpny wbt`

    AFBO prdpmrtbms. Bk aoobtbdk, Fduf` OJ

    prdvboms a \:\-jasmo obstrbjutmo oatajasm

    sdnutbdk t`at suppdrts jbobrmftbdkan rmpnbfatbdk.

    @dwmvmr, bt ansd `as sdhm nbhbtatbdks, suf` as

    dkny prdvbobke ak bktmrgafm jasmo dk @__\ZMV_, fdkfurrmkt rmao ako wrbtm pmrgdrhakfm

    bs kdt boman ako sd dk S6X.

    Hdked OJ bs kdk-rmnatbdkan oatajasm, w`bf`

    gmaturms t`m rbf`mst ako hdst nbcm t`m rmnatbdkan

    oatajasm. Bt suppdrts fdhpnmx oata typms

    w`bf` usms JIVDK oata strufturms td stdrm

    fdhpnmx oata typm S6X. Bt usms pdwmrgun qumry

    nakeuaem w`bf` anndws hdst dg gukftbdks nbcm

    qumry bk sbkenm-tajnm dg rmnatbdkan oatajasms,ako ansd suppdrt bkomx. @be`-spmmo affmss td

    hass oata9 w`mk t`m oata mxfmmos =

  • 8/13/2019 NoSQL: A Distributed Solution of Big Data

    4/4

    ?

    _raobtbdkan oatajasm arf`btmfturms `avm prdvmo

    td jm bkapprdprbatm gdr haky usm fasms

    jmfausm bk furrmkt sfmkarbd Vpmmo ako

    sfanajbnbty arm kmmo dg ak `dur. _`mrmgdrm

    kdwaoays appnbfatbdks arm s`bgtbke tdwaros Bk-

    Hmhdry oata stdraem w`bf` fduno jddst t`m

    oata affmss ako t`m systmh fduno nddc gdrwaro

    gdr oatajasms w`bf` fduno wdrc affdrobke tdt`m usm fasms ako KdVqn bs t`m sdnutbdk gdr usm

    fasms w`mrm AFBO bs kdt t`m haidr fdkfmrk. Bk

    t`bs rmearo a vmry gbkm prdouft dg drafnm t`at bs

    Fd`mrmkfm prdvboms us wbt` t`rmm jasbf

    mktmrprbsm gukftbdkanbtbms b.m. spmmo, sfanajbnbty

    ako gaunt tdnmrakfm. Bt `as kd sbkenm pdbkts dg

    gabnurm, bt autdhatbfanny ako traksparmktny gabns

    dvmr ako rmobstrbjutms bts fnustmrmo oata

    hakaemhmkt smrvbfms w`mk a smrvmr jmfdhms

    bkdpmratbvm dr bs obsfdkkmftmo grdh t`mkmtwdrc. Bt _raksparmktny rmobstrbjutm t`m

    fnustmr ndao, Fd`mrmkfm ruks dk t`m iava

    pnatgdrh ako bs ak mxahpnm dg KdVQN

    oatajasms , Bt odms `dwmvmr suppdrt ak djimft

    jasmo qumry nakeuaem w`bf` bs kdt obssbhbnar

    td VQN Bt bs omsbekmo gdr vmry gast oata affmss

    vba nddcups jasmo dk sbhpnm attrbjutms, jut

    `avm nbhbtatbdks dg bts dwk t`at w`mk

    fdhpnmxbty bkfrmasms ako oata kmmos td jm

    fmktranbzmo t`mk fd`mrmkfm nbhbts btsmng as oatapartbtbdkbke jmfdhms obggbfunt Bt bs kdt subtmo

    gdr fdhpnmx oata dpmratbdks dr ndke

    traksaftbdks.

    _`m fdhjbkatbdk dg rmnatbdkan oatajasms ako

    KdVQN wbnn jrbke a jbe f`akem bk oata stdraem

    ako jy t`m pdpunarbty eabkmo jy KdVQN

    oatajasms bt smmhs as bg agtmr tmk ymars hay jm

    t`m traobtbdkan oatajasm wduno emt mraobfatmo

    ako KdVQN wduno tacm up bts pdsbtbdk.

    =. AFCKDUNMOEHMK_

    B ah `martbny t`akcgun td Or. Ha`hduo

    Hdannmh w`dsm mkfduraemhmkt, euboakfmako

    suppdrt mkajnmo hm td prdvbom ak

    ukomrstakobke dg t`m sujimft.

    7. ZMGMZMKFMV

    S1X Dchak, Nbdr3 Ean-Dz, Kurbt3 Edkmk,

    Rardk3 Euoms, M`uo3 Ajrahdv, Imkky3 ,

    "Vmfurbty Bssums bk KdVQN Oatajasms," _rust,

    Vmfurbty ako \rbvafy bk Fdhputbke ako

    Fdhhukbfatbdks (_rustFdh), :