久久久久久久av_日韩在线中文_看一级毛片视频_日本精品二区_成人深夜福利视频_武道仙尊动漫在线观看

<i id='2ddLT'><tr id='2ddLT'><dt id='2ddLT'><q id='2ddLT'><span id='2ddLT'><b id='2ddLT'><form id='2ddLT'><ins id='2ddLT'></ins><ul id='2ddLT'></ul><sub id='2ddLT'></sub></form><legend id='2ddLT'></legend><bdo id='2ddLT'><pre id='2ddLT'><center id='2ddLT'></center></pre></bdo></b><th id='2ddLT'></th></span></q></dt></tr></i><div class="qwawimqqmiuu" id='2ddLT'><tfoot id='2ddLT'></tfoot><dl id='2ddLT'><fieldset id='2ddLT'></fieldset></dl></div>
  • <small id='2ddLT'></small><noframes id='2ddLT'>

    <legend id='2ddLT'><style id='2ddLT'><dir id='2ddLT'><q id='2ddLT'></q></dir></style></legend>
    • <bdo id='2ddLT'></bdo><ul id='2ddLT'></ul>

      1. <tfoot id='2ddLT'></tfoot>
      2. 即時(shí)搜索 PB 級(jí)數(shù)據(jù)

        instant searching in petabyte of data(即時(shí)搜索 PB 級(jí)數(shù)據(jù))
          <tbody id='Z2Ifd'></tbody>
        <i id='Z2Ifd'><tr id='Z2Ifd'><dt id='Z2Ifd'><q id='Z2Ifd'><span id='Z2Ifd'><b id='Z2Ifd'><form id='Z2Ifd'><ins id='Z2Ifd'></ins><ul id='Z2Ifd'></ul><sub id='Z2Ifd'></sub></form><legend id='Z2Ifd'></legend><bdo id='Z2Ifd'><pre id='Z2Ifd'><center id='Z2Ifd'></center></pre></bdo></b><th id='Z2Ifd'></th></span></q></dt></tr></i><div class="qwawimqqmiuu" id='Z2Ifd'><tfoot id='Z2Ifd'></tfoot><dl id='Z2Ifd'><fieldset id='Z2Ifd'></fieldset></dl></div>

          <legend id='Z2Ifd'><style id='Z2Ifd'><dir id='Z2Ifd'><q id='Z2Ifd'></q></dir></style></legend>
          • <tfoot id='Z2Ifd'></tfoot>
            1. <small id='Z2Ifd'></small><noframes id='Z2Ifd'>

                • <bdo id='Z2Ifd'></bdo><ul id='Z2Ifd'></ul>
                  本文介紹了即時(shí)搜索 PB 級(jí)數(shù)據(jù)的處理方法,對(duì)大家解決問題具有一定的參考價(jià)值,需要的朋友們下面隨著小編來一起學(xué)習(xí)吧!

                  問題描述

                  我需要在 CSV 格式文件中搜索超過 PB 的數(shù)據(jù).使用 LUCENE 建立索引后,索引文件的大??小是原始文件的兩倍.是否可以減少索引文件的大??小???HADOOP中如何分發(fā)LUCENE索引文件以及如何在搜索環(huán)境中使用?還是有必要,我應(yīng)該使用 solr 來分發(fā) LUCENE 索引嗎???我的要求是對(duì) PB 的文件進(jìn)行即時(shí)搜索....

                  I need to search over petabyte of data in CSV formate files. After indexing using LUCENE, the size of the indexing file is doubler than the original file. Is it possible to reduce the indexed file size??? How to distribute LUCENE index files in HADOOP and how to use in searching environment? or is it necessary, should i use solr to distribute the LUCENE index??? My requirement is doing instant search over petabyte of files....

                  推薦答案

                  任何體面的現(xiàn)成搜索引擎(如 Lucene)都應(yīng)該能夠提供超過您擁有的數(shù)據(jù)大小的搜索功能.您可能需要預(yù)先做一些工作來設(shè)計(jì)索引并配置搜索的工作方式,但這只是配置.

                  Any decent off the shelf search engine (like Lucene) should be able to provide search functionality over the size of data you have. You may have to do a bit of work up front to design the indexes and configure how the search works, but this is just config.

                  您不會(huì)立即獲得結(jié)果,但您也許能夠很快獲得結(jié)果.速度可能取決于您的設(shè)置方式以及運(yùn)行的硬件類型.

                  You won't get instant results but you might be able to get very quick results. The speed will probably depend on how you set it up and what kind of hardware you run on.

                  您提到索引大于原始數(shù)據(jù).這是可以預(yù)料的.索引通常包括某種形式的非規(guī)范化.索引的大小通常是與速度的權(quán)衡;提前對(duì)數(shù)據(jù)進(jìn)行切片和切塊的方式越多,找到參考的速度就越快.

                  You mention that the indexes are larger than the original data. This is to be expected. Indexing usually includes some form of denormalisation. The size of the indexes is often a trade off with speed; the more ways you slice and dice the data in advance, the quicker it is to find references.

                  最后你提到了分發(fā)索引,這幾乎肯定不是你想做的事情.分發(fā)許多 PB 數(shù)據(jù)的實(shí)用性非常令人生畏.您可能想要的是將索引放在某處的大型計(jì)算機(jī)上并提供數(shù)據(jù)搜索服務(wù)(將查詢帶到數(shù)據(jù)中,不要將數(shù)據(jù)帶到查詢中).

                  Lastly you mention distributing the indexes, this is almost certainly not something you want to do. The practicalities of distributing many petabytes of data are pretty daunting. What you probably want is to have the indexes sat on a big fat computer somewhere and provide search services on the data (bring the query to the data, don't take the data to the query).

                  這篇關(guān)于即時(shí)搜索 PB 級(jí)數(shù)據(jù)的文章就介紹到這了,希望我們推薦的答案對(duì)大家有所幫助,也希望大家多多支持html5模板網(wǎng)!

                  【網(wǎng)站聲明】本站部分內(nèi)容來源于互聯(lián)網(wǎng),旨在幫助大家更快的解決問題,如果有圖片或者內(nèi)容侵犯了您的權(quán)益,請(qǐng)聯(lián)系我們刪除處理,感謝您的支持!

                  相關(guān)文檔推薦

                  How can I detect integer overflow on 32 bits int?(如何檢測(cè) 32 位 int 上的整數(shù)溢出?)
                  Local variables before return statements, does it matter?(return 語句之前的局部變量,這有關(guān)系嗎?)
                  How to convert Integer to int?(如何將整數(shù)轉(zhuǎn)換為整數(shù)?)
                  How do I create an int array with randomly shuffled numbers in a given range(如何在給定范圍內(nèi)創(chuàng)建一個(gè)隨機(jī)打亂數(shù)字的 int 數(shù)組)
                  Inconsistent behavior on java#39;s ==(java的行為不一致==)
                  Why is Java able to store 0xff000000 as an int?(為什么 Java 能夠?qū)?0xff000000 存儲(chǔ)為 int?)
                • <tfoot id='vUHoc'></tfoot>
                • <legend id='vUHoc'><style id='vUHoc'><dir id='vUHoc'><q id='vUHoc'></q></dir></style></legend>

                      1. <i id='vUHoc'><tr id='vUHoc'><dt id='vUHoc'><q id='vUHoc'><span id='vUHoc'><b id='vUHoc'><form id='vUHoc'><ins id='vUHoc'></ins><ul id='vUHoc'></ul><sub id='vUHoc'></sub></form><legend id='vUHoc'></legend><bdo id='vUHoc'><pre id='vUHoc'><center id='vUHoc'></center></pre></bdo></b><th id='vUHoc'></th></span></q></dt></tr></i><div class="qwawimqqmiuu" id='vUHoc'><tfoot id='vUHoc'></tfoot><dl id='vUHoc'><fieldset id='vUHoc'></fieldset></dl></div>
                          <bdo id='vUHoc'></bdo><ul id='vUHoc'></ul>

                          <small id='vUHoc'></small><noframes id='vUHoc'>

                            <tbody id='vUHoc'></tbody>
                            主站蜘蛛池模板: 精品一区二区三区在线观看国产 | 久久久精品综合 | 久久高清| 91久久网站| 亚洲欧洲精品成人久久奇米网 | 国产精品自拍视频网站 | 成年人在线观看视频 | 国产在线精品一区二区 | 91视频电影 | 精品亚洲一区二区三区 | 99热精品在线观看 | 亚洲精品一区二区 | 在线视频一区二区三区 | 亚洲www啪成人一区二区 | 久久久久国产一区二区三区 | 欧美国产日韩一区二区三区 | 亚洲综合在线视频 | 成人黄色三级毛片 | 国产伦精品一区二区三区高清 | 91国产在线视频在线 | 一区二区三区四区在线 | 久久国产三级 | 国产精品福利久久久 | 中文字幕在线一区 | 久久蜜桃av | 精品日韩欧美一区二区 | 久久不射网 | 久久国产日韩欧美 | 欧美不卡在线 | se婷婷 | 国产精品视频一区二区三区四区国 | 欧美精品久久久久 | 成人h动漫亚洲一区二区 | 亚洲一区二区三区在线视频 | 成人午夜精品一区二区三区 | 2019天天干夜夜操 | 一级免费毛片 | 天天看天天摸天天操 | www.精品一区 | 国产精品视频在线播放 | 国产精品特级毛片一区二区三区 |