کتاب های Hernández-Tamames, Juan Antonio; Mato Abad, Virginia; García-Álvarez, Roberto; González-Zabaleta, Javier; Pereira-Loureiro, Javier; Álvarez-Linera, Juan

download pdf Effect of water T2 shortening in the quantification of in-vitro proton MR spectroscopy, دانلود Effect of water T2 shortening in the quantification of in-vitro proton MR spectroscopy,Hernández-Tamames, Juan Antonio; Mato Abad, Virginia; García-Álvarez, Roberto; González-Zabaleta, Javier; Pereira-Loureiro, Javier; Álvarez-Linera, Juan, کتاب های Hernández-Tamames, Juan Antonio; Mato Abad, Virginia; García-Álvarez, Roberto; González-Zabaleta, Javier; Pereira-Loureiro, Javier; Álvarez-Linera, Juan,Wiley 2015-05-21, لیست کتاب های Wiley 2015-05-21,WorldCat, کتاب های WorldCat

گت بلاگز Internet Universal indexes for highly repetitive document collections / دانلود فایل

مشخصات کلی Universal indexes for highly repetitive document collections

نویسنده کتاب (Author):

Claude, Francisco; Fariña, Antonio; Martínez Prieto, Miguel A.; Navarro, Gonzalo

انتشارات (Publisher):

Elsevier Ltd 2016-11

ویرایش و نوع فایل (Edition/Format):

 Downloadable article : English

منبع (Database):

WorldCat

عنوان ژورنال (Publication):

francisco-claude-antonio-farin%cc%83a-miguel-a-martinez-prieto-gonzalo-navarro-universal-indexes-for-highly-repetitive-document-collections-information-systems-volume-61-october

موضوع (Subject):

Repetitive collections       Inverted index       Self-index      

توضیحات خلاصه (Summary):

[Abstract] Indexing highly repetitive collections has become a relevant problem with the emergence of large repositories of versioned documents, among other applications. These collections may reach huge sizes, but are formed mostly of documents that are near-copies of others. Traditional techniques for indexing these collections fail to properly exploit their regularities in order to reduce space. We introduce new techniques for compressing inverted indexes that exploit this near-copy regularity. They are based on run-length, Lempel–Ziv, or grammar compression of the differential inverted lists, instead of the usual practice of gap-encoding them. We show that, in this highly repetitive setting, our compression methods significantly reduce the space obtained with classical techniques, at the price of moderate slowdowns. Moreover, our best methods are universal, that is, they do not need to know the versioning structure of the collection, nor that a clear versioning structure even exists. We also introduce compressed self-indexes in the comparison. These are designed for general strings (not only natural language texts) and represent the text collection plus the index structure (not an inverted index) in integrated form. We show that these techniques can compress much further, using a small fraction of the space required by our new inverted indexes. Yet, they are orders of magnitude slower.  Read more…

ژانر / فرم:info:eu-repo/semantics/article

موضوع:Internet resource

نوع منبع:Internet Resource, Article

تمام نویسندگان / همکاران: Claude, Francisco; Fariña, Antonio; Martínez Prieto, Miguel A.; Navarro, Gonzalo

شناسه OCLC:979265251

Language Note:English

فهرست محتوا:0306-4379 1873-6076 http://hdl.handle.net/2183/18163 10.1016/j.is.2016.04.002

نویسنده : getblogs