StarDB: a large-scale DBMS for strings

Handle URI:
http://hdl.handle.net/10754/578861
Title:
StarDB: a large-scale DBMS for strings
Authors:
Sahli, Majed ( 0000-0002-4576-9708 ) ; Mansour, Essam; Kalnis, Panos ( 0000-0002-5060-1360 )
Abstract:
Strings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90% efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.
KAUST Department:
Computer Science Program; Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Publisher:
VLDB Endowment
Journal:
Proceedings of the VLDB Endowment
Conference/Event name:
Proceedings of the 41st International Conference on Very Large Data Bases
Issue Date:
1-Aug-2015
DOI:
10.14778/2824032.2824082
Type:
Conference Paper
ISSN:
StarDB 2015, 8 (12):1844 Proceedings of the VLDB Endowment; 21508097
Additional Links:
http://dl.acm.org/citation.cfm?doid=2824032.2824082
Appears in Collections:
Conference Papers; Computer Science Program; Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division

Full metadata record

DC FieldValue Language
dc.contributor.authorSahli, Majeden
dc.contributor.authorMansour, Essamen
dc.contributor.authorKalnis, Panosen
dc.date.accessioned2015-09-29T10:20:25Zen
dc.date.available2015-09-29T10:20:25Zen
dc.date.issued2015-08-01en
dc.identifier.issnStarDB 2015, 8 (12):1844 Proceedings of the VLDB Endowmenten
dc.identifier.issn21508097en
dc.identifier.doi10.14778/2824032.2824082en
dc.identifier.urihttp://hdl.handle.net/10754/578861en
dc.description.abstractStrings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90% efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.en
dc.publisherVLDB Endowmenten
dc.relation.urlhttp://dl.acm.org/citation.cfm?doid=2824032.2824082en
dc.rightsThis work is licensed under the Creative Commons AttributionNonCommercialNoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/byncnd/ 3.0/. Obtain permission prior to any use beyond those covered by the license. Contact copyright holder by emailing info@vldb.org.en
dc.titleStarDB: a large-scale DBMS for stringsen
dc.typeConference Paperen
dc.contributor.departmentComputer Science Programen
dc.contributor.departmentComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Divisionen
dc.identifier.journalProceedings of the VLDB Endowmenten
dc.conference.dateAugust 1, 2015en
dc.conference.nameProceedings of the 41st International Conference on Very Large Data Basesen
dc.conference.locationKohala Coast, Hawaiien
dc.eprint.versionPublisher's Version/PDFen
dc.contributor.institutionQatar Computing Research Institute, Doha, Qataren
kaust.authorSahli, Majeden
kaust.authorKalnis, Panosen
All Items in KAUST are protected by copyright, with all rights reserved, unless otherwise indicated.