Top-k term publish/subscribe for geo-textual data streams
dc.contributor.author | Chen, Lisi | |
dc.contributor.author | Shang, Shuo | |
dc.contributor.author | Jensen, Christian S. | |
dc.contributor.author | Xu, Jianliang | |
dc.contributor.author | Kalnis, Panos | |
dc.contributor.author | Yao, Bin | |
dc.contributor.author | Shao, Ling | |
dc.date.accessioned | 2020-03-25T08:45:41Z | |
dc.date.available | 2020-03-25T08:45:41Z | |
dc.date.issued | 2020-03-09 | |
dc.date.submitted | 2019-01-19 | |
dc.identifier.citation | Chen, L., Shang, S., Jensen, C. S., Xu, J., Kalnis, P., Yao, B., & Shao, L. (2020). Top-k term publish/subscribe for geo-textual data streams. The VLDB Journal. doi:10.1007/s00778-020-00607-8 | |
dc.identifier.doi | 10.1007/s00778-020-00607-8 | |
dc.identifier.uri | http://hdl.handle.net/10754/662293 | |
dc.description.abstract | Massive amounts of data that contain spatial, textual, and temporal information are being generated at a rapid pace. With streams of such data, which includes check-ins and geo-tagged tweets, available, users may be interested in being kept up-to-date on which terms are popular in the streams in a particular region of space. To enable this functionality, we aim at efficiently processing two types of general top-k term subscriptions over streams of spatio-temporal documents: region-based top-k spatial-temporal term (RST) subscriptions and similarity-based top-k spatio-temporal term (SST) subscriptions. RST subscriptions continuously maintain the top-k most popular trending terms within a user-defined region. SST subscriptions free users from defining a region and maintain top-k locally popular terms based on a ranking function that combines term frequency, term recency, and term proximity. To solve the problem, we propose solutions that are capable of supporting real-life location-based publish/subscribe applications that process large numbers of SST and RST subscriptions over a realistic stream of spatio-temporal documents. The performance of our proposed solutions is studied in extensive experiments using two spatio-temporal datasets. | |
dc.description.sponsorship | This work was supported by the National Natural Science Foundation of China (61932004, 61922054, 61872235, 61729202, 61832017, U1636210), the National Key Research and Development Program of China (2018YFC1504504, 2016YFB0700502), and Hong Kong RGC Grant 12201018. | |
dc.publisher | Springer Nature | |
dc.relation.url | http://link.springer.com/10.1007/s00778-020-00607-8 | |
dc.rights | Archived with thanks to VLDB Journal | |
dc.title | Top-k term publish/subscribe for geo-textual data streams | |
dc.type | Article | |
dc.contributor.department | Computer Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.contributor.department | InfoCloud Research Group | |
dc.identifier.journal | The VLDB Journal | |
dc.rights.embargodate | 2021-03-09 | |
dc.eprint.version | Post-print | |
dc.contributor.institution | University of Electronic Science and Technology of China, Chengdu, China | |
dc.contributor.institution | Aalborg University, Aalborg, Denmark | |
dc.contributor.institution | Hong Kong Baptist University, Kowloon Tong, Hong Kong | |
dc.contributor.institution | Shanghai Jiao Tong University, Shanghai, China | |
dc.contributor.institution | Inception Institute of Artificial Intelligence, Abu Dhabi, UAE | |
kaust.person | Kalnis, Panos | |
dc.date.accepted | 2020-02-21 | |
refterms.dateFOA | 2020-12-29T10:30:19Z | |
dc.date.published-online | 2020-03-09 | |
dc.date.published-print | 2020-09 |
Files in this item
This item appears in the following Collection(s)
-
Articles
-
Computer Science Program
For more information visit: https://cemse.kaust.edu.sa/cs -
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/