FAST: Frequency-Aware Spatio-Textual Indexing for In-Memory Continuous Filter Query Processing
نویسندگان
چکیده
The ubiquity of spatio-textual data comes from the popularity of GPS-enabled smart devices, e.g., smartphones. These devices provide a platform that supports a wide range of applications that generate and process spatio-textual data. These applications include social networks, micro-blogs, web-search for local attractions and events, and location-aware ad targeting. These applications need to process massive amounts of spatio-textual data in real-time. For example, in location-aware ad targeting systems, it is required to disseminate millions of ads and promotions to millions of users based on the locations and textual profiles of users. To support this data scale, these applications require efficient spatio-textual indexing. There exist several related spatio-textual indexes that typically integrate a spatial index with a textual index. However, these indexes usually have a high demand for main-memory and assume that the entire vocabulary of keywords is known in advance. Also, these indexes do not successfully capture the variations of the frequencies of keywords across different spatial regions and treat frequent and infrequent keywords similarly. Moreover, existing indexes do not adapt to the changes in workload over space and time. For example, some keywords may be trending at certain times in certain locations and this may change as time passes. To maintain high performance, the index needs to adapt to these changes. In this paper, we introduce FAST, a F requency-Aware Spatio-T extual index. FAST is a main-memory index that requires up to one third of the memory needed by the state-of-the-art index. FAST does not assume prior knowledge of the entire vocabulary of indexed objects. FAST accounts for the frequencies of keywords within their corresponding spatial regions to automatically choose the best indexing approach that optimizes the insertion and search times. Extensive experimental evaluation using two real datasets demonstrates that FAST is up to 3x faster in search time and 5x faster in insertion time than the state-of-the-art indexes.
منابع مشابه
STQP: Spatio-Temporal Indexing and Query Processing
In this ongoing work, the location-aware ranking query (LRQ) are considered, an important category of location-aware query. Types of location-aware ranking query are the k-nearest neighbour (NN) query and location-aware keyword query(LKQ). NN LKQs and inquiries have vast applications in many domains. However, there are a great number of location-aware datasets that demand better and flexible lo...
متن کاملIndexing and Query Processing Techniques in Spatio-temporal Data
Indexing and query processing is an emerging research field in spatio temporal data. Most of the real-time applications such as location based services, fleet management, traffic prediction and radio frequency identification and sensor networks are based on spatiotemporal indexing and query processing. All the indexing and query processing applications is any one of the forms, such as spatio in...
متن کاملOn Processing Top-k Spatio-Textual Preference Queries
In this paper we propose a novel query type, termed top-k spatio-textual preference query, that retrieves a set of spatiotextual objects ranked by the goodness of the facilities in their neighborhood. Consider for example, a tourist that looks for “hotels that have nearby a highly rated Italian restaurant that serves pizza”. The proposed query type takes into account not only the spatial locati...
متن کاملContinuous Query Processing of Spatio-temporal Data Streams in PLACE
The tremendous increase in the use of cellular phones, GPS-like devices, and RFIDs results in highly dynamic environments where objects as well as queries are continuously moving. In this paper, we present a continuous query processor designed specifically for highly dynamic environments (e.g., location-aware environments). We implemented the proposed continuous query processor inside the PLACE...
متن کاملSEAL: Spatio-Textual Similarity Search
Location-based services (LBS) have become more and more ubiquitous recently. Existing methods focus on finding relevant points-of-interest (POIs) based on users’ locations and query keywords. Nowadays, modern LBS applications generate a new kind of spatio-textual data, regions-of-interest (ROIs), containing region-based spatial information and textual description, e.g., mobile user profiles wit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1709.02529 شماره
صفحات -
تاریخ انتشار 2017