Trajectory Similarity Measurement: An Efficiency Perspective (2311.00960v3)
Abstract: Trajectories that capture object movement have numerous applications, in which similarity computation between trajectories often plays a key role. Traditionally, the similarity between two trajectories is quantified by means of heuristic measures, e.g., Hausdorff or ERP, that operate directly on the trajectories. In contrast, recent studies exploit deep learning to map trajectories to d-dimensional vectors, called embeddings. Then, some distance measure, e.g., Manhattan or Euclidean, is applied to the embeddings to quantify trajectory similarity. The resulting similarities are inaccurate: they only approximate the similarities obtained using the heuristic measures. As distance computation on embeddings is efficient, the focus has been on learning embeddings that yield high accuracy. Adopting an efficiency perspective, we analyze the time complexities of both the heuristic and the learning-based approaches, finding that the time complexities of the former approaches are not necessarily higher. Through extensive experiments on open datasets, we find that, on both CPUs and GPUs, only a few learning-based approaches can deliver the promised higher efficiency when the embeddings can be pre-computed, while heuristic approaches are more efficient for one-off computations. Among the learning-based approaches, the self-attention-based ones are the fastest to learn embeddings that also yield the highest accuracy for similarity queries. These results have implications for the use of trajectory similarity approaches given different application requirements.
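To make the contrast drawn in the abstract concrete, the following is a minimal sketch, not the paper's implementation. It assumes trajectories are sequences of 2D points and that the embeddings have already been produced by some learned encoder, which is not shown. The function names `hausdorff` and `embedding_distance` are illustrative. The heuristic measure compares all point pairs, so it costs O(n·m) for trajectories of lengths n and m, whereas the embedding distance costs O(d) once the vectors exist.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]


def hausdorff(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Symmetric Hausdorff distance between two point sequences.

    Operates directly on the trajectories; every point of one trajectory
    is compared with every point of the other, i.e. O(len(a) * len(b)).
    """
    def directed(src: Sequence[Point], dst: Sequence[Point]) -> float:
        # Largest distance from a point in src to its nearest point in dst.
        return max(min(math.dist(p, q) for q in dst) for p in src)

    return max(directed(a, b), directed(b, a))


def embedding_distance(u: Sequence[float], v: Sequence[float],
                       metric: str = "euclidean") -> float:
    """Distance between two d-dimensional embeddings, O(d).

    Assumes u and v were computed beforehand by a trajectory encoder
    (hypothetical here; any learned model could fill this role).
    """
    if len(u) != len(v):
        raise ValueError("embeddings must have the same dimension")
    if metric == "euclidean":
        return math.dist(u, v)
    if metric == "manhattan":
        return sum(abs(x - y) for x, y in zip(u, v))
    raise ValueError(f"unknown metric: {metric}")


if __name__ == "__main__":
    t1 = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
    t2 = [(0.0, 0.0), (5.0, 0.0)]
    print(hausdorff(t1, t2))  # 3.0

    # Stand-in vectors; real embeddings would come from a trained model.
    e1, e2 = [0.0, 0.0, 0.0], [1.0, 2.0, 2.0]
    print(embedding_distance(e1, e2))               # 3.0
    print(embedding_distance(e1, e2, "manhattan"))  # 5.0
```

The sketch leaves out the cost of computing the embeddings themselves. That cost is what the abstract's efficiency comparison turns on: the O(d) distance only pays off when embeddings can be pre-computed and reused.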