Non-Neighbors Also Matter to Kriging: A New Contrastive-Prototypical Learning (2401.12681v1)
Abstract: Kriging aims to estimate the attributes of unsampled geo-locations from observations in their spatial vicinity or along physical connections, which helps mitigate the skewed monitoring caused by under-deployed sensors. Existing works assume that neighbors' information is the basis for estimating the attributes of the unobserved target, and they ignore non-neighbors. However, non-neighbors can also offer constructive information, and neighbors can be misleading. To this end, we propose "Contrastive-Prototypical" self-supervised learning for Kriging (KCP), which refines the valuable information from neighbors and recycles that from non-neighbors. As a pre-training paradigm, we approach the Kriging task from the new perspective of representation: we first learn robust and general representations and then recover attributes from those representations. A neighboring contrastive module coarsely learns the representations by narrowing the representation distance between the target and its neighbors while pushing the non-neighbors away. In parallel, a prototypical module identifies similar representations via exchanged prediction, thereby filtering out the misleading neighbors and recycling the useful non-neighbors from the neighboring contrast component. As a result, only some of the neighbors, together with some of the non-neighbors, are used to infer the target. To encourage the two modules to learn general and robust representations, we design an adaptive augmentation module that incorporates data-driven attribute augmentation and centrality-based topology augmentation over the spatiotemporal Kriging graph data. Extensive experiments on real-world datasets demonstrate the superior performance of KCP compared to its peers, with improvements of 6%, as well as exceptional transferability and robustness. The code is available at https://github.com/bonaldli/KCP
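To make the neighboring contrastive idea concrete, the following is a minimal sketch of an InfoNCE-style loss for a single target node. It is not taken from the KCP code base: the function names `cosine` and `neighbor_contrastive_loss`, the cosine similarity measure, the `temperature` value, and the toy representations are all assumptions chosen for illustration. The loss is low when the target's representation is close to those of its neighbors and far from those of its non-neighbors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def neighbor_contrastive_loss(reps, adjacency, target, temperature=0.5):
    """InfoNCE-style loss for one target node (illustrative assumption).

    reps:      list of node representations (lists of floats)
    adjacency: 0/1 matrix; adjacency[i][j] == 1 marks j as a neighbor of i
    Neighbors act as positives, all other nodes as negatives.
    """
    others = [j for j in range(len(reps)) if j != target]
    # Exponentiated, temperature-scaled similarity to every other node.
    scores = {
        j: math.exp(cosine(reps[target], reps[j]) / temperature)
        for j in others
    }
    denom = sum(scores.values())
    neighbors = [j for j in others if adjacency[target][j]]
    if not neighbors:
        return 0.0  # no positives, so nothing to pull closer
    # Average negative log-probability assigned to the neighbors.
    return -sum(math.log(scores[j] / denom) for j in neighbors) / len(neighbors)


# Toy example: node 0 is the target; node 1 is similar, node 2 is dissimilar.
reps = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
adj_similar_neighbor = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]     # neighbor = node 1
adj_dissimilar_neighbor = [[0, 0, 1], [0, 0, 0], [1, 0, 0]]  # neighbor = node 2

loss_close = neighbor_contrastive_loss(reps, adj_similar_neighbor, target=0)
loss_far = neighbor_contrastive_loss(reps, adj_dissimilar_neighbor, target=0)
print(loss_close < loss_far)
```

In the second adjacency matrix the only neighbor is dissimilar to the target, so the loss is large. This is the "misleading neighbor" situation that the abstract says the prototypical module is meant to correct; that module is not sketched here.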