Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

GEDI: A Graph-based End-to-end Data Imputation Framework (2208.06573v2)

Published 13 Aug 2022 in cs.LG

Abstract: Data imputation is an effective way to handle missing data, which is common in practical applications. In this study, we propose and test a novel data imputation process that achieve two important goals: (1) preserve the row-wise similarities among observations and column-wise contextual relationships among features in the feature matrix, and (2) tailor the imputation process to specific downstream label prediction task. The proposed imputation process uses Transformer network and graph structure learning to iteratively refine the contextual relationships among features and similarities among observations. Moreover, it uses a meta-learning framework to select features that are influential to the downstream prediction task of interest. We conduct experiments on real-world large data sets, and show that the proposed imputation process consistently improves imputation and label prediction performance over a variety of benchmark methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Katrina Chen (3 papers)
  2. Xiuqin Liang (1 paper)
  3. Zheng Ma (110 papers)
  4. Zhibin Zhang (10 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.