Obtain Starmie containment measurements on YADL 50k and Open Data US
Determine the Jaccard containment results for the Starmie retrieval method on the YADL 50k and Open Data US data lakes. This requires computing the containment between each query column and the candidate columns returned by Starmie, following the same evaluation protocol used elsewhere in the paper (e.g., averaging over the top retrieved candidates across base tables), so that the results are directly comparable with those of Exact Matching, MinHash, and Hybrid MinHash.
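The measurement described above can be sketched in a few lines of Python. This is only an illustration, assuming containment is defined over distinct values as |Q ∩ C| / |Q| for a query column Q and a candidate column C. The function names, the cutoff `k`, and the toy columns are hypothetical and are not taken from the paper or from Starmie's code.

```python
from statistics import mean


def containment(query_values, candidate_values):
    """Jaccard containment of the query in the candidate: |Q ∩ C| / |Q|.

    Assumption: both columns are compared as sets of distinct values.
    """
    q = set(query_values)
    if not q:
        return 0.0  # an empty query column has no values to contain
    return len(q & set(candidate_values)) / len(q)


def mean_top_k_containment(query_values, ranked_candidates, k=10):
    """Average containment over the first k candidates of a ranked list.

    `ranked_candidates` stands in for the columns a retrieval method
    (such as Starmie) returns, best match first. `k` is a hypothetical
    cutoff, not the value used in the paper.
    """
    top = ranked_candidates[:k]
    if not top:
        return 0.0
    return mean(containment(query_values, c) for c in top)


# Toy data, for illustration only.
query = ["paris", "rome", "berlin", "madrid"]
candidates = [
    ["paris", "rome", "oslo"],
    ["berlin", "madrid", "paris", "rome"],
    ["lima"],
]

scores = [containment(query, c) for c in candidates]
avg = mean_top_k_containment(query, candidates, k=2)
```

Containment is asymmetric: it measures how much of the query column is covered by the candidate, so a large candidate column is not penalized for its extra values. Averaging these per-query scores across base tables would then give one number per retrieval method and data lake.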
References
We were unable to obtain the containment results for Starmie on YADL 50k and Open Data US.
— Retrieve, Merge, Predict: Augmenting Tables with Data Lakes
(2402.06282 - Cappuzzo et al., 9 Feb 2024) in Section 5.2, Containment affects the entire process