Quantitative impact of ordering and grouping on semantic aggregation quality
Determine how input document ordering and semantic grouping strategies (e.g., clustering-based partitioning) influence the quality of summarization produced by LOTUS’s sem_agg operator, and provide quantitative metrics and empirical evaluations that compare naive ordering against semantic-cluster-based partitioning for multi-document aggregation tasks.
References
We leave a quantitative study of this to future work and believe that semantic aggregations create a rich design space for optimization.
— Semantic Operators: A Declarative Model for Rich, AI-based Data Processing
(2407.11418 - Patel et al., 16 Jul 2024) in Section 3.4 (sem_agg: Optimizations)