- The paper introduces a novel semi-supervised framework that leverages contrastive self-supervised learning to extract visual embeddings for quality evaluation.
- It employs k-means clustering on learned features to compute online and offline quality metrics that correlate with polyp detection sensitivity.
- The study demonstrates that procedures with higher quality metrics significantly increase the likelihood of polyp detection, suggesting potential for real-time feedback improvement.
The paper introduces a novel approach to semi-supervised quality evaluation of colonoscopy procedures, aiming to address the challenge of missed polyps during colonoscopies, which is often attributed to operator-dependent factors. The work proposes both online and offline quality metrics derived from the visual data of colonoscopies, leveraging machine learning models to assess the quality of the procedure.
Key aspects of the methodology include:
- Learning visual representations of colonoscopy video frames using contrastive self-supervised learning, specifically the SimCLR framework. The loss function is defined as:
ℓ(zi1,zi2)=−log∑k=i∑a=12∑b=12exp(sim(zia,zkb)/τ)exp(sim(zi1,zi2)/τ)
where:
- zi1 and zi2 are the embedding vectors of two augmented versions of frame xi
- sim(u,v)=uTv/∥u∥∥v∥ is the cosine similarity
- τ is a temperature parameter
- Applying cluster analysis on the learned representations using k-means to identify semantically meaningful groups of frames. Instead of hard assignments, the probability of the i-th frame belonging to the k-th cluster is computed as:
ri,k=Prob(fθ(xi)∈k)∼[∥fθ(xi)−ck∥221]α
where:
- fθ(xi) is the feature representation of frame xi
- ck is the center of the k-th cluster
- α=16 is a constant
- Developing an online quality metric, Q, that predicts the likelihood of detecting a polyp based on the visual appearance of video segments. The classifier Q(⋅) is trained to map the cluster assignments $\{\overline{r_{i,k}\}_{k=1}^K$ to the detection of a polyp in the subsequent 2 seconds, achieving a polyp detection prediction accuracy of 64% on the test set.
- Formulating a method to estimate the chance of detecting a polyp at a given time t, denoted as P(D∣E,Q), which represents the probability of detecting a polyp (D) given its existence (E) and the quality of the procedure (Q). This is approximated using Bayes' rule and empirical estimation of P(Q) and P(Q∣D).
- Creating an offline quality metric, QOffline, by integrating the online metric Q over the entire withdrawal phase of the colonoscopy, according to the equation:
$Q_{\text{Offline} = \sum _{i \in \text{withdrawal} Q \left(\{r_{i,k}\}_{k=1}^K\right)$
The experimental results demonstrate a strong correlation between the proposed online quality metric Q and the polyp detection sensitivity (PDS). The likelihood of detecting an existing polyp, P(D∣E,Q), increases with higher values of Q. Furthermore, the offline quality metric QOffline shows a high correlation with the standard Polyps Per Colonoscopy (PPC) metric. Procedures with higher QOffline values are more likely to have detected polyps.
The paper suggests that real-time feedback based on the online quality metric could potentially improve the quality of colonoscopy procedures and increase the adenoma detection rate (ADR).