Papers
Topics
Authors
Recent
Search
2000 character limit reached

Online Group Feature Selection

Updated 16 March 2026
  • Online Group Feature Selection (OGFS) is a method that selects relevant feature groups from a data stream by leveraging group structure to maximize between-class variance and minimize within-class variance.
  • It employs a two-stage process using spectral analysis for intra-group selection and Lasso regression to eliminate redundant features across groups.
  • OGFS has demonstrated robust performance in image recognition and bioinformatics, showing stability under varying group arrival orders and improving classification accuracy with fewer features.

Online Group Feature Selection (OGFS) refers to a class of algorithms specifically designed for selecting relevant features from a dynamically arriving data stream, where features arrive in groups rather than individually. OGFS maintains the structural integrity of feature groups, such as those encountered when multi-modal or multi-descriptor features are computed in batches (e.g., SIFT descriptors in images, sensor arrays, biological pathway-based genes). The OGFS methodology emphasizes discriminative power and redundancy control under real-time or streaming constraints, utilizing principled statistical and optimization criteria as new groups are acquired and incrementally incorporated into the selected subset (Wang et al., 2016, Jing et al., 2014).

1. Problem Setting and Formalization

Let XRd×nX\in \mathbb{R}^{d\times n} denote the data matrix with nn instances and dd total features, and y{1,,c}ny\in \{1,\ldots,c\}^n the class labels. In OGFS, features are not assumed to be available at once; instead, they arrive as a sequence of groups, Gj={fj1,,fjdj}G_j = \{f_{j1},\ldots,f_{jd_j}\} at each timestep jj. Each fjkf_{jk} is a feature vector of length nn. The algorithm, unware of future groups, incrementally updates the selected subset U=jgjU = \bigcup_j g_j where gjGjg_j \subseteq G_j. Typical stopping criteria include reaching a target cardinality UK|U|\geq K, achieving a desired classification accuracy on UU, or exhausting the feature stream.

The selection objective is threefold:

  1. Maximize between-class variance,
  2. Minimize within-class variance,
  3. Reduce redundancy among selected features across all groups.

2. Intra-Group Feature Selection via Spectral Analysis

At each group arrival, OGFS applies a spectral criterion to decide, in an online manner, whether a feature should be provisionally included. Affinity matrices SbS_b (between-class) and SwS_w (within-class) are constructed as follows:

(Sb)ij={1n1nif yi=yj= 1nif yiyj(S_b)_{ij} = \begin{cases} \frac{1}{n} - \frac{1}{n_\ell} & \text{if } y_i = y_j = \ell \ \frac{1}{n} & \text{if } y_i \neq y_j \end{cases}

(Sw)ij={1nif yi=yj= 0otherwise(S_w)_{ij} = \begin{cases} \frac{1}{n_\ell} & \text{if } y_i = y_j = \ell \ 0 & \text{otherwise} \end{cases}

With degree matrices Db,DwD_b, D_w and Laplacians Lb=DbSbL_b = D_b - S_b, Lw=DwSwL_w = D_w - S_w, for any current feature set UU, the spectral ratio is defined as: F(U)=tr(XULbXU)tr(XULwXU)F(U) = \frac{\operatorname{tr}(X_U L_b X_U^\top)}{\operatorname{tr}(X_U L_w X_U^\top)} where XUX_U is the submatrix of XX restricted to features selected in UU.

For each incoming feature ff, the increment F(U{f})F(U)F(U\cup\{f\}) - F(U) is evaluated. If F(U{f})F(U)>ϵ|F(U\cup\{f\}) - F(U)| > \epsilon (e.g., ϵ=103\epsilon = 10^{-3}), ff is included in the provisional intra-group set GjG_j'. This criterion enforces the selection of features that enhance between-class discrimination or compact within-class scatter as measured spectrally (Wang et al., 2016, Jing et al., 2014).

3. Inter-Group Feature Selection via Sparse Regression

Following the intra-group pass, OGFS seeks to further prune redundancy both within and across groups using Lasso regression. For the current selection UU and interim group GjG_j', an augmented feature matrix AA is constructed from UGjU \cup G_j'. The algorithm solves:

minβyAβ22+λβ1\min_{\beta} \| y - A^\top \beta \|_2^2 + \lambda \|\beta\|_1

Features corresponding to nonzero entries in β\beta are retained; features with zero coefficient are discarded. This step leverages the sparsity-inducing 1\ell_1 penalty to encourage a compact, non-redundant selection for subsequent group arrivals (Wang et al., 2016, Jing et al., 2014).

4. Algorithmic Summary and Complexity

The overall OGFS pipeline consists of repeated, two-stage processing for each arriving group:

  1. Intra-group selection: For each feature in group GjG_j, compute F(U{f})F(U)F(U\cup \{f\}) - F(U) and include ff in GjG_j' if the increment exceeds ϵ\epsilon.
  2. Inter-group Lasso: Apply sparse regression over UGjU \cup G_j'; keep only those features with nonzero regression weights, updating UU.
  3. Stopping check: Terminate if the selected feature set reaches cardinality KK, achieves the desired accuracy, or the stream ends.

The intra-group computation involves efficient eigen-trace updates of the spectral ratio, O(p)O(p) per feature with pp small, while the inter-group Lasso regression is empirically linear or low polynomial in current feature count. Over TT groups, with mjm_j features per group, overall time complexity is O(j(mj+Uj))O(\sum_j (m_j + |U_j|)) and space is governed by O(dn)O(dn) and Laplacian storage (Wang et al., 2016, Jing et al., 2014).

5. Experimental Results and Empirical Characteristics

OGFS has been benchmarked on both image and biological data where group structure is natural or synthesized:

  • Image benchmarks:
    • Cifar-10 (color image classification): OGFS achieved 49.6% accuracy with ≈2k features; using only intra-group step achieves 51.2% (≈5k features), offering a controllable trade-off.
    • Caltech-101 (object recognition): Outperformed all online baselines by an absolute 6–13% margin with 1–2k features.
    • LFW (face verification): Achieved ~81% accuracy versus 77% for Grafting and 66% for OSFS, using ~1.5k features.
  • UCI/microarray benchmarks:
    • OGFS surpassed other online methods on 6/8 tasks and achieved comparable or better accuracy than offline LARS/MI selection with fewer features.
  • Robustness: OGFS exhibited empirical stability under random permutation of group arrival order, with accuracy variance <\lt1% (Wang et al., 2016, Jing et al., 2014).

Compared baselines included Alpha-investing, OSFS, Grafting (online) and mutual information, LARS, GBFS (offline). Metrics tracked were accuracy, number of features (compactness), and CPU time.

6. Theoretical Analysis and Guarantees

No formal convergence or regret-type guarantees are provided for OGFS. However, empirical evaluation demonstrates algorithmic stability concerning group order and efficacy in both discriminative power and redundancy pruning. The approach systematically combines local spectral discriminability with global sparsity control at each group addition. The performance and output subset depend explicitly on group quality and the parameters ϵ\epsilon and λ\lambda, which require tuning (Wang et al., 2016, Jing et al., 2014).

7. Applications, Limitations, and Extensions

OGFS is applicable where feature groups arrive dynamically or possess inherent structure: multi-descriptor visual analysis, sensor networks with grouped measurements, bioinformatics with grouped genes, or any streaming data pipeline where "early, small-footprint" discriminative models are needed. The method assumes meaningful group boundaries and requires ϵ\epsilon and λ\lambda selection for spectral and Lasso steps. There is no theoretical guarantee of global optimality. A plausible implication is that further advances in theoretical understanding and adaptive parameter selection could extend OGFS to domains with weaker or less defined structural grouping.


References

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Online Group Feature Selection (OGFS).