Papers
Topics
Authors
Recent
Search
2000 character limit reached

1-D CNN Feature Extractor

Updated 13 April 2026
  • 1-D CNN feature extractor is a specialized neural network model that uses layered 1-D convolutions to extract localized features from sequential data like time series and audio signals.
  • It employs multiple convolutional, activation, and pooling layers to build hierarchical representations, efficiently downsampling data for subsequent analysis.
  • The design supports applications in fault diagnosis and domain adaptation by leveraging adversarial training and optimized parameter allocation for robust performance.

A 1-dimensional convolutional neural network (1-D CNN) feature extractor is a hierarchical neural architecture designed to learn and extract meaningful representations from univariate or multivariate sequential data, such as time series, audio, vibration signals, or other 1-D signals. It employs stacked convolutional layers and channel-wise operations to progressively distill raw input sequences into lower-dimensional, semantically salient features suitable for downstream supervised or unsupervised tasks. The 1-D CNN paradigm enables the discovery of localized and hierarchical patterns across the input domain, holding significant utility in domains ranging from condition monitoring to signal processing and function approximation.

1. Formal Structure and Architectural Variants

A generic 1-D CNN feature extractor processes an input vector x∈Rdx \in \mathbb{R}^d through a sequence of convolutional, activation, and pooling modules. Each convolutional layer applies a set of njn_j 1-D kernels Wℓ,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j} (with kernel size sjs_j and stride tjt_j) across the channel outputs of the preceding layer, interleaved with nonlinearities (typically ReLU) and optional channel-wise biases Bℓ(j)B^{(j)}_\ell. This mechanism is formalized as:

hj(x)ℓ=σ(∑i=1nj−1Wℓ,i(j)∗tjhj−1(x)i+Bℓ(j))h_j(x)_\ell = \sigma\left( \sum_{i=1}^{n_{j-1}} W^{(j)}_{\ell,i} *_{t_j} h_{j-1}(x)_i + B^{(j)}_\ell \right)

where ∗tj*_{t_j} denotes 1-D convolution with stride tjt_j, and σ\sigma is typically ReLU activation. Spatial dimensionality njn_j0 and channel number njn_j1 evolve with depth, with typical practice employing downsampling (stride equal to kernel size), causing njn_j2 to halve at each layer, and channel counts njn_j3 that grow then contract for parameter efficiency (Li et al., 2022).

The A2CNN architecture exemplifies a concrete 1-D CNN feature extractor for bearing fault diagnosis, operating on FFT amplitude vectors njn_j4 through a hierarchy of five convolutional+ReLU+max-pooling blocks and two fully connected layers:

  • Conv1: 8 filters, kernel=32, stride=2, output njn_j5
  • Pool1: pool=2, stride=2, output njn_j6
  • Conv2: 16 filters, kernel=16, stride=2, output njn_j7
  • Pool2: pool=2, stride=2, output njn_j8
  • Conv3: 32 filters, kernel=8, stride=2, output njn_j9
  • Pool3: pool=2, stride=2, output Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}0
  • Conv4: 32 filters, kernel=8, stride=2, output Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}1
  • Pool4: pool=2, stride=2, output Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}2
  • Conv5: 64 filters, kernel=3, stride=2, output Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}3
  • Pool5: pool=2, stride=2, output Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}4
  • FC1: 500 units, ReLU
  • FC2: Wâ„“,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}5 units (softmax label outputs) (Zhang et al., 2018)

2. Theoretical Foundations and Feature Extraction Guarantees

Rigorous analysis demonstrates that deep 1-D multi-channel CNNs, with appropriate layer depths, kernel sizes (=stride), and channel allocation, have the expressivity to serve as exact linear feature extractors. For an input Wℓ,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}6 and a dictionary of Wℓ,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}7 target feature vectors Wℓ,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}8, a Wℓ,i(j)∈RsjW^{(j)}_{\ell,i} \in \mathbb{R}^{s_j}9-layer network can realize sjs_j0 via multi-resolution convolutions. Intermediate layers yield all sjs_j1-length patch features of the target vectors, and the parameter count is bounded by sjs_j2, considerably more efficient than fully-connected layers (Li et al., 2022). This reveals a theoretical equivalence between deep 1-D CNNs and certain classical transforms (e.g., wavelets, SVD factorizations) in their capacity for structured linear feature extraction.

3. Feature Map Dimensions and Downsampling

At each layer, the output feature map length sjs_j3 is determined by the input length sjs_j4, kernel size sjs_j5, stride sjs_j6, and padding sjs_j7:

sjs_j8

For max-pooling with pool size sjs_j9:

tjt_j0

Channels per layer match the number of filters. Stride equaling kernel size yields rapid downsampling, consistent with practical approaches where spatial dimension is halved per layer, concentrating high-level receptive fields at depth (Zhang et al., 2018, Li et al., 2022).

4. Partial Layer Untying and Domain Adaptation

In domain adaptation scenarios, e.g., A2CNN, feature extractors are instantiated per domain (source tjt_j1, target tjt_j2). The lower layers (up to tjt_j3) are shared across domains, enforcing basic filter consistency, while the upper tjt_j4 layers are "untied" (domain-specific weights), allowing target-specific high-level feature adaptation. This yields a compromise between domain invariance (via tied layers) and adaptation flexibility (via untied layers), supporting robust transfer learning under covariate shift (Zhang et al., 2018).

5. Adversarial Training and Objective Functions

The 1-D CNN feature extractor can be integrated into an adversarial training scheme that enforces extracted features to be both discriminative for source labels and invariant across domains. Key loss terms are:

  • Source label cross-entropy:

tjt_j5

  • Domain discriminator loss (binary cross-entropy):

tjt_j6

  • Adversarial loss for target extractor:

tjt_j7

The overall objective combines tjt_j8 and tjt_j9 for balance. Training follows pre-training on source, then alternating domain discriminator and adversarial feature extractor updates. Gradient reversal layers can implement adversarial updates efficiently (Zhang et al., 2018).

6. Practical Implementation Notes and Computational Considerations

1-D CNN feature extractors admit efficient implementation:

  • Layer depth Bâ„“(j)B^{(j)}_\ell0 suffices to extract Bâ„“(j)B^{(j)}_\ell1-dimensional features with Bâ„“(j)B^{(j)}_\ell2 parameters (Li et al., 2022).
  • Pooling and strides are matched for maximal downsampling and parameter efficiency.
  • Channel counts double then halve, peaking at Bâ„“(j)B^{(j)}_\ell3, minimizing memory and computation cost.
  • Post-convolutional features can be fed directly into any fully-connected network, ResNet module, or classifier head for supervised tasks.
  • ReLU activations can be effectively linearized via large positive biases where needed (Zhang et al., 2018, Li et al., 2022).

7. Extensions and Analytical Perspectives

Recent analysis establishes deeper connections between 1-D and 2-D CNN architectures, using vectorization to show that multi-layer 2-D convolutions can be reformulated as 1-D convolutions with appropriately structured filters. This provides insights on how CNNs extract singular values and enables novel theoretical guarantees on their function approximation rates, which surpass those of shallow or fully-connected architectures under certain regularity conditions. Open directions include generalizing these results to stride-1 architectures, grouped/dilated convolutions, and quantifying the effects of finite precision and stochastic optimization (Li et al., 2022).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to 1-D CNN Feature Extractor.