Papers
Topics
Authors
Recent
Search
2000 character limit reached

SliceTrain: Adaptive Slicing Framework

Updated 6 March 2026
  • SliceTrain is a family of frameworks using slicing operations to partition models, data, or tensors, enabling sub-network instantiation and efficient computation across various applications.
  • Applications include elastic deep neural network inference, small object detection via image tiling, self-supervised slice-order pretraining in biomedical imaging, and tensor completion for model reduction.
  • Key benefits involve adaptive resource scaling, accelerated convergence, and memory efficient processing, making SliceTrain valuable for dynamic and constrained computational environments.

SliceTrain refers to a family of algorithmic and modeling frameworks centered on leveraging "slicing" operations—either on models, data, or tensors—to promote efficiency, modularity, or improved generalization within machine learning and computational science workflows. The concept appears in multiple domains and under distinct technical forms, including elastic deep neural network inference, slice-assisted visual detection training, order-based self-supervised learning in biomedical imaging, and sparse tensor completion for model reduction. The unifying theme is the systematic use of slices—contiguous blocks of model parameters, spatial or parametric partitions of data, or axis-aligned views in tensors—combined with methods for efficient joint optimization or completion.

1. Model Slicing and Elastic Inference

In deep learning, "model slicing" refers to constructing a feed-forward network such that each layer is divided into GG contiguous groups of basic components (neurons or channels) and enforcing a partial-order constraint whereby only the first k=⌈αG⌉k = \lceil \alpha G \rceil groups are active for a slice rate α∈(0,1]\alpha \in (0,1]. During inference, sub-models of varying width can be instantiated by selecting different α\alpha, enabling adaptive computational cost. The model is trained ("SliceTrain") over a set of slice rates, optimizing the expected loss across both data and slice-rate schedule. This produces a single set of parameters supporting a nested family of residualized sub-networks of different widths and facilitates elastic adjustment to compute/memory budget at inference time. FLOPs scale as α2\alpha^2 relative to the full model, and empirical results show little accuracy loss down to small α\alpha on language modeling (e.g., PTB: full net PPL 81.6, α=0.5\alpha=0.5 PPL ≈88) and image classification (e.g., CIFAR-10 VGG-13: 94.3% at α=1.0\alpha=1.0 and at α=0.375\alpha=0.375 with only 14% computation) (Cai et al., 2019).

2. SliceTrain in Small Object Detection via Image Tiling

In small object detection and multi-object tracking with large, high-resolution images, SliceTrain denotes a systematic data pipeline leveraging deterministic full-coverage spatial slicing coupled with slice-level stochastic augmentation. The original image is decomposed into overlapping tiles such that every pixel and object appears in at least one slice, and no object is cut without overlap. Slice size (h=w=1280h=w=1280), a typical overlap ratio (α=0.2\alpha=0.2), and corresponding strides are chosen to guarantee coverage. Each slice is treated as an independent example; a diverse per-slice augmentation pipeline (including mosaic, MixUp, hue-saturation-value jitter, random geometry) maximizes diversity during mini-batch training. Compared to full-image training (batch size of 1), this approach enables a higher batch size (e.g., 6), leading to more numerous, information-dense batches and steeper learning curves. Quantitatively, replacing baseline YOLOv8 full-image training with SliceTrain increases detection accuracy on small objects (SO-DetA from 9.788% to over 48%) and accelerates convergence (Yu et al., 16 Jul 2025).

3. Slice-Order Learning and Self-Supervised Pretraining

For volumetric biomedical imaging, SliceTrain refers to a self-supervised pretraining methodology (SortingLoss) that exploits the natural anatomical order of tomographic slices. The core assumption is that the sequential structure along the principal axis of a CT stack provides a well-defined slice order. The task is formulated as a learning-to-rank problem: given a batch of bb slices {x1,…,xb}\{x_1, \dots, x_b\} from the same volume, the neural network (e.g., ResNet-50 backbone) maps each to a scalar, and a margin ranking loss enforces correct order across all pairs. No spatial or semantic labels are needed, and the model is pre-trained on these orderings, then fine-tuned on sparse 2D labeled slices (thus avoiding error-prone 3D label interpolation). SortingLoss pre-training achieves comparable or better segmentation (mean IoU) than SimCLR on COVID-19 chest and Medaka fish CT datasets, while utilizing about half the memory and double the training speed. SliceTrain exhibits higher robustness to strong augmentations and aligns natively with sparse annotation practices in biomedical imaging (Zharov et al., 2022).

4. Slice-Based Tensor Completion for Parametric Model Reduction

In reduced order modeling (ROM) for high-dimensional parametric dynamical systems, SliceTrain is a synonym for the "hybrid tensor train" (HTT) approach to tensor completion. The central object is a snapshot tensor X∈RM1×...×MC×K1×...×KDX \in \mathbb{R}^{M_1 \times ... \times M_C \times K_1 \times ... \times K_D}, where CC axes correspond to degrees of freedom and DD axes to system parameters. Due to the curse of dimensionality, only a small subset of parameter slices can be simulated. SliceTrain proceeds in two stages: (i) compute reduced bases U~(i)\widetilde{U}^{(i)} on fully sampled modes by truncated SVD, (ii) project tensor data onto these bases and perform tensor-train (TT) completions on the lower-dimensional parameter fibers, using an alternating least squares (ALS) solver. This hybrid slicing/completion approach enables accurate tensor reconstruction with sparse parametric sampling, and subsequent high-accuracy, low-dimensional ROM inference for new parameter queries (Mamonov et al., 2024).

5. Pseudocode Schematic for Representative SliceTrain Workflows

Application Domain High-Level SliceTrain Procedure Notable Hyperparameters
Neural model slicing (Cai et al., 2019) Sample slice rates, forward/backprop on partial model, optimize expected task loss Number of groups GG, slice-rate set FF
Image tiling for detection (Yu et al., 16 Jul 2025) Deterministic overlapping tiling, per-slice aug, train standard detector Slice size h×wh \times w, overlap α\alpha
CT slice-order learning (Zharov et al., 2022) Batch slices from one volume, apply augment, margin ranking loss on order Batch size bb, ranking margin mm
Tensor completion (Mamonov et al., 2024) Truncate SVD on fully sampled modes, TT-completion on parameter slices SVD truncation εC\varepsilon_C, TT error εq\varepsilon_q

6. Comparative Evaluation and Impact

SliceTrain techniques systematically offer improved tradeoffs in resource utilization, generalization, or data efficiency relative to alternatives:

  • Elastic model slicing provides a single neural network with sub-linear compute-memory scaling and the ability to meet inference-time budgets without post-hoc pruning or retraining (Cai et al., 2019).
  • In visual detection, enforcing spatial coverage with per-tile augmentations radically increases detection accuracy and convergence speed for difficult small object tasks, compared with full-image or random crop strategies (Yu et al., 16 Jul 2025).
  • Self-supervised ranking of biomedical CT slices yields pretrained features that match or exceed generic contrastive approaches in segmentation accuracy, while enabling larger batch sizes due to reduced memory needs (Zharov et al., 2022).
  • Slice-wise tensor-train completion circumvents exponential scaling for parametric ROM, facilitating accurate, memory-efficient surrogate models in multi-parameter PDE contexts (Mamonov et al., 2024).

7. Limitations and Characteristic Use Cases

SliceTrain strategies depend on well-structured axis-aligned domains or networks permitting groupwise slicing. In model slicing, the approach is less suited for irregular architectures or tasks requiring highly entangled representing power across all width/parameter groups. For visual detection and biomedical imaging, slicing is effective when target information is denser or better preserved in axis-aligned partitions, but may be suboptimal for highly non-local phenomena. Slice-based tensor completion assumes the ability to collect or simulate entire slices efficiently, with sufficient coverage to construct accurate low-rank bases. These constraints define the operational envelope for SliceTrain adoption.


For original methodology and results, see "Model Slicing for Supporting Complex Analytics with Elastic Inference Cost and Resource Constraints" (Cai et al., 2019), "YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association" (Yu et al., 16 Jul 2025), "Using the Order of Tomographic Slices as a Prior for Neural Networks Pre-Training" (Zharov et al., 2022), and "Model order reduction of parametric dynamical systems by slice sampling tensor completion" (Mamonov et al., 2024).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to SliceTrain.