Fuzzy Logistic Liu-type Estimators (FLLTE)
- The paper introduces a two-parameter penalization that improves bias–variance performance and numerical stability in fuzzy logistic regression.
- FLLTE represents both responses and parameters as fuzzy triangular numbers, extending the framework to manage uncertainty in biomedical data.
- Empirical studies confirm that FLLTE outperforms standard estimators by achieving lower MSE and superior goodness-of-fit in multicollinear settings.
Fuzzy Logistic Liu-type Estimators (FLLTE) are a class of penalized estimators designed for fuzzy logistic regression under severe multicollinearity. Both the response and model parameters are represented by fuzzy triangular numbers, extending the classical logistic regression framework to address uncertainty and vagueness in both inputs and outputs. The FLLTE introduces a two-parameter shrinkage, yielding improved bias–variance properties and numerical stability, which is empirically validated by simulation and application to high-multicollinearity biomedical data (Shemail et al., 27 Dec 2025).
1. Statistical Model and Notation
In the fuzzy logistic regression context, each observation has a fuzzy binary response , represented as a triangular fuzzy number with lower, modal, and upper bounds. The parameter vector is also represented as fuzzy triangular numbers: . The design matrix is with a leading column of ones for the intercept.
The fuzzy predicted probability of success at observation is
For estimation, the logit link is applied to the midpoints:
$\logit(\pi_i) = X_i \beta,$
where is the midpoint of . The weight matrix for Fisher scoring is .
2. Multicollinearity and Standard Fuzzy Estimators
When columns of display strong linear dependence, the Fisher-information matrix becomes ill-conditioned. The standard Fuzzy Maximum Likelihood Estimator (FMLE),
where is the working response, exhibits high variance and instability under such conditions. This behavior motivates penalized alternatives to address the inflated variance characteristic of multicollinear settings.
3. Formulation of the Fuzzy Logistic Liu-type Estimator
The FLLTE is derived by augmenting the Newton–Raphson scoring equation with a generalized Liu-type two-parameter penalty:
for penalty parameters and . The resulting estimator is
In practice, this is written as
The parameters and are selected to balance sparsity, stability, and bias–variance trade-off.
4. Statistical Properties
The FLLTE displays analytically tractable bias and variance:
- Bias: For the true parameter ,
so the bias is
- Variance:
- Mean Square Error (MSE):
- Asymptotics: As , the penalization vanishes and in distribution, guaranteeing consistency.
5. Comparison with FMLE, FLRE, FLLE, and FLLTPE
Several estimators are contrasted: | Estimator | Formula | Penalty Parameters | |------------------------------|---------------------------------------------------------------------|--------------------| | FMLE | | None | | FLRE | | Ridge () | | FLLE | | Liu () | | FLLTE | | Liu-type () | | FLLTPE | | Liu-type param. |
Simulation and real-data analyses demonstrate the hierarchy:
in terms of lower MSE and improved goodness-of-fit, especially under strong predictor correlation (Shemail et al., 27 Dec 2025).
6. Choice of Shrinkage Parameters
Shrinkage parameters (, ) are selected through data-driven formulas developed by Asar & Genç (2016):
- Compute eigenvalues/vectors of .
- Set
ensuring . In empirical analysis, medians or means across the components are used for parameter stability.
7. Empirical Evaluation and Application
Simulation studies were conducted for and , with inter-predictor correlations of $0.70$, $0.90$, and $0.99$. The true fuzzy coefficients were set as triangular numbers around predefined values. Over $100$ Monte Carlo repetitions, FLLTE—and particularly FLLTPE—attained the smallest MSE, remaining robust even as correlation approached $0.99$, whereas FMLE demonstrated rapid degradation.
A real-data application used patient observations for modeling the fuzzy “renal failure rate” (triangular-coded as Low , Medium , High ) as a function of blood sugar, hemoglobin, and age. The condition number of was approximately $45$, indicating severe multicollinearity. Results are summarized as follows:
| Estimator | MSE | Goodness-of-Fit |
|---|---|---|
| FMLE | 1.81 | 0.54 |
| FLRE | 1.42 | 0.48 |
| FLLE | 1.29 | 0.39 |
| FLLTE | 0.57 | 0.17 |
| FLLTPE | 0.53 | 0.16 |
FLLTE and FLLTPE yielded the lowest MSE and best model fit.
8. Implications and Recommendations
The FLLTE addresses instability and inflated variance in fuzzy logistic regression when multicollinearity is present by introducing a flexible two-parameter penalization. Its closed-form expression admits analytic bias–variance trade-offs and preserves asymptotic consistency. Simulation and real-world results confirm notable improvements over FMLE, FLRE, and FLLE with respect to MSE and model fit. Median-based eigen-decomposition parameter selection is recommended to promote stability across settings (Shemail et al., 27 Dec 2025).