CryptoFair-FL: Secure & Fair Federated Learning

Updated 25 January 2026

The paper introduces CryptoFair-FL, a framework that integrates additively homomorphic encryption and secure multi-party computation to verify fairness metrics without revealing sensitive data.
It employs a batched binary-tree aggregation protocol to reduce computational overhead, achieving up to 488× speedup over naïve methods while maintaining efficiency.
Experiments demonstrate that CryptoFair-FL achieves an 86.6% reduction in demographic parity violation and robust defense against attribute inference attacks with only moderate cost increase.

CryptoFair-FL is a cryptographic framework designed to enable privacy-preserving federated learning (FL) with verifiable group fairness guarantees. It integrates additively homomorphic encryption and secure multi-party computation to support rigorous statistical verification of fairness metrics—specifically demographic parity and equalized odds—without disclosure of protected attribute distributions or individual predictions. The framework addresses both honest-but-curious and malicious adversaries, establishes formal information-theoretic lower bounds on privacy degradation, and reduces the computational complexity of fairness verification through a batched, binary-tree aggregation protocol, achieving practical deployment efficiency. Comprehensive experiments on heterogeneous federated datasets illustrate that CryptoFair-FL can meet regulatory fairness targets while defending against attribute inference and imposing only moderate computational overhead (Ali et al., 18 Jan 2026).

1. Framework Overview and Security Model

CryptoFair-FL supports collaborative training of models across $n$ distributed institutions, ensuring that neither raw data nor protected attributes are ever centralized. The primary objectives are: (i) privacy preservation for all participants, (ii) statistically verifiable group fairness guarantees under both cryptographic and differential privacy regimes, and (iii) computational and network demands not exceeding $2.3\times$ the baseline Federated Averaging (FedAvg) protocol.

The threat model encompasses:

Honest-but-Curious: The central aggregator and up to $t < n/2$ participants follow the protocol but seek to infer sensitive information from protocol transcripts.
Malicious Adversaries: Up to $t < n/3$ participants may deviate arbitrarily, including providing forged statistics. The protocol ensures that such misbehavior is detected with probability at least $1-\delta_{\text{detect}}$ .

2. Formalization of Fairness Metrics

CryptoFair-FL focuses on two group fairness criteria in binary classification settings:

Demographic Parity Violation:

$\Delta_{\mathrm{DP}}(\theta) = \left|\,\Pr[\hat Y=1 \mid A=0] - \Pr[\hat Y=1 \mid A=1]\right|$

where $\hat Y = \mathbf{1}[f_\theta(X) > 0.5]$ and $A \in \{0,1\}$ .

Equalized Odds Violation:

$\Delta_{\mathrm{EO}}(\theta) = \max_{y\in\{0,1\}} \left| \Pr[\hat Y=1 \mid A=0, Y=y] - \Pr[\hat Y=1 \mid A=1, Y=y]\right|$

Both notions require securely aggregating predictions and protected attribute counts without revealing local or aggregate statistics.

3. Cryptographic and Differential Privacy Foundations

Additively Homomorphic Encryption (AHE)

CryptoFair-FL uses the Paillier cryptosystem with 2048-bit modulus, supporting operations:

For plaintexts $m_1, m_2$ ,

$\operatorname{Dec}_{sk}(\operatorname{Enc}_{pk}(m_1)\;\odot\;\operatorname{Enc}_{pk}(m_2)) = m_1 + m_2$

Semantic security relies on the Decisional Composite Residuosity assumption.

Secure Multi-Party Computation (MPC) Components

Threshold Decryption: Private decryption key is $n$ -way shared; any $k$ -subset can jointly decrypt.
Zero-Knowledge Range Proofs: Each party $i$ commits to its local count $s_i$ using a Pedersen commitment $\mathsf{Com}_i = g^{s_i}h^{r_i}$ , providing a proof that $s_i \in [0, m_i]$ .
Aggregate Verification: The aggregator multiplies commitments and validates all range proofs, aborting on any failure.

$(\varepsilon, \delta)$ -Differential Privacy

A randomization mechanism $\mathcal{M}$ is $(\varepsilon, \delta)$ -DP if, for all adjacent datasets $D,D'$ and all measurable $S$ ,

$\Pr[\mathcal{M}(D)\in S] \leq e^\varepsilon\,\Pr[\mathcal{M}(D')\in S] + \delta.$

4. Batched Binary-Tree Fairness Verification Protocol

Naïve aggregation of $n$ encrypted statistics incurs $O(n^2)$ cost. CryptoFair-FL reduces this to $O(n\log n)$ using a binary-tree batching protocol, in which local encrypted counts are first summed within small batches, and these batch sums are then recursively homomorphically aggregated in a tree structure.

Protocol outline:

Partition $n$ participants into $\lceil n/B\rceil$ batches.
Homomorphically sum local ciphertexts within each batch.
Recursively aggregate batch ciphertexts in $\lceil\log_2(n/B)\rceil$ levels, generating zero-knowledge pairing proofs at each node.
If any proof fails, the protocol is aborted.

The aggregation step at each tree node is:

$C_j^{(\ell)} = C_{2j-1}^{(\ell-1)} \odot C_{2j}^{(\ell-1)}$

The process requires $O(\log n)$ communication rounds.

5. Privacy and Security Guarantees

Differential Privacy Parameters

Theorem 6 specifies that, for $T$ rounds of fairness verification, noise scale $\sigma$ per party, and $n$ institutions:

$\varepsilon = \frac{4\sqrt{2T\ln (2/\delta)}}{\sigma n} + \frac{4T}{(\sigma n)^2}, \quad \delta = 10^{-6}$

Local Laplace noise (with scale $1/\varepsilon_0$ ) is added to each count, yielding $(\varepsilon_0,0)$ -DP locally. Aggregation and composition preserve $(\varepsilon, \delta)$ -DP globally. For recommended parameter choices ( $\varepsilon=0.5$ , $\delta=10^{-6}$ ), privacy loss remains near optimal.

Information-Theoretic Lower Bounds

Theorem 3 establishes that any mechanism aiming to verify $\Delta_{\mathrm{DP}}$ with additive tolerance $\tau$ must satisfy:

$\varepsilon \geq \frac{2}{\tau\,\min\{n_0, n_1\}}$

where $n_a$ is the record count for protected-attribute value $a$ . This lower bound is obtained by reduction to distinguishing adjacent datasets via hypothesis testing.

Malicious and Honest-but-Curious Adversary Defense

Zero-knowledge range proofs and threshold decryption address malicious misreporting and protect against collusion. Differential privacy, combined with cryptographically protected aggregation, renders successful attribute inference infeasible, with adversarial success rates empirically reduced to near random guess ( $<0.05$ advantage).

6. Experimental Evaluation

Datasets

MIMIC-IV (30 hospitals), mortality classification, protected attribute: race.
Adult Income (50 institutions), protected: sex.
CelebA (40 participants), protected: gender and age.
FedFair-100: synthetic, 100 institutions, census-calibrated heterogeneity.

Results

Protocol	$\Delta_{\mathrm{DP}}$ (MIMIC-IV)	AUROC (MIMIC-IV)	Overhead
FedAvg (standard)	0.231	$\approx$ 0.868	$1\times$
CryptoFair-FL	0.031	$\approx$ 0.857	$2.3\times$

CryptoFair-FL reduces demographic parity violation from $0.231$ to $0.031$ (an $86.6\%$ reduction).
AUROC remains within $0.011$ of the centralized baseline.
Communication/compute cost increases by $2.3\times$ vs. FedAvg.
Batched verification yields up to $488\times$ speedup versus naïve HE at $n=100$ .
Attribute inference attack success drops from $0.72$–$0.81$ to $0.48$–$0.53$ with CryptoFair-FL (approaching the $0.50$ baseline for random guessing).

7. Privacy–Fairness Tradeoff and Applicability

Residual fairness error scales as $O(1/\varepsilon)$ , as formalized in Theorem 7:

$\Delta_{\mathrm{DP}} \approx \frac{C}{\varepsilon}, \quad C\approx 0.096$

Empirical measurements align closely with this bound across all datasets. In the 30-hospital MIMIC-IV setting, $\Delta_{\mathrm{DP}}$ falls below the $0.05$ regulatory target within 60 communication rounds while AUROC remains above 0.85. CryptoFair-FL thus achieves near-optimal privacy-fairness efficiency within $20\%$ of the theoretical lower bound.

The integration of statistical fairness verification, cryptographic privacy, and practical aggregation efficiency makes CryptoFair-FL particularly suitable for regulated, high-stakes collaborative learning scenarios such as healthcare and finance, where legal mandates require both privacy and accountability (Ali et al., 18 Jan 2026).

Markdown Report Issue Upgrade to Chat

References (1)

Privacy-Preserving Federated Learning with Verifiable Fairness Guarantees (2026)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to CryptoFair-FL.

CryptoFair-FL: Secure & Fair Federated Learning

1. Framework Overview and Security Model

2. Formalization of Fairness Metrics

3. Cryptographic and Differential Privacy Foundations

Additively Homomorphic Encryption (AHE)

Secure Multi-Party Computation (MPC) Components

$(\varepsilon, \delta)$ -Differential Privacy

4. Batched Binary-Tree Fairness Verification Protocol

5. Privacy and Security Guarantees

Differential Privacy Parameters

Information-Theoretic Lower Bounds

Malicious and Honest-but-Curious Adversary Defense

6. Experimental Evaluation

Datasets

Results

7. Privacy–Fairness Tradeoff and Applicability

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

CryptoFair-FL: Secure & Fair Federated Learning

1. Framework Overview and Security Model

2. Formalization of Fairness Metrics

3. Cryptographic and Differential Privacy Foundations

Additively Homomorphic Encryption (AHE)

Secure Multi-Party Computation (MPC) Components

(ε,δ)(\varepsilon, \delta)(ε,δ)-Differential Privacy

4. Batched Binary-Tree Fairness Verification Protocol

5. Privacy and Security Guarantees

Differential Privacy Parameters

Information-Theoretic Lower Bounds

Malicious and Honest-but-Curious Adversary Defense

6. Experimental Evaluation

Datasets

Results

7. Privacy–Fairness Tradeoff and Applicability

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research

$(\varepsilon, \delta)$ -Differential Privacy