Differentiable FOL with LTNs in Neuro-Symbolic AI
- The paper introduces a differentiable framework for FOL with LTNs, integrating neural models with logical reasoning.
- It leverages fuzzy logic and log-space techniques to ensure stable optimization and effective aggregation of logical constraints.
- Applications include zero-shot image classification and medical segmentation, demonstrating improved performance in neuro-symbolic tasks.
Differentiable first-order logic with Logic Tensor Networks (LTNs) unifies learning and reasoning by grounding the syntax and semantics of first-order logic—constants, variables, functions, predicates, connectives, and quantifiers—as differentiable operations in neural architectures. Through fully differentiable "Real Logic" semantics, LTNs enable the optimization of logical knowledge bases and support applications in neuro-symbolic AI such as zero-shot image classification and medical semantic segmentation. PROTOtypical LTNs (PROTO-LTNs) extend this paradigm, providing parameter-efficient, prototype-based class groundings suitable for zero- and few-shot learning. This approach underpins a broad range of tasks where integrating symbolic knowledge and data-driven learning is essential.
1. Real Logic and LTNs: Semantics of Differentiable FOL
Logic Tensor Networks implement "Real Logic," mapping all elements of classical first-order logic to real-valued, continuous, and differentiable functions (Badreddine et al., 2020). Groundings are defined as follows:
- Constants/Variables: Each symbol is mapped to a feature vector in $\mathbb{R}^n$.
- Function Symbols: Each $m$-ary function $f$ is mapped to $\mathcal{G}(f)\colon \mathbb{R}^{m \cdot n} \to \mathbb{R}^{n}$ (commonly a neural network).
- Predicate Symbols: For each $m$-ary predicate $P$, the grounding is $\mathcal{G}(P)\colon \mathbb{R}^{m \cdot n} \to [0,1]$, typically using neural tensor networks or MLPs.
The connective layer replaces classical Boolean operators with differentiable fuzzy logic:
Commonly, conjunction is grounded by $T(a,b) = a \cdot b$ (product t-norm) and disjunction by $S(a,b) = a + b - ab$ (probabilistic sum) (Badreddine et al., 2020, Martone et al., 2022).
Quantifiers are interpreted as aggregators:
- $\forall$ ("for all") as a generalized $p$-mean close to a product,
- $\exists$ ("there exists") as a generalized mean or smoothed max.
Letting $a_1, \dots, a_n$ be the groundings of a formula over the quantification domain, quantifier aggregation can be:

$$A_{pM}(a_1,\dots,a_n) = \left(\frac{1}{n}\sum_{i=1}^{n} a_i^{\,p}\right)^{1/p}, \qquad A_{pME}(a_1,\dots,a_n) = 1 - \left(\frac{1}{n}\sum_{i=1}^{n} (1-a_i)^{p}\right)^{1/p},$$

with $A_{pME}$ typically grounding $\forall$ and $A_{pM}$ (with large $p$) grounding $\exists$.
A knowledge base $\mathcal{K}$ of fuzzy formulas is globally satisfied to the extent

$$\mathrm{Sat}(\mathcal{K}) = \operatorname{SatAgg}_{\phi \in \mathcal{K}} \mathcal{G}_\theta(\phi),$$

where $\operatorname{SatAgg}$ aggregates the truth degrees of all formulas (e.g., via a $p$-mean).
Learning is formulated as maximizing this satisfaction, with $L_2$ regularization on parameters.
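The connective and aggregation semantics above can be sketched in a few lines of NumPy. This is an illustrative sketch, not the official LTN library: the function names and the choice of $p$ are assumptions.

```python
# Minimal NumPy sketch of Real Logic semantics: product t-norm,
# probabilistic sum, p-mean aggregators, and KB satisfaction.
import numpy as np

def t_and(a, b):           # product t-norm for conjunction
    return a * b

def t_or(a, b):            # probabilistic sum for disjunction
    return a + b - a * b

def t_not(a):              # standard fuzzy negation
    return 1.0 - a

def p_mean(a, p=2):        # generalized p-mean, used for "exists"
    a = np.asarray(a, dtype=float)
    return np.mean(a ** p) ** (1.0 / p)

def p_mean_error(a, p=2):  # p-mean of errors, used for "forall"
    a = np.asarray(a, dtype=float)
    return 1.0 - np.mean((1.0 - a) ** p) ** (1.0 / p)

# Truth degrees of P(x) over a small quantification domain:
truths = np.array([0.9, 0.8, 0.95])

forall = p_mean_error(truths)     # high only if all truths are high
exists = p_mean(truths, p=6)      # smooth approximation of max

# Satisfaction of a KB = aggregate over formula truths (here: mean).
kb = [forall, t_or(exists, t_not(0.2))]
sat = float(np.mean(kb))
```

Raising $p$ in `p_mean` pushes the aggregator toward the max, while `p_mean_error` penalizes outlier-low truths, which is why it is preferred for universal quantification.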
2. logLTN and Differentiable Fuzzy Logic in the Logarithm Space
logLTN (Badreddine et al., 2023) addresses numerical instability and gradient vanishing/exploding issues by grounding all connectives and quantifiers in the log domain:
- Atoms: If $\sigma(z)$ is a sigmoid/softmax output for predicate $P$, the log-truth $\log \sigma(z)$ is used.
- Conjunction: $\log(a \wedge b) = \log a + \log b$, i.e., the sum of log-truths (product t-norm in log space).
- Disjunction: grounded via LogSumExp over log-truths, a numerically stable log-domain smooth max.
- Negation: Atom-level with exact closed forms for log-sigmoid and log-softmax.
- Universal quantifier: Mean of log-truths (ensuring batch-size invariance).
- Existential quantifier: LogMeanExp (LME), providing a tight smooth approximation to max, distributing gradients stably.
The mean over log-truths for $\forall$ and LME for $\exists$ prevent underflow/overflow and maintain efficient backpropagation. Empirical benchmarks demonstrate significantly improved stability and tighter theoretical bounds compared to traditional product and sum-based aggregators.
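A minimal log-domain sketch of these operators follows. This is illustrative, not the logLTN implementation; `log_mean_exp` uses the standard max-shift trick for numerical stability.

```python
# Log-domain fuzzy semantics: log-truths, conjunction as a sum,
# "forall" as a mean of logs, "exists" as LogMeanExp.
import numpy as np

def log_and(log_a, log_b):       # log(a * b) = log a + log b
    return log_a + log_b

def log_forall(log_truths):      # mean of log-truths (batch-size invariant)
    return np.mean(log_truths)

def log_mean_exp(log_truths):    # stable smooth-max-style aggregate
    m = np.max(log_truths)       # subtract max to avoid overflow
    return m + np.log(np.mean(np.exp(log_truths - m)))

# Log-truths of P(x) over a domain (log of sigmoid outputs):
log_t = np.log(np.array([0.9, 0.8, 0.95]))

fa = log_forall(log_t)           # geometric-mean-like aggregate
ex = log_mean_exp(log_t)         # smooth, stable aggregate for "exists"
```

Note that `log_mean_exp` of the log-truths equals the log of their arithmetic mean, so it is always at least the `log_forall` value (which corresponds to the geometric mean) while never exceeding the log of the maximum truth.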
3. LTNs in Practice: Model Architecture and Optimization
A typical LTN model graph includes:
- Grounding variables as real vectors/tensors for the given domain.
- Neural networks for predicate and function symbols.
- Fuzzy connective layers compute $\neg$, $\wedge$, $\vee$, and $\rightarrow$ element-wise over outputs.
- Quantifier layers aggregate (via $p$-means or log-space operators) over variable axes.
- The global satisfaction is aggregated (e.g., via a t-norm or mean), and the final loss is typically:

$$\mathcal{L}(\theta) = 1 - \operatorname{SatAgg}_{\phi \in \mathcal{K}} \mathcal{G}_\theta(\phi) + \lambda \lVert \theta \rVert_2^2.$$
Gradients propagate through all layers, including logic operators and aggregators, enabling joint end-to-end learning of neural parameters and logical satisfaction (Badreddine et al., 2020, Badreddine et al., 2023).
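This end-to-end loop can be illustrated on a toy one-parameter predicate. This is a hedged sketch: numerical gradients stand in for autodiff, and the axioms and data are invented for illustration only.

```python
# Toy end-to-end optimization: maximize the satisfaction of
#   forall x in Pos: P(x)   and   forall x in Neg: not P(x),
# where P is a one-parameter sigmoid predicate.
import numpy as np

pos = np.array([1.0, 2.0, 1.5])     # samples that should satisfy P
neg = np.array([-1.0, -2.0, -0.5])  # samples that should satisfy not P

def predicate(x, w):                # G(P): sigmoid of w * x
    return 1.0 / (1.0 + np.exp(-w * x))

def satisfaction(w):                # mean over the two forall-axioms,
    # each grounded with the p-mean-of-errors aggregator (p = 2):
    forall_pos = 1.0 - np.sqrt(np.mean((1.0 - predicate(pos, w)) ** 2))
    forall_neg = 1.0 - np.sqrt(np.mean(predicate(neg, w) ** 2))
    return 0.5 * (forall_pos + forall_neg)

w, lr, eps = 0.0, 1.0, 1e-5
for _ in range(200):                # gradient ascent on satisfaction
    grad = (satisfaction(w + eps) - satisfaction(w - eps)) / (2 * eps)
    w += lr * grad

final_sat = satisfaction(w)         # should exceed the w = 0 baseline of 0.5
```

At $w = 0$ the predicate outputs 0.5 everywhere, so satisfaction starts at 0.5; ascent drives $w$ positive, jointly raising the truth of both axioms.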
4. PROTOtypical Logic Tensor Networks for Zero- and Few-Shot Learning
PROTO-LTN (Martone et al., 2022) introduces a paradigm for class-level reasoning in few- and zero-shot visual tasks:
- Prototype-based Grounding: Each class $c$ is represented as a prototype vector $p_c \in \mathbb{R}^m$, occupying the same space as image embeddings.
- Few-Shot: Prototypes are computed as means of support embeddings: $p_c = \frac{1}{|S_c|} \sum_{x_i \in S_c} f_\theta(x_i)$, where $f_\theta$ maps input images to embeddings.
- Zero-Shot: Unseen-class prototypes are derived by embedding semantic attribute vectors via a learned mapping into the same space.
The $\mathrm{isOfClass}$ predicate, fundamental for semantic interpretation, is grounded by a Gaussian kernel over embedding distances: $\mathcal{G}(\mathrm{isOfClass})(x, c) = \exp\!\big(-\lVert f_\theta(x) - p_c \rVert_2^2\big)$. Alternatively, a parameterized similarity function can be learned.
Logical axioms enforce class membership and non-membership constraints for queries. For each episode:
- Positive axioms: each query example must be an instance of its true class.
- Negative axioms: each query must not be an instance of any other class (down-weighted by a factor $\lambda$ to balance the many negative constraints).
The per-episode differentiable loss aggregates the satisfaction of these axioms, incorporating both data-driven and logical supervision.
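A per-episode sketch of the prototype grounding is given below. All names are hypothetical: `f_theta` is a stand-in for a learned embedding network, and `is_of_class` is a simplified Gaussian-kernel form of the predicate described above.

```python
# Prototype computation and fuzzy class-membership grounding.
import numpy as np

rng = np.random.default_rng(0)

def f_theta(x):                      # hypothetical embedding network
    return np.tanh(x)                # placeholder for a learned CNN

# Few-shot: prototypes are means of support-set embeddings per class.
support = {
    "cat": rng.normal(0.5, 0.1, size=(5, 8)),   # 5 shots, 8-dim inputs
    "dog": rng.normal(-0.5, 0.1, size=(5, 8)),
}
prototypes = {c: f_theta(xs).mean(axis=0) for c, xs in support.items()}

def is_of_class(x, c, scale=1.0):    # fuzzy truth value in (0, 1]
    d2 = np.sum((f_theta(x) - prototypes[c]) ** 2)
    return np.exp(-scale * d2)       # Gaussian kernel over distance

query = rng.normal(0.5, 0.1, size=8)  # a "cat"-like query example
truth_cat = is_of_class(query, "cat")
truth_dog = is_of_class(query, "dog")
```

Positive axioms then maximize `is_of_class(query, true_class)` while negative axioms minimize it for all other classes, and gradients flow back into `f_theta` through the kernel.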
5. Applications and Empirical Results
5.1 Semantic Image Interpretation and GZSL
In generalized zero-shot learning (GZSL), PROTO-LTN demonstrates performance at or above state-of-the-art embedding-based methods: on AWA2, its unseen-class accuracy and harmonic mean exceed those reported for DEM (Martone et al., 2022). Embedding visualizations via t-SNE confirm that instances and prototypes cluster semantically in the learned metric space.
5.2 Semantic Segmentation with Medical Knowledge
The integration of FOL constraints into medical image segmentation is exemplified by combining LTNs with a SwinUNETR backbone (Bergamin et al., 26 Sep 2025). Background knowledge rules—such as connectivity, non-nesting, and volume similarity—are expressed as FOL formulas and continuously relaxed to yield differentiable loss components. Empirically, LTNs act as soft, anatomically-informed regularizers, yielding consistent improvements in Dice coefficient, notably in low-data settings. For instance, with only 5% of the training data, SwinUNETR + LTN achieves a higher Dice score than the same backbone trained without LTN.
6. Significance, Limitations, and Directions
LTNs and their variants—including logLTN and PROTO-LTN—provide a unified, differentiable backbone for neuro-symbolic AI. Their chief strengths are parameter sharing, end-to-end optimization, and the ability to inject arbitrary, domain-specific, or commonsense background knowledge through differentiable logic. logLTN advances address prior instability and scaling issues, facilitating broader classes of formulas with improved convergence.
A plausible implication is that this approach can generalize seamlessly to new domains, provided predicates and axioms are expressible as neural modules and fuzzy-logic formulas. Nevertheless, constraints on expressiveness may arise if required logical properties cannot be adequately captured by fuzzy groundings. Moreover, practical performance depends on the differentiability and smoothness of chosen relaxations, as demonstrated in ablation studies comparing p-means, t-norms, and log-space semantics (Badreddine et al., 2023).
7. Summary Table: LTN Variants and Key Features
| Framework | Connective Semantics | Quantifier Aggregation | Notable Features |
|---|---|---|---|
| LTN ("Real Logic") (Badreddine et al., 2020) | Product t-norm, S-norm, min/max | $p$-mean for $\forall$/$\exists$ | General-purpose, modular |
| logLTN (Badreddine et al., 2023) | Log-domain addition, LME | Mean of logs, LME | Numerical stability, batch-invariant, tighter bounds |
| PROTO-LTN (Martone et al., 2022) | Gaussian kernel (prototype) | Regular $p$-mean | Prototype-based, parameter efficiency, GZSL/few-shot |
The differentiable first-order logic formalized by LTNs and their extensions enables principled integration of data-driven learning with logical axioms, providing a robust neuro-symbolic framework that supports both knowledge-based reasoning and high-capacity function approximation.