Explicit positional encoding for the texture transformer branch
Develop and evaluate explicit positional encoding mechanisms for the transformer-based texture branch that aggregates per-face texture pixels into a token in semantic segmentation models for textured non-manifold 3D meshes, in order to assess their impact on segmentation accuracy and representation quality.
References
The most notable limitation of our method is the absence of explicit positional encoding in the texture transformer branch, which we leave for future exploration.
— Semantic Segmentation of Textured Non-manifold 3D Meshes using Transformers
(2604.01836 - Heidarianbaei et al., 2 Apr 2026) in Conclusion and Future Work