2025
Maestro-Prieto, José Alberto; Romero, Pablo E.; Sanz, José Miguel Ramírez
Semi-supervised techniques to address the scarcity of experimental data: a case study of single point incremental forming Journal Article
In: Journal of Intelligent Manufacturing, 2025, ISSN: 0956-5515.
Abstract | Links | BibTeX | Tags: mulitiple data source, Semi-supervised learning, single point incremental forming, SPIF, surface roughness
@article{maestro-prieto2025,
title = {Semi-supervised techniques to address the scarcity of experimental data: a case study of single point incremental forming},
author = {José Alberto Maestro-Prieto and Pablo E. Romero and José Miguel Ramírez Sanz},
url = {https://doi.org/10.1007/s10845-025-02704-3},
doi = {10.1007/s10845-025-02704-3},
issn = {0956-5515},
year = {2025},
date = {2025-10-31},
urldate = {2025-10-31},
journal = { Journal of Intelligent Manufacturing},
abstract = {A lack of experimental data can be especially critical in new manufacturing processes. Although experimental datasets for industrial processes are reported in various research works, their lack of homogeneity complicates any fitting with conventional numerical models. Artificial Intelligence (AI) models can be an optimal alternative to extract useful information from those unconnected datasets, while generating models that can help explain the hidden patterns within datasets and interpret the predictions of the model for final users. Moreover, an AI algorithm that could be trained with limited labeled datasets would be in high demand, as it could effectively lower implementation costs. Semi-Supervised Learning (SSL) techniques might therefore be a promising solution to respond to industrial demand for the analysis of manufacturing processes. In this research, the use of SSL techniques is proposed in a case study of surface quality prediction in single point incremental forming, a promising new manufacturing technique. Datasets were extracted from the existing bibliography to generate a 234-instance dataset with 4 different industrial specifications of roughness. The best results were obtained using a semi-supervised Co-Training algorithm. Semi-supervised methods systematically improved the results obtained with the reference supervised methods, although statistical significance has not been mainly achieved due to the limited dataset size. The results obtained with the unbalanced dataset were very promising for its industrial implementation with an extended training dataset optimized for the range of process conditions of each end-user.},
keywords = {mulitiple data source, Semi-supervised learning, single point incremental forming, SPIF, surface roughness},
pubstate = {published},
tppubtype = {article}
}
A lack of experimental data can be especially critical in new manufacturing processes. Although experimental datasets for industrial processes are reported in various research works, their lack of homogeneity complicates any fitting with conventional numerical models. Artificial Intelligence (AI) models can be an optimal alternative to extract useful information from those unconnected datasets, while generating models that can help explain the hidden patterns within datasets and interpret the predictions of the model for final users. Moreover, an AI algorithm that could be trained with limited labeled datasets would be in high demand, as it could effectively lower implementation costs. Semi-Supervised Learning (SSL) techniques might therefore be a promising solution to respond to industrial demand for the analysis of manufacturing processes. In this research, the use of SSL techniques is proposed in a case study of surface quality prediction in single point incremental forming, a promising new manufacturing technique. Datasets were extracted from the existing bibliography to generate a 234-instance dataset with 4 different industrial specifications of roughness. The best results were obtained using a semi-supervised Co-Training algorithm. Semi-supervised methods systematically improved the results obtained with the reference supervised methods, although statistical significance has not been mainly achieved due to the limited dataset size. The results obtained with the unbalanced dataset were very promising for its industrial implementation with an extended training dataset optimized for the range of process conditions of each end-user.

