Synthetic generated data for intelligent corrosion classification in oil and gas pipelines

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

This research presents the K-Pipelines dataset, a pioneering synthetic image collection designed specifically for the classification of corrosion in oil and gas pipelines. Instead of training custom generative architectures, our research used an online image generation tool powered by Stable Diffusion. This choice leveraged the platform's robust capability to quickly produce a high volume of diverse and detailed images, saving significant time and resources. The dataset was carefully constructed using a sequence of refined prompts, derived from a review of pipeline characteristics including material types, environments, and corrosion forms. K-Pipelines consist of 600 PNG images of 512 × 512 resolution. Furthermore, an augmented version was developed, totaling 1080 images. Our evaluation employed state-of-the-art deep learning classifiers, specifically VGG16, ResNet50, EfficientNet, InceptionV3, MobileNetV2, and ConvNeXt-base, to test the integrity of the K-pipelines dataset. These models showcased its robustness by consistently achieving accuracies around the 90% mark, illustrating the dataset's substantial promise as a resource for both AI research and real-world applications in the oil and gas industry. The dataset is publicly available for access and use within the scientific community.

Original languageEnglish
Article number200463
JournalIntelligent Systems with Applications
Volume25
DOIs
StatePublished - Mar 2025

Bibliographical note

Publisher Copyright:
© 2024 The Authors

Keywords

  • Artificial intelligence
  • Computer vision
  • Corrosion
  • Deep learning
  • Generative models
  • Image classification
  • Oil and gas
  • Stable diffusion

Cite this