Lung cancer classification using vision transformer: a CRISP-DM approach with histopathological imaging

Wahyudi Setiawan, Wahyudi Agustiono, Yoga Dwitya Pramudita

Abstract


Lung cancer classification based on histopathological imaging plays a pivotal role in achieving early detection, accurate diagnosis, and effective treatment planning. Conventional diagnostic methods, including manual examination of histopathological slides and radiological imaging, are often subjective and time-consuming. Their limited ability to capture complex morphological patterns further constrains diagnostic accuracy. In response to these limitations, the present study employs the Cross-Industry Standard Process for Data Mining (CRISP-DM) framework to systematically evaluate the application of the Vision Transformer (ViT) in lung cancer classification. The LC25000 dataset, comprising three histopathological categories: adenocarcinoma, squamous cell carcinoma, and benign lung tissue. It was utilized for model evaluation. All images were resized to 224 × 224 pixels, and data augmentation techniques were applied to enhance generalization capability. The ViT model was implemented using TensorFlow and trained with the Adam optimizer (learning rate = 0.0001, batch size = 16, epochs = 50), employing early stopping and learning rate scheduling to mitigate overfitting. The proposed model achieved an overall accuracy of 0.97, with precision, recall, and F1-scores consistently exceeding 0.97. Class-level analysis demonstrated exceptional performance in identifying benign tissue (precision = 0.999, recall = 1.000, F1 = 0.999) and robust classification of malignant subtypes, including adenocarcinoma (F1 = 0.957) and squamous cell carcinoma (F1 = 0.959). These results emphasize the ViT’s strong capability in capturing global contextual features, surpassing conventional CNN-based methods that primarily rely on local feature extraction.

Full Text: PDF

Published: 2025-11-20

How to Cite this Article:

Wahyudi Setiawan, Wahyudi Agustiono, Yoga Dwitya Pramudita, Lung cancer classification using vision transformer: a CRISP-DM approach with histopathological imaging, Commun. Math. Biol. Neurosci., 2025 (2025), Article ID 136

Copyright © 2025 Wahyudi Setiawan, Wahyudi Agustiono, Yoga Dwitya Pramudita. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Commun. Math. Biol. Neurosci.

ISSN 2052-2541

Editorial Office: [email protected]

 

Copyright ©2025 CMBN