This article presents a methodology for the classification of bladder cancer using Vision Transformer (ViT) models. The dataset used for this research is a proprietary dataset established by the team at Zagazig University in Egypt, comprising of 2629 images classified into three classes. The dataset is pre-processed and divided into suitable subsets for training, validation, and testing purposes. The models are trained on the pre-processed dataset and compared based on their performance metrics. All methodologies employed in this study were meticulously executed in alignment with applicable guidelines and regulations.