1 Introduction

COVID-19 is a highly contagious disease from the Coronavirus family and can cause numerous diseases. The World Health Organization (WHO) declared COVID-19 as the “Public Health Emergency of International Importance” on January 30, 2020, and as a pandemic on March 11, 2020 [1]. Billions of people around the world have been affected by the rapid spread of COVID-19 (SARS-CoV-2) and causing severe respiratory failure. SARS-CoV-2 infection can lead to severe pneumonia, potentially fatal [2, 3]. Until now (9 February 2021), there are a total of 106,125,682 confirmed cases, 2,320,497 of which resulted in death [1]. These numbers reveal the extent of the epidemic.

Early diagnosis is the most important factor directly affecting the spread of the virus in COVID-19. The best known diagnostic method for COVID-19 is the reverse transcription-polymerase chain reaction (RT-PCR) [1, 4]. On the other hand, studies show that this test has a sensitivity of about 60–70%, and therefore it is necessary to perform more than one test to make an accurate diagnosis [5,6,7]. Since the RT-PCR test has a sensitivity of 60–70%, symptoms are determined by using lung and chest radiology images in the diagnosis process [8, 9]. CT is a frequently used and successful imaging method in diagnosis [10]. The main purpose of computer-aided automatic diagnosis systems is to assist medical professionals in their decisions. Situations such as the high spread rate of the COVID-19 epidemic and an insufficient number of physicians increase the importance of automatic diagnosis approaches.

Findings obtained in CT tomography images have a very important place in the diagnosis stage. One of the most important findings that radiologists look at from COVID-19 images are ground-glass opacity [11, 12]. With ground-glass opacity, it appears to have a piece frosted appearance on the lung. Another obvious finding is the crazy-paving pattern [11, 12]. In a study investigating the radiological findings of COVID-19, the findings obtained from 88 patients were included [12]. According to the results obtained in the study, the most common patterns seen in chest CT are ground-glass opacity seen in 65% of the patients. Besides, 47% of the patients had air bronchogram, 35% had flat or irregular interlobular septal thickening, 32% had adjacent pleural thickening, and 10% had a crazy-paving pattern [12]. As can be seen here, there are significant differences between a healthy CT image and a CT image with COVID-19. It is aimed to determine the differences between computer-based approaches. For this, various image processing and machine learning methods are used.

Medical imaging technologies are frequently used in the diagnosis of different diseases. Image segmentation is a basic technique in image processing. Generally, in segmentation processes, the process is carried out by making use of the brightness and gray level of the image pixels, contrast, texture, and color. Segmentation is especially applied to medical images. In this study, which we carried out, segmentation was also used. The main motivation for this study is to propose a robust and highly accurate system that can diagnose COVID-19 through CT images. The innovations and contributions of the study are as follows;

  1. 1.

    To propose a new and effective approach based on machine learning and image processing that can diagnose COVID-19.

  2. 2.

    To create a decision support system that can support and assist medical professionals in their COVID-19 work.

  3. 3.

    Identify suspected COVID-19 cases accurately and quickly, thus playing an important role in timely quarantine and medical treatment.

  4. 4.

    It has been shown that a robust approach is presented for mixed data sets.

  5. 5.

    The proposed method is independent of the data set. This method is applicable for different datasets.

  6. 6.

    It has been shown that more successful results are obtained than other studies in the literature when segmented images are used.

  7. 7.

    A GUI accessible on GitHub has been created for the use of medical professionals and people who want to work in this field.

The following parts of the study are as follows; Sect. 2 provides a summary of the literature, Sect. 3 data set and method, Sect. 4 experimental study results, Sect. 5 discussion, and Sect. 6 conclusion.

2 Related Work

Since COVID-19 emerged, numerous academic studies have been conducted using radiological images like CT and X-ray. The rate of detecting COVID-19 from CT images is higher than in X-ray images [13]. X-ray images can be misleading in ground-glass opacities [14]. In other words, ground-glass opacities that are not visible in the X-ray image can be seen when imaging with CT. Some of the studies in the literature were performed on CT images [15,16,17,18,19], some on X-ray images [20,21,22,23,24], and some on both [25,26,27]. In our experimental studies, we use two different CT data sets and a mixed data set consisting of combining these data sets. There are many studies in the literature using CT datasets. In the first of these studies, Wang and others redesigned the COVID-Net deep learning architecture and applied it to two different CT images [28]. As the most important feature of their study, they stated that they reduced the data heterogeneity with the normalization process applied to each data set separately. They achieved 90.83% and 78.69% accuracy, respectively. Jaiswal et al. aimed to diagnose COVID-19 from CT images with the pre-trained DenseNet201-based CNN deep learning model [29]. In this study, which has the same data set as the data set we used in our study, the testing accuracy is 96.25%. They used DenseNet, a CNN model, in their studies. Yazdani et al. made the Resnet56 architecture for COVID detection over the residual attention network model [19]. In the study, an accuracy value of 92% was obtained. Silva et al. used two different data sets in their study. EfficientCovidNet achieved an 87.68% accuracy rate with deep learning architecture [17]. Unlike other studies, the performance of the system was measured with different data sets during the training and testing stages by the cross dataset. The accuracy rate obtained in this way is 56.16%. Details of more studies that do not use the same data set as our study but operate on CT and X-ray images are shown in Table 1.

Table 1 Some of the studies in the literature about COVID-19

The main motivation of this study is to propose a robust and rapid system capable of diagnosing COVID-19 through CT images. It has been understood that a generalizable and rapid approach should be put forward to provide support, especially to medical professionals. For this motivation, the studies carried out in the literature were examined and their deficient and positive aspects were investigated. As a result of the evaluations, it was seen that there was no study using segmented images in the studies carried out in the literature. In addition, it has been determined that mixed data sets, which are obtained by using different data sets together, are not used. In the light of these determinations, a general method has been tried to be put forward. It would not be wrong to say that this application, which was created and made available on GitHub, is the most important contribution of the study to this field. By selecting any CT image via the GUI, results can be obtained in less than 1 min. And this process is carried out with a very high accuracy rate.

3 Materials and Methods

Computed tomography (CT) images were used for experimental studies [15, 18, 38]. First dataset contains 1252 images with COVID-19 (+) and 1230 images with COVID-19 (−) [15, 18]. It can be said to be one of the largest CT COVID-19 datasets available to the public. This data set we use belongs to 60 patients with COVID-19 in Brazil. While 30 of these 60 people are COVID-19 (+), 30 of them are COVID-19 (−). The top two rows in Fig. 1 show COVID-19 (+), the bottom two rows show COVID-19 (−) images.

Fig. 1
figure 1

SARS-CoV-2 CT scan sample images for Dataset 1 [15, 18]

Our second dataset used in our study includes 349 COVID-19 (+) images and 397 COVID-19 (−) images. In that 37% of those with COVID-19 (+) are female and 63% are male. Sample images of this dataset are shown in Fig. 2. The top row in Fig. 2 show COVID-19 (+), the bottom row show COVID-19 (−) images.

Fig. 2
figure 2

COVID-19 sample images (Dataset 2)

3.1 Preprocessing and Segmentation

The CT images in the study are in png format. Also, the dimensions of the images range from 182 × 129 to 534 × 341. For this reason, the resizing process was applied to our data set in the first stage of the preprocessing step. In our study, the nearest method is used for the resize operation. This method is the default method used in the resize operation. The image has been resized to 320 × 256 dimensions, which are considered to be the most suitable size for all images using the average height and width. All images were then converted to gray levels. Gray-level images are easier and less costly to process because a single value is used to represent each pixel. After this conversion process, image sharpening and contrast enhancement were applied. After all these processes, the segmentation step was started. Gaussian Mixture Modeling Method based on Expectation–Maximization (EM) algorithm was used for the segmentation process. Figure 3 shows the pseudocode of the EM algorithm. The pixels of the image to be segmented in the initial phase are expressed by P. K indicates the number of clusters. First, with the K-mean clustering method, the initial values ​​of the parameters are determined. After the initial values of the parameters of the method are determined, the expectation and maximization steps (E and M) are performed repeatedly to determine the probability distribution parameters. As can be seen in Fig. 3, the probability estimation is performed with step E, while the parameters of the probability distribution are determined in step M.

Fig. 3
figure 3

Pseudocode of the expectation–maximization (EM) algorithm for Gaussian mixture modeling (GMM)

In our study, segmentation with the number of clusters of different sizes was applied and it was determined that the most suitable number of clusters was 3. For this reason, the number of clusters was applied as 3 in the segmentation process. A Wiener filter was used in the noise reduction stage [39]. This filter is generally a linear smoothing filter used in the frequency domain. The Wiener filter increases the blurring level of images. The Wiener filter is formulated as follows;

$$W\left(u, v\right)= \frac{H(u, v)}{{\left|H(u, v)\right|}^{2}+{S}_{nx}(u, v)}$$
(1)

\({S}_{nx}(u, v)\) represents the signal-to-noise ratio (SNR). H (u, v) represents the sinc function of the target pixel. The segmentation and noise removal stages on two sample images are shown in Fig. 4. When the figure is examined, the effect of three different neighborhood sizes on the image can be seen in the Wiener filter [3 3], [5 5], and [7 7]. As a result of the experimental studies, it has been seen that using a window size for the Wiener filter [3 3] leads to the best result. For this reason, [3 3] was used as the window size in our study. As you increase in the neighborhood, so does the blurring level of the image. Besides, it is possible to see from the same figure that the ground-glass opacity in the COVID-19 (+) image becomes more pronounced as a result of segmentation.

Fig. 4
figure 4

COVID and non-COVID sample images, segmentation, and noise reduction with different neighborhood sizes

The flow chart showing all the operations we have performed in the preprocessing and segmentation stage is shown in Fig. 5.

Fig. 5
figure 5

Preprocessing and segmentation process flow diagram

3.2 Feature Extraction and Classification

In this work, Histogram of Oriented Gradients (HOG), Gray-Level Co-Occurrence Matrix (GLCM), and Local Binary Pattern (LBP) methods were used in feature extraction. Since the method that gives the most successful results is HOG, the results obtained with this method will be given.

For feature extraction with HOG [40]; Horizontal and vertical Sobel filters are applied (Gx and Gy edge detection is provided) Then, the gradient size and gradient orientation angle (G and α) are calculated. Mentioned transactions can be seen in Fig. 6.

$$\begin{aligned} G_{x} &= I*\left[ { - 1\, 0\, 1} \right],\quad G_{y} = I*\left[ { - 1\, 0\, 1} \right]^{T} ,\\ G &= \sqrt {G_{x}^{2} + G_{y}^{2} } ,\quad \alpha = \arctan \frac{{G_{y} }}{{G_{x} }} \end{aligned}$$
(2)
Fig. 6
figure 6

HOG processing steps

As a result of experimental studies, it was determined that the cell size that gives the highest success for HOG is 32. When using the HOG method, it is necessary to increase the cell size to capture large-scale spatial information. However, when you increase the cell size, you may lose small-scale details. In other words, it is not possible to talk about a single optimal cell size for HOG. The best cell size for the HOG method depends on your data. This can be obtained as a result of experimental studies. In the classification stage of the study, training of the extracted features and performing the classification process are included. At this stage, K Nearest Neighbor (kNN), Ensemble kNN, ensemble subspace discriminant analysis, Kernel Support Vector Machines (k-SVM), Decision Tree, and kernel naive Bayes methods were used. The parameters of these methods used are given in Table 2.

Table 2 Parameters of the methods

kNN is a nonparametric classification method [41]. The prediction of a new sample is determined by considering the closest neighbors and assigned to the most similar cluster [42]. The number k here shows how many nearest neighbors to look after. The Minkowski distance was used for the distance to the neighbors. The Minkowski distance is a generalized version of the Euclidean and Manhattan distance measures. In this distance method, if the p-value is 1, it is the Manhattan distance measure, and if it is 2, it is the Euclidean distance measure. In our study, the p-value was used as 2. In other words, the Euclidean version of the Minkowski distance criterion was used. The k value used in the kNN method in our study was obtained as a result of experimental studies. The system was tested with 1, 3, 5, and 7 k values, and it was seen that the most successful results were obtained for the 1 value of k.

Ensemble methods perform the process of combining weak classifiers to obtain strong classifiers [43, 44]. Ensemble kNN is one of these methods. In high-dimensional data, the kNN is heavily affected by noise. To improve the performance of the nearest neighbor classifier with ensemble methods, a method is proposed in which each classifier of the ensemble can only access a random subset of features [45]. Another ensemble method is the ensemble subspace discriminant analysis method, which eliminates the weaknesses of the Linear Discriminant analysis method and creates a stronger method. One of the shortcomings of this method is that weak discrimination capability may arise due to the random selection of subsets in the random subspaces method. The majority voting (MV) method is used to find a solution to this weakness of RSM. Under normal circumstances, a single classifier in the ensemble can use only a small portion of the features in the feature space, but thanks to this method, each classifier can classify any new unknown sample [46, 47].

Decision trees, another method we use in our study, is a classification model in the form of a tree structure consisting of decision nodes and leaf nodes according to properties and goals. A decision tree algorithm is developed by breaking the data set into small pieces. The naive Bayes classifier method is a method that has been used for many years and is based on probability theory. A naive Bayes classifier can also be considered as a Bayesian network where each attribute is conditionally independent of each other and the concept to be learned is conditionally dependent on all these attributes [48].

The SVM is the last classifier that gave our study’s name. SVM is a widely used method in medical image processing applications. The main purpose of the SVM algorithm is to find a hyper-plane that separates data [49]. The standard SVM algorithm was created for binary classification problems. However, today SVM method is frequently used for multi-class problems. Support vector machines algorithms use kernel functions [50]. Kernel functions map data to a different feature space [51]. The most commonly used kernel functions in the literature are Gaussian, Radial Basis, Sigmoid and Polynomial functions [52]. In our study, by experimenting with all these kernel functions, we determined that the function that gives the best result is the polynomial kernel function. For d degree polynomials, the polynomial kernel is defined as;

$$K\left(x,y\right)={({x}^{T}y+c)}^{d}$$
(3)

where x and y are feature vectors calculated from training or test samples in the input space. c ≥ 0 is a free parameter controlling the influence of lower-order versus higher-order in the polynomial. When c = 0, the kernel is called homogeneous [3]. d indicates the degree of the polynomial. In our study, it is also called cubic kernel because the d value was chosen as 3.

In the classification phase of our study, tenfold cross-validation (CV) was used to determine the training and testing data. In this method, 10% of the data is reserved for testing and 90% for training in each iteration, and this process is repeated 10 times. It is tried to prevent the memorization of data by changing the test part by 10% in each iteration.

The final success rate is obtained by taking the average of the success rates obtained as a result of 10 iterations. Figure 7 shows the general flow diagram of our study. In the figure, all process steps are shown one by one on a COVID-19 (+) image selected from the data set. When the figure is examined, you can have an idea of what the effect of each process step is.

Fig. 7
figure 7

Flow chart of the proposed method

The pseudocode of the proposed method can be seen in Fig. 8. After applying the pseudocode seen in Fig. 8 to the datasets used in the study, classification success is achieved.

Fig. 8
figure 8

Pseudocode of the proposed method

4 Experimental Results

In this study, GT730 4 GB video card, 16 GB memory, and i5 processor were used. MATLAB platform is used for the coding process. Accuracy, Recall, Specificity, Precision, Negative Predictive Value (NPV), Matthews Correlation Coefficient, and AUC values were applied to evaluation. Accuracy is the ratio of correct estimates to total estimates, as seen in formula 2. If we have a symmetrical data set, i.e., positive and negative classes show a balanced distribution, this metric is suitable for evaluation [53]. Another metric is recall, also known as sensitivity. This metric is preferred when identifying positive cases is crucial. It is useful in situations such as detecting whether there is a fatal disease or not. Represents the ratio of true negatives to total negatives within the specificity data set [54]. This metric is preferred in situations where it is important to detect all true negatives. For example, if there is a situation where people with positive test results will be punished, it would be more meaningful to look at the results of this metric. The precision metric is the ratio of true positives to total predicted positives. It is preferred in cases where false positives are riskier. NPV is preferred in situations where false negatives are more risky, unlike Precision [55]. The Matthews Correlation Coefficient (MCC) is used in machine learning as a measure of the quality of binary classification [56]. It is generally considered a balanced metric that can be used even when classes are of very different sizes [57]. MCC returns values between − 1 and 1. A coefficient of 1 indicates a perfect prediction, while 0 is no better than a random guess, and -1 indicates a discrepancy between the prediction and the observation. With all these metrics, ROC curves are also given for the methods that produce the most successful results. AUC values ​​are performance indicators in the ROC curves shown in Fig. 10. It is also stated that the AUC value is more sensitive than the accuracy of performance measurement [58].

These performance metrics are shown in (4)–(9). TP, TN, FP, and FN mean true positive, true negative, false positive, and false negative, respectively.

$$\mathrm{Accuracy}=\frac{\mathrm{TP}+\mathrm{TN}}{(\mathrm{TP}+\mathrm{TN}+\mathrm{FP}+\mathrm{FN})}*100$$
(4)
$$\mathrm{Recall}=\frac{\mathrm{TP}}{(\mathrm{TP}+\mathrm{FN})}*100$$
(5)
$$\mathrm{Specificity}=\frac{\mathrm{TN}}{(\mathrm{TN}+\mathrm{FP})}*100$$
(6)
$$\mathrm{Precision}=\frac{\mathrm{TP}}{(\mathrm{TP}+\mathrm{FP})}*100$$
(7)
$$\mathrm{NPV}=\frac{\mathrm{TN}}{(\mathrm{TN}+\mathrm{FN})}*100$$
(8)
$$\mathrm{MCC}=\frac{\mathrm{TP}*\mathrm{TN}-\mathrm{FP}*\mathrm{FN}}{\sqrt{(TP+FP)(TP+FN)(TN+FP)(TN+FN)}}$$
(9)

Results of four of the six classification methods used in Table 3 that produced the most successful results are given. When the table is examined, it would be correct to say that the kernel SVM method, which misdiagnosed 17 of the positive samples and 21 of the negative samples, produced the best results. While this method has 38 defective detections in total, the kNN method has 43, and the ensemble subspace kNN method has 44 defective detections.

Table 3 Confusion matrices were obtained for the four most successful methods for Dataset-1

Table 4 shows the confusion matrices of the most successful classification methods obtained for Dataset 2. Here, as in Dataset 1, the kernel SVM method produced the most successful result. In the results obtained from the Kernel SVM method, it is seen that 102 samples were classified incorrectly and 644 samples were classified correctly.

Table 4 Confusion matrices were obtained for the four most successful methods for Dataset-2

In our study, a mixed data set was created, unlike many studies in this area. This data set was created by combining the samples from two different data sets we used. The confusion matrix of the results obtained for the mixed data set is shown in Table 5. It is possible to see that the highest success here was obtained with the kernel SVM method with an accuracy rate of 94.5%, followed by the subspace kNN method with 94.1%. It is possible to say that the success rates achieved are remarkably good.

Table 5 Confusion matrices were obtained for the four most successful methods for Mixed Dataset

When Fig. 9 is examined, it is seen that some positive images are classified as negative, and some negative images are classified as positive. When looking at the b and c images, which are negative, it is seen that there are various anomalies in the lungs. These people may have had different ailments before or may have habits that adversely affect their lungs (for example, smoking). And the structure of these lung images is similar to that of patients with COVID-19. On the other hand, a and d images that are positive are classified as negative by the system. When looking at the image in d from these images, it is understood that any anomaly has not yet occurred clearly on the image. This may also be the early stages of the disease. Therefore, the system also predicted this image incorrectly.

Fig. 9
figure 9

Examples of misclassified CT images

Table 6 shows the result values of six different metrics that we have evaluated when the purposes of use above are more meaningful. It can be seen from the table that the k-SVM method produces more successful accuracy values compared to other methods. However, it is also noteworthy that the precision results of kNN and ensemble kNN methods are higher than k-SVM. It is also a point to be considered that naive Bayes and decision tree methods produce very low results compared to other methods. In the last column of Table 4, the number of incorrectly classified samples can be seen. The kNN, k-SVM, and ensemble kNN methods produced 43, 38, and 44 erroneous results, respectively. When the MCC values are examined, it is seen that the k-SVM and kNN methods give the best results with 0.97 values.

Table 6 Classification results according to different metrics for Dataset 1

The results of 6 different metrics obtained for Dataset 2 are shown in Table 7. Similar to the previous table, it is seen that the most successful results are obtained with the k-SVM method. Again, as can be seen from the table, the MCC value in the k-SVM method is much higher than in other methods.

Table 7 Classification results according to different metrics for Dataset 2

The results for the mixed data set are shown in Table 8. According to the results obtained with mixed data, the highest rate of correct classification was in the k-SVM method with 94.5% accuracy and 0.89 MCC values.

Table 8 Classification results according to different metrics for mixed dataset

It was stated that ROC curves are an effective method used in performance measurements. The graphs below also show the ROC curves of four different methods that produce the most successful results. When we look at the AUC values below these curves, the k-SVM is the most successful method with a 1.00 AUC value for dataset 1 (Fig. 10).

Fig. 10
figure 10

ROC curves results of classification methods Dataset-1

As seen in Fig. 11, the AUC value for the second data set was obtained with the k-SVM method and its value was 0.91. The method that follows this method as the most successful method is subspace kNN.

Fig. 11
figure 11

ROC curves results of classification methods Dataset-2

In Table 6, it is seen that the accuracy value of the kNN method is higher than the ensemble methods. However, when looking at Fig. 10, it is striking that the AUC values of the ensemble methods are higher than the kNN method. From this, it can be inferred that making a direct performance comparison based on accuracy is not always appropriate.

The model we realized in our study was recorded and a simple GUI was created using this model as shown in Fig. 12. We have stated before that we aim to support medical professionals. The main purpose of this GUI is to provide an application that can help experts in their decisions. In practice, all the medical professionals need to do is click Load an Image for COVID-19 Diagnosis and select one of the CT images on their computer.

Fig. 12
figure 12

Examples of tests performed with GUI

After that, the selected image is tested with the trained model, and the result predicted by the system is written in the Text Area section under the GUI. Also, a segmented version of the image is shown on the right of the original image. It is thought that this segmented image will contribute to the evaluation of the patient’s CT image. As a result of synchronizing the result CT images sent from the tomography devices to the computer used by the radiologist or to the hospital information management system software, the result of the COVID-19 diagnosis produced by our system can be automatically displayed on the screen of the radiologist.

5 Discussion

The main aim of this study is to support medical professionals in their decisions. The results obtained in our study support this goal. Also, thanks to the simple GUI developed for medical professionals to use easily, the diagnosis of COVID-19 from CT images can be made with 98.5%, 86.3%, and 94.5% accuracy for Dataset-1, Dataset-2, and Mixed Dataset, respectively. It is seen that the realization of the results obtained in the study on the images obtained by the segmentation process has a positive effect on the results.

Table 9 shows the success rates and methods used by those who use the data set we use among the studies on COVID-19 in the literature. It is seen that from the table, the results in our study are higher than the results obtained in other studies. This highlights the originality and effectiveness of our study.

Table 9 Comparison of studies in the literature

Training and prediction times are shown in Table 10 to get a better idea about the methods we use in our study. When the results obtained in the study are analyzed, it is possible to see that the kernel SVM method achieves the highest accuracy rate. The closest followers of this method are kNN and subspace kNN methods. However, these successes in accuracy rates do not mean that these methods are faster than other methods. The kernel SVM method produces results faster than other classifiers used in the training phase. In estimating a new example, the k-naive Bayes and ensemble kNN methods are the methods that produce the fastest results. Decision trees take the longest time to estimate a new sample. However, it is possible to say that the estimation time of the k-SVM method, which is the method with the highest accuracy, can also provide support to medical professionals in making quick decisions.

Table 10 Training and estimation times of the classification methods used

6 Conclusion

The method we propose in this study, which we have carried out, is to ensure the detection of the COVID-19 virus in CT images. A CT image given to the proposed method can be classified as COVID-19 positive or negative. An effective model that enables the detection of the COVID-19 virus from CT images on segmented images is proposed. Two different data sets were used in the study. In addition, a mixed data set was created by combining these two different datasets. The datasets used are the most comprehensive and publicly accessible. In our study, a model was created by using many methods of image processing and artificial learning methods. A tenfold cross-validation method was applied to test the accuracy of the model. According to the model results, the highest success value was obtained for dataset 1 using the kernel SVM method with an accuracy of 98.5%. For data set 2, this rate was determined as 86.3%. In the mixed data set, a high accuracy rate of 94.5% was obtained. Matthews Correlation Coefficient (MCC) metric was also used in our study and 0.97, 0.73, 0.89 MCC values were obtained for Dataset 1, Dataset 2, Mixed Dataset, respectively. This shows the effectiveness of the results obtained. Also, a user-friendly GUI was created from the created model. Thanks to this GUI, medical professionals will be able to perform the segmentation of CT images and will also receive an answer to the question of whether COVID is positive or negative by the system.

Considering the results obtained, it is seen that the system we propose achieves higher success than other studies in the literature. Accurate and rapid detection of suspected COVID-19 cases, and thus timely quarantine and medical treatment is the important process. The main purpose of our study is to contribute to this situation and this contribution is achieved with the resulting GUI.

There is no detailed clinical data related to the data set we used in our study. We can say this is a limitation of our work. Adequate demographic and clinical information of patients can also be made in many different analyzes. For example, if there were data on the patient’s disease history or chronic disease information, evaluations could be made on the interactions of COVID-19 with other diseases. We plan to create a data set suitable for this situation and to implement these different analyzes in our future studies. Another limitation is that the CT images in the study do not have information on which day of the disease it belongs to. If we have such information, analysis of the daily course of the disease can also be done with computer-aided studies on CT images. In addition to these, future goals of this study include expanding the scope of this study with different imaging methods (x-Ray, Ultrasound, MR, etc.). In this way, an application that produces solutions for all kinds of images will be obtained.