Machine Learning
A. Mohamadi; M. Habibi; F. Parandin
Abstract
Background and Objectives: Metastatic castration-sensitive prostate cancer (mCSPC) represents a critical juncture in the management of prostate cancer, where the accurate prediction of the onset of castration resistance is paramount for guiding treatment decisions.Methods: In this study, we underscore ...
Read More
Background and Objectives: Metastatic castration-sensitive prostate cancer (mCSPC) represents a critical juncture in the management of prostate cancer, where the accurate prediction of the onset of castration resistance is paramount for guiding treatment decisions.Methods: In this study, we underscore the power and efficiency of auto-ML models, specifically the Random Forest Classifier, for their low-code, user-friendly nature, making them a practical choice for complex tasks, to develop a predictive model for the occurrence of castration resistance events (CRE (. Utilizing a comprehensive dataset from MSK (Clin Cancer Res 2020), comprising clinical, genetic, and molecular features, we conducted a comprehensive analysis to discern patterns and correlations indicative of castration resistance. A random forest classifier was employed to harness the dataset's intrinsic interactions and construct a robust predictive model. Results: We used over 18 algorithms to find the best model, and our results showed a significant achievement, with the developed model demonstrating an impressive accuracy of 75% in predicting castration resistance events. Furthermore, the analysis highlights the importance of specific features such as 'Fraction Genome Altered ‘and the role of prostate specific antigen (PSA) in castration resistance prediction.Conclusion: Corroborating these findings, recent studies emphasize the correlation between high 'Fraction Genome Altered' and resistance and the predictive power of elevated PSA levels in castration resistance. This highlights the power of machine learning in improving outcome predictions vital for prostate cancer treatment. This study deepens our insights into metastatic castration-sensitive prostate cancer and provides a practical tool for clinicians to shape treatment strategies and potentially enhance patient results.
Artificial Intelligence
S. Nemati
Abstract
Background and Objectives: Community question-answering (CQA) websites have become increasingly popular as platforms for individuals to seek and share knowledge. Identifying users with a special shape of expertise on CQA websites is a beneficial task for both companies and individuals. Specifically, ...
Read More
Background and Objectives: Community question-answering (CQA) websites have become increasingly popular as platforms for individuals to seek and share knowledge. Identifying users with a special shape of expertise on CQA websites is a beneficial task for both companies and individuals. Specifically, finding those who have a general understanding of certain areas but lack expertise in other fields is crucial for companies who are planning internship programs. These users, called dash-shaped users, are willing to work for low wages and have the potential to quickly develop into skilled professionals, thus minimizing the risk of unsuccessful recruitment. Due to the vast number of users on CQA websites, they provide valuable resources for finding individuals with various levels of expertise. This study is the first of its kind to directly classify CQA users based solely on the textual content of their posts. Methods: To achieve this objective, we propose an ensemble of advanced deep learning algorithms and traditional machine learning methods for the binary classification of CQA users into two categories: those with dash-shaped expertise and those without. In the proposed method, we used the stack generalization to fuse the results of the dep and machine learning methods. To evaluate the effectiveness of our approach, we conducted an extensive experiment on three large datasets focused on Android, C#, and Java topics extracted from the Stack Overflow website. Results: The results on four datasets of the Stack Overflow, demonstrate that our ensemble method not only outperforms baseline methods including seven traditional machine learning and six deep models, but it achieves higher performance than state-of-the-art deep models by an average of 10% accuracy and F1-measure. Conclusion: The proposed model showed promising results in confirming that by using only their textual content of questions, we can classify the users in CQA websites. Specifically, the results showed that using the contextual content of the questions, the proposed model can be used for detecting the dash-shaped users precisely. Moreover, the proposed model is not limited to detecting dash-shaped users. It can also classify other shapes of expertise, such as T- and C-shaped users, which are valuable for forming agile software teams. Additionally, our model can be used as a filter method for downstream applications, like intern recommendations.
Artificial Intelligence
I. Zabbah; K. Layeghi; Reza Ebrahimpour
Abstract
Background and Objectives: COVID-19 disease still has a devastating effect on society health. The use of X-ray images is one of the most important methods of diagnosing the disease. One of the challenges specialists are faced is no diagnosing in time. Using Deep learning can reduce the diagnostic error ...
Read More
Background and Objectives: COVID-19 disease still has a devastating effect on society health. The use of X-ray images is one of the most important methods of diagnosing the disease. One of the challenges specialists are faced is no diagnosing in time. Using Deep learning can reduce the diagnostic error of COVID-19 and help specialists in this field. Methods: The aim of this study is to provide a method based on a combination of deep learning(s) in parallel so that it can lead to more accurate results in COVID-19 disease by gathering opinions. In this research, 4 pre-trained (fine-tuned) deep model have been used. The dataset of this study is X-ray images from Github containing 1125 samples in 3 classes include normal, COVID-19 and pneumonia contaminated.Results: In all networks, 70% of the samples were used for training and 30% for testing. To ensure accuracy, the K-fold method was used in the training process. After modeling and comparing the generated models and recording the results, the accuracy of diagnosis of COVID-19 disease showed 84.3% and 87.2% when learners were not combined and experts were combined respectively. Conclusion: The use of machine learning techniques can lead to the early diagnosis of COVID-19 and help physicians to accelerate the healing process. This study shows that a combination of deep experts leads to improved diagnosis accuracy.
Geographic Information System
E. Norouzi; S. Behzadi
Abstract
Background and Objectives: Climate phenomena such as quantity of surface evaporation are affected by many environmental factors and parameters, which makes modeling and data mining difficult. On the other hand, the estimation of surface evaporation for a target station can be difficult as a result of ...
Read More
Background and Objectives: Climate phenomena such as quantity of surface evaporation are affected by many environmental factors and parameters, which makes modeling and data mining difficult. On the other hand, the estimation of surface evaporation for a target station can be difficult as a result of partial or complete lack of local meteorological data under many conditions. In this regard, satellite imagery can play a special role in modeling and data mining of climatic phenomena, because of their significant advantages, including availability and their potential analysis. Therefore, addressing the improvement and expansion of machine learning methods and modeling algorithms along with remote sensing data is inevitable.Methods: In this research, we intend to study the ability of 11 machine-learning modeling algorithms to model data and surface evaporation phenomena using satellite imagery. We used two methods to prepare the database: PCA and its opposite method using standard deviation and correlation.Results: The calculation of the Root Mean Squared Error (RMSE) indicated that, in general, the use of the PCA method has a better result in preparing and reducing the dimensions of large databases for all methods of machine learning. The SEGPR model was ranked first with the least error (93.49%) in the Principal Component Analysis (PCA) method, and the Artificial Neural Network (ANN) model performed well in both data preparation methods (93.42, 93.38), and the Classification-Tree-Coarse model had the highest error in both methods (92.66, 92.67).Conclusion: Consequently, it can be said that by changing the methods of database preparation in order to train models, the modeling results can be changed effectively.
Data Mining
R. Asgarnezhad; A. Monadjemi; M. SoltanAghaei
Abstract
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are ...
Read More
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment target and seek for tweets containing positive, negative, or neutral opinions. This is remarkable for consumers to investigate the products before purchase automatically.Methods: This paper suggests a model for sentiment classification. The goal of this model is to investigate what is the role of n-grams and sampling techniques in Sentiment Classification application using an ensemble method on Twitter datasets. Also, it examines both binary and multiple classifications, which are classified datasets into positive, negative, or neutral classes.Results: Twitter Classification is an outstanding problem, which has very few free resources and not available due to modified authorization status. However, all Twitter datasets are not labeled and free, except for our applied dataset. We reveal that the combination of ensemble methods, sampling techniques, and n-grams can improve the accuracy of Twitter Sentiment Classification.Conclusion: The results confirmed the superiority of the proposed model over state-of-the-art systems. The highest results obtained in terms of accuracy, precision, recall, and f-measure.
Computer Vision
M. Fakhredanesh; S. Roostaie
Abstract
Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition ...
Read More
Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition problem. Many of these frameworks suppose that each video sequence includes only one action class. Therefore, we need to break down a video sequence into sub-sequences, each containing only a single action class.Methods: In this paper, we develop an unsupervised action change detection method to detect the time of actions change, without classifying the actions. In this method, a silhouette-based framework will be used for action representation. This representation uses xt patterns. The xt pattern is a selected frame of xty volume. This volume is achieved by rotating the traditional space-time volume and displacing its axes. In xty volume, each frame consists of two axes (x) and time (t), and y value specifies the frame number.Results: To test the performance of the proposed method, we created 105 artificial videos using the Weizmann dataset, as well as time-continuous camera-captured video. The experiments have been conducted on this dataset. The precision of the proposed method was 98.13% and the recall was 100%.Conclusion: The proposed unsupervised approach can detect action changes with a high precision. Therefore, it can be useful in combination with an action recognition method for designing an integrated action recognition system.