Artificial Intelligence
K. Moeenfar; V. Kiani; A. Soltani; R. Ravanifard
Abstract
Background and Objectives: In this paper, a novel and efficient unsupervised machine learning algorithm named EiForestASD is proposed for distinguishing anomalies from normal data in data streams. The proposed algorithm leverages a forest of isolation trees to detect anomalous data instances. Methods: The proposed method, EiForestASD, incorporates an isolation forest as an adaptable detector model that adjusts to new data over time. To handle concept drift in the data stream, a window-based concept drift detection mechanism is employed that discards only those isolation trees that are incompatible with the new concept. The proposed method is implemented using the Python programming language and the Scikit-Multiflow library. Results: Experimental evaluations were conducted on six real-world and two synthetic data streams. The results reveal that EiForestASD reduces computation time by 19% and improves the anomaly detection rate by 9% compared to the baseline method iForestASD. These results highlight the efficacy and efficiency of EiForestASD for anomaly detection in data streams. Conclusion: The EiForestASD method handles concept change using an intelligent strategy in which only those trees of the detector model that are incompatible with the new concept are removed and reconstructed. This modification of the concept drift handling mechanism significantly reduces computation time and improves anomaly detection accuracy.
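The partial-tree-replacement idea described above can be pictured with a short sketch. This is our own approximation in Python, using scikit-learn's IsolationForest in place of the paper's Scikit-Multiflow implementation; the drift test (each tree's anomaly rate on the newest window) and all parameters are assumptions, not the paper's exact criterion.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

class WindowedIsolationForest:
    """Forest of single-tree detectors; incompatible trees are rebuilt per window."""

    def __init__(self, n_trees=10, drift_tol=0.3, seed=0):
        self.n_trees, self.drift_tol = n_trees, drift_tol
        self.rng = np.random.default_rng(seed)
        self.trees = []

    def _new_tree(self, X):
        seed = int(self.rng.integers(1 << 30))
        return IsolationForest(n_estimators=1, random_state=seed).fit(X)

    def fit_window(self, X):
        if not self.trees:  # first window: build the whole forest
            self.trees = [self._new_tree(X) for _ in range(self.n_trees)]
            return
        # Rebuild only trees whose anomaly rate on the new window is implausibly
        # high, keeping the rest -- the partial-replacement idea from the abstract.
        self.trees = [
            t if np.mean(t.predict(X) == -1) < self.drift_tol else self._new_tree(X)
            for t in self.trees
        ]

    def score(self, X):
        # Mean decision_function across trees; lower values are more anomalous.
        return np.mean([t.decision_function(X) for t in self.trees], axis=0)
```

Because compatible trees survive a drift, only part of the forest is retrained per window, which is where the claimed computation-time saving would come from.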
Artificial Intelligence
S. Nemati
Abstract
Background and Objectives: Community question-answering (CQA) websites have become increasingly popular as platforms for individuals to seek and share knowledge. Identifying users with a particular shape of expertise on CQA websites is a beneficial task for both companies and individuals. Specifically, finding those who have a general understanding of certain areas but lack expertise in other fields is crucial for companies planning internship programs. These users, called dash-shaped users, are willing to work for low wages and have the potential to quickly develop into skilled professionals, thus minimizing the risk of unsuccessful recruitment. Due to the vast number of users on CQA websites, these sites provide valuable resources for finding individuals with various levels of expertise. This study is the first of its kind to directly classify CQA users based solely on the textual content of their posts. Methods: To achieve this objective, we propose an ensemble of advanced deep learning algorithms and traditional machine learning methods for the binary classification of CQA users into two categories: those with dash-shaped expertise and those without. In the proposed method, we used stacked generalization to fuse the results of the deep learning and traditional machine learning methods. To evaluate the effectiveness of our approach, we conducted extensive experiments on three large datasets focused on Android, C#, and Java topics extracted from the Stack Overflow website. Results: The results on four Stack Overflow datasets demonstrate that our ensemble method not only outperforms baseline methods, including seven traditional machine learning and six deep models, but also achieves higher performance than state-of-the-art deep models by an average of 10% in accuracy and F1-measure. Conclusion: The proposed model showed promising results, confirming that users of CQA websites can be classified using only the textual content of their questions.
Specifically, the results showed that, using the contextual content of the questions, the proposed model can detect dash-shaped users precisely. Moreover, the proposed model is not limited to detecting dash-shaped users; it can also classify other shapes of expertise, such as T- and C-shaped users, which is valuable for forming agile software teams. Additionally, our model can be used as a filtering step for downstream applications, such as intern recommendation.
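The stacked-generalization step can be sketched as follows. This is a minimal stand-in with classical scikit-learn base learners over TF-IDF features; the toy posts, labels, and choice of base models are ours (the paper's ensemble also fuses deep models), so treat it as the shape of the approach rather than the authors' pipeline.

```python
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical post texts; label 1 marks a dash-shaped (broad but shallow) user.
posts = [
    "how do i center a div in css",
    "basic question about java for loops",
    "simple android button click example",
    "advanced jvm garbage collection tuning internals",
    "custom memory allocator for unsafe c# code",
    "optimizing jit compilation inside the clr",
]
labels = [1, 1, 1, 0, 0, 0]

model = make_pipeline(
    TfidfVectorizer(),
    StackingClassifier(
        estimators=[
            ("rf", RandomForestClassifier(random_state=0)),
            ("svm", LinearSVC(random_state=0)),
        ],
        final_estimator=LogisticRegression(),  # meta-learner fusing base outputs
        cv=2,  # out-of-fold predictions feed the meta-learner
    ),
)
model.fit(posts, labels)
```

The meta-learner sees only the base models' out-of-fold predictions, which is what lets stacking combine heterogeneous learners (deep or shallow) without leaking training labels.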
Artificial Intelligence
H. Karim Tabbahfar; F. Tabib Mahmoudi
Abstract
Background and Objectives: Considering drought and global warming, it is very important to monitor changes in water bodies for surface water management and to preserve water resources in the natural ecosystem. For this purpose, appropriate spectral indices have high capability to distinguish surface water bodies from other land covers. This research pays special attention to the effect of the different types of land cover around water bodies. For this reason, two different water bodies, a lake and a wetland, have been used to evaluate the implementation results. Methods: The main objective of this research is to evaluate the capability of the genetic algorithm in the optimal selection of spectral indices extracted from a Sentinel-2 satellite image in order to distinguish surface water bodies in two case studies: 1) the pure water behind the Karkheh dam and 2) the Shadegan wetland, where water is mixed with vegetation. In this regard, the set of optimal indices is obtained with the genetic algorithm followed by a support vector machine (SVM) classifier. Results: Evaluation of the classification results based on the optimally selected spectral indices showed that the overall accuracy and Kappa coefficient of the recognized surface water bodies are 98.18% and 0.9827 for the Karkheh dam and 98.04% and 0.93 for the Shadegan wetland, respectively. Each of the spectral indices measured in the two study areas was also evaluated using a decision tree (DT) classifier. Compared with the best obtained DT classification results, the indices selected by the genetic algorithm followed by the SVM classifier improve overall accuracy by 1.42% in the Karkheh dam area and 1.56% in the Shadegan wetland area.
Moreover, the obtained classification results are superior to those of a Random Forest classifier using the optimized set of spectral features. Conclusion: Applying the genetic algorithm to the spectral indices yielded two optimal sets of effective indices that achieve the highest accuracy in classifying water bodies against other land cover objects in the study areas. Considering collective performance, the genetic algorithm selects an optimal set of indices that can detect water bodies more accurately than any single index.
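The wrapper-style selection described above can be sketched as a simple genetic algorithm over a binary mask of spectral indices, with SVM cross-validation accuracy as the fitness. The operators, population size, and toy data below are our assumptions, not the paper's configuration.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

def ga_select(X, y, n_gen=10, pop_size=12, p_mut=0.1, seed=0):
    """Return a boolean mask over the columns of X chosen by a simple GA."""
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    pop = rng.integers(0, 2, size=(pop_size, n))

    def fitness(mask):
        if mask.sum() == 0:
            return 0.0  # an empty index set is useless
        return cross_val_score(SVC(), X[:, mask.astype(bool)], y, cv=3).mean()

    for _ in range(n_gen):
        scores = np.array([fitness(m) for m in pop])
        elite = pop[np.argsort(scores)[::-1][: pop_size // 2]]  # truncation selection
        children = []
        while len(children) < pop_size - len(elite):
            a, b = elite[rng.integers(len(elite), size=2)]
            cut = int(rng.integers(1, n))                 # one-point crossover
            child = np.concatenate([a[:cut], b[cut:]])
            flip = rng.random(n) < p_mut                  # bit-flip mutation
            child[flip] = 1 - child[flip]
            children.append(child)
        pop = np.vstack([elite, np.array(children)])
    scores = np.array([fitness(m) for m in pop])
    return pop[scores.argmax()].astype(bool)

# Toy stand-in for per-pixel spectral indices: only the first two columns matter.
rng = np.random.default_rng(1)
X = rng.normal(size=(60, 6))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
mask = ga_select(X, y)
```

Because the fitness scores the whole index subset at once, the GA can reward combinations of indices that no single index matches, which is the "collective performance" point made in the conclusion.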
Artificial Intelligence
K. Ali Mohsin Alhameedawi; R. Asgarnezhad
Abstract
Background and Objectives: Autism is a well-known disorder that can occur at any age. There is increasing interest in applying machine learning techniques to diagnose this incurable condition. However, the poor quality of most datasets hampers the production of efficient models for forecasting autism, and the lack of suitable pre-processing methods leads to inaccurate and unstable results. To diagnose the disease, techniques that improve classification performance have yielded better results, and other computerized technologies have been applied. Methods: An effective and high-performance model was introduced to address pre-processing problems such as missing values and outliers. Several base classifiers were applied to a well-known autism dataset in the classification stage. Among many alternatives, we observed that combining missing-value replacement with the mean and selection with Random Forest and Decision Tree techniques provided our highest results. Results: The best obtained accuracy, precision, recall, and F-measure values of the suggested MVO-Autism model were all equal to 100%, outperforming its counterparts. Conclusion: The obtained results reveal that the suggested model can increase classification performance in terms of the evaluation metrics. The results are evidence that the MVO-Autism model outperforms its counterparts, because it overcomes both pre-processing problems.
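The pre-processing-plus-classifier recipe can be pictured as a two-step scikit-learn pipeline. This is a minimal sketch with toy data, assuming mean replacement of missing values followed by a Random Forest; the actual MVO-Autism pipeline also covers outliers and the Decision Tree variant.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline

# Toy feature matrix with missing values (np.nan) and binary labels.
X = np.array([[1.0, np.nan], [2.0, 3.0], [np.nan, 4.0], [5.0, 6.0]])
y = np.array([0, 0, 1, 1])

model = make_pipeline(
    SimpleImputer(strategy="mean"),         # replace missing values with the mean
    RandomForestClassifier(random_state=0),
)
model.fit(X, y)
```

Keeping imputation inside the pipeline ensures the means are learned from training data only, so the same replacement is applied consistently at prediction time.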
Artificial Intelligence
R. Mohammadi Farsani; E. Pazouki
Abstract
Background and Objectives: Many real-world problems are time series forecasting (TSF) problems. Therefore, providing more accurate and flexible forecasting methods has always been of interest to researchers. An important issue in forecasting a time series is the prediction time interval. Methods: In this paper, a new method is proposed for time series forecasting that can make more accurate predictions at larger intervals than other existing methods. Neural networks are an effective tool for estimating time series due to their nonlinearity and their applicability to different time series without specific information about them. A variety of neural networks have been introduced so far, some of which have been used for forecasting time series. Encoder-decoder networks are one example: an encoder network encodes the input data based on a particular pattern, and a decoder network then decodes the output based on the encoded input to produce the desired output. Since these networks have a better understanding of the context, they provide better performance. The Transformer is an example of this type of network. A Transformer neural network based on self-attention is presented that has special capability for forecasting time series problems. Results: The proposed model has been evaluated through experimental results on two benchmark real-world TSF datasets from different domains. The experimental results show that, compared to other well-known methods, the proposed model is up to eight times more robust in long-term estimation and achieves about a 20 percent improvement in estimation accuracy. Computational complexity has also been significantly reduced. Conclusion: The proposed tool can perform better than, or compete with, other introduced methods with less computational complexity and longer estimation intervals.
It was also found that, with a better configuration of the network and better adjustment of the attention mechanism, more desirable results can be obtained for any specific problem.
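At the core of the Transformer is scaled dot-product self-attention. The NumPy sketch below shows that computation over a history window; the window length and dimensions are illustrative, and the random matrices stand in for learned projection weights.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """X: (seq_len, d_model); returns one context vector per time step."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])            # scaled dot products
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)     # softmax over time steps
    return weights @ V                                # attention-weighted values

rng = np.random.default_rng(0)
seq, d = 26, 8                                        # e.g. a 26-step history window
X = rng.normal(size=(seq, d))
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
```

Because every time step attends to every other in one matrix product, the model can relate distant points of the series directly, which is what makes attention attractive for longer prediction intervals than step-by-step recurrent models.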
Artificial Intelligence
M. Yousefi; R. Akbari; S. M. R. Moosavi
Abstract
Background and Objectives: It is generally accepted that the highest cost in software development is associated with the software maintenance phase. In corrective maintenance, the main task is correcting the bugs found by the users. These bugs are submitted by the users to a Bug Tracking System (BTS). The bugs are evaluated by the bug triager and assigned to developers for correction. To find a suitable developer to correct a bug, the developers' recent activities and previous bug fixes must be examined. This paper presents an automated method to assign bugs to developers by identifying the similarity between new bugs and previously reported bug reports. Methods: For automatic bug assignment, four clustering techniques (Expectation-Maximization (EM), Farthest First, Hierarchical Clustering, and Simple K-means) are used, and a tag is created for each cluster that indicates the developer associated with bug correction. To evaluate the quality of the proposed methods, the clusters generated by the methods are compared with the labels suggested by an expert triager. Results: To evaluate the performance of the proposed method, we use real-world data of a large-scale web-based system stored in the BTS of a software company. To select the appropriate clustering algorithm, the outputs of each algorithm are compared to the labels suggested by the expert triager; the algorithm whose output is closest to the expert opinion is selected as the best. The results showed that the EM and Farthest First clustering algorithms, with a 3% similarity error, have the most similarity with the expert opinion. Conclusion: The results obtained by the algorithms show that we can successfully apply them for bug assignment in real-world software development environments.
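The triage idea — cluster report texts, then tag each cluster with a developer — can be sketched with TF-IDF and K-means (one of the several algorithms the paper compares). The report texts and developer tags below are hypothetical.

```python
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical bug reports: two about a login crash, two about PDF export.
reports = [
    "crash on the login form submit button",
    "another crash when opening the login form",
    "exported pdf report has corrupted tables",
    "table borders missing in the exported pdf",
]
X = TfidfVectorizer().fit_transform(reports)
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

# Each cluster carries a tag naming the developer who fixes that kind of bug;
# these tags are the labels an expert triager would supply.
cluster_to_dev = {0: "dev_a", 1: "dev_b"}
assigned = [cluster_to_dev[c] for c in km.labels_]
```

A new report would be routed with `km.predict` on its TF-IDF vector, and evaluation amounts to comparing `assigned` against the expert triager's labels, as the abstract describes.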
Artificial Intelligence
I. Behravan; S. M. Razavi
Abstract
Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this paper, a novel machine learning approach, which works in two phases, is introduced to predict the price of a stock on the next day based on the information extracted from the past 26 days. In the first phase, an automatic clustering algorithm clusters the data points into different clusters, and in the second phase a hybrid regression model, a combination of particle swarm optimization and support vector regression, is trained for each cluster. In this hybrid method, the particle swarm optimization algorithm is used for parameter tuning and feature selection. Results: The accuracy of the proposed method has been measured on the datasets of five companies active in the Tehran Stock Exchange market, using five different metrics. On average, the proposed method achieved 82.6% accuracy in predicting the stock price one day ahead. Conclusion: The achieved results demonstrate the capability of the method to detect sudden jumps in the price of a stock.
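The two-phase structure — cluster first, then one tuned regressor per cluster — can be sketched as below on synthetic data. Plain random search stands in for the paper's particle swarm optimization, and the clustering algorithm and parameter ranges are our assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 5))            # e.g. features from past trading days
y = X[:, 0] * 2 + np.sin(X[:, 1])        # toy next-day target

# Phase 1: partition the samples into clusters.
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)

# Phase 2: for each cluster, tune and fit a support vector regressor.
models = {}
for c in range(3):
    idx = km.labels_ == c
    best, best_err = None, np.inf
    for _ in range(10):                  # random search over C and gamma
        C, gamma = 10 ** rng.uniform(-1, 2), 10 ** rng.uniform(-2, 0)
        m = SVR(C=C, gamma=gamma).fit(X[idx], y[idx])
        err = np.mean((m.predict(X[idx]) - y[idx]) ** 2)
        if err < best_err:
            best, best_err = m, err
    models[c] = best

def predict(x):
    c = km.predict(x.reshape(1, -1))[0]  # route the sample to its cluster's model
    return models[c].predict(x.reshape(1, -1))[0]
```

Splitting the data this way lets each regressor specialize in one price regime, which is how a clustered model can react to sudden jumps better than a single global regressor. (Scoring the search on training error, as here, is only for brevity; a held-out split would be used in practice.)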
Artificial Intelligence
M. Abdolahi; M. Zahedi
Volume 6, Issue 1, January 2018, Pages 15-24
Abstract
Background and Objectives: Discourse coherence modeling has become a critical but challenging task for content analysis in Natural Language Processing subfields such as text summarization, question answering, text generation, and machine translation. Existing methods, such as entity-based and graph-based models, engage with the semantic and linguistic concepts of a text. This means the problem cannot be solved very well: these methods are limited to the word co-occurrence information available in sequential sentences within a short part of a text. One of the greatest challenges of these methods is their limitation in evaluating the coherence of long documents, being suitable only for documents with a small number of sentences. Methods: Our proposed method focuses on both local and global coherence. It can also assess the local topic integrity of a text at the paragraph level, regardless of word meaning and handcrafted rules. Global coherence in the proposed method is evaluated through sequential paragraph dependency. Building on word embeddings and statistical approaches, the presented method incorporates external word-correlation knowledge into short and long stories to assess local and global coherence simultaneously. Results: Using the combined effect of word2vec vectors and the most likely n-grams, we show that our proposed method is independent of the language and its semantic concepts. The derived results indicate that the proposed method offers higher accuracy than the other algorithms on long documents with a high number of sentences. Conclusion: Comparing our proposed method with the BGSEG method showed a 1.19 percent improvement in the mean degree of coherence evaluation.
The results in this study also indicate that the improvements are greater in larger texts with more sentences.
Copyrights © 2018 The author(s). This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, as long as the original authors and source are cited. No permission is required from the authors or the publishers.