Journal of Electrical and Computer Engineering Innovations (JECEI)

Artificial Intelligence

An Effective Ensemble of Deep and Machine Learning Methods for Classifying the Expertise Shape of CQA Users

S. Nemati

Volume 12, Issue 2 , July 2024, , Pages 409-424

https://doi.org/10.22061/jecei.2024.10621.724

Abstract

Background and Objectives: Community question-answering (CQA) websites have become increasingly popular as platforms for individuals to seek and share knowledge. Identifying users with a special shape of expertise on CQA websites is a beneficial task for both companies and individuals. Specifically, ... Read More Background and Objectives: Community question-answering (CQA) websites have become increasingly popular as platforms for individuals to seek and share knowledge. Identifying users with a special shape of expertise on CQA websites is a beneficial task for both companies and individuals. Specifically, finding those who have a general understanding of certain areas but lack expertise in other fields is crucial for companies who are planning internship programs. These users, called dash-shaped users, are willing to work for low wages and have the potential to quickly develop into skilled professionals, thus minimizing the risk of unsuccessful recruitment. Due to the vast number of users on CQA websites, they provide valuable resources for finding individuals with various levels of expertise. This study is the first of its kind to directly classify CQA users based solely on the textual content of their posts. Methods: To achieve this objective, we propose an ensemble of advanced deep learning algorithms and traditional machine learning methods for the binary classification of CQA users into two categories: those with dash-shaped expertise and those without. In the proposed method, we used the stack generalization to fuse the results of the dep and machine learning methods. To evaluate the effectiveness of our approach, we conducted an extensive experiment on three large datasets focused on Android, C#, and Java topics extracted from the Stack Overflow website. Results: The results on four datasets of the Stack Overflow, demonstrate that our ensemble method not only outperforms baseline methods including seven traditional machine learning and six deep models, but it achieves higher performance than state-of-the-art deep models by an average of 10% accuracy and F1-measure. Conclusion: The proposed model showed promising results in confirming that by using only their textual content of questions, we can classify the users in CQA websites. Specifically, the results showed that using the contextual content of the questions, the proposed model can be used for detecting the dash-shaped users precisely. Moreover, the proposed model is not limited to detecting dash-shaped users. It can also classify other shapes of expertise, such as T- and C-shaped users, which are valuable for forming agile software teams. Additionally, our model can be used as a filter method for downstream applications, like intern recommendations.

Data Mining

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

R. Asgarnezhad; A. Monadjemi; M. SoltanAghaei

Volume 8, Issue 1 , January 2020, , Pages 41-52

https://doi.org/10.22061/jecei.2020.7100.357

Abstract

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are ... Read More

Journal of Electrical and Computer Engineering Innovations (JECEI)

Articles in Press

Current Issue

Volume 13 (2025)

Volume 12 (2024)

Volume 11 (2023)

Volume 10 (2022)

Volume 9 (2021)

Volume 8 (2020)

Volume 7 (2019)

Volume 6 (2018)

Volume 5 (2017)

Volume 4 (2016)

Volume 3 (2015)

Volume 2 (2014)

Volume 1 (2013)

Keywords = Ensemble Method

An Effective Ensemble of Deep and Machine Learning Methods for Classifying the Expertise Shape of CQA Users

Abstract

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Abstract