An Efficient Classification Model for Cyber Text

2511.03107v1 cs.LG, cs.IT, math.IT 2025-11-07

Authors:

Md Sakhawat Hossen, Md. Zashid Iqbal Borshon, A. S. M. Badrudduza

Abstract

The rise of deep learning methodology and practice in recent years has sharply increased the carbon footprint of computing, driven by an insatiable demand for computational resources and power. Text analytics has likewise been transformed by this trend toward a single dominant methodology. In this paper, the original TF-IDF algorithm is modified, and Clement Term Frequency-Inverse Document Frequency (CTF-IDF) is proposed for data preprocessing. The paper primarily examines the effectiveness of classical machine learning techniques in text analytics when combined with CTF-IDF and the faster IRLBA algorithm for dimensionality reduction. Introducing both techniques into the conventional text analytics pipeline yields an application that is more efficient, faster, and less computationally intensive than deep learning methodology, and therefore has a smaller carbon footprint, at a minor cost in accuracy. The experimental results also show a manifold reduction in time complexity and an improvement in model accuracy for the classical machine learning methods discussed in this paper.
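
For orientation, the textbook TF-IDF weighting that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration of the standard baseline only: the abstract does not define the CTF-IDF modification, so it is not reproduced here. The function name `tf_idf`, the whitespace tokenizer, and the three sample documents are assumptions made for the example.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document using textbook TF-IDF.

    Baseline only: weight = (term count / document length) * log(N / df).
    This is NOT the paper's CTF-IDF variant.
    """
    # Naive tokenizer (assumption): lowercase, split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical sample documents, for illustration only.
docs = [
    "phishing email detected",
    "malware email blocked",
    "phishing link blocked",
]
weights = tf_idf(docs)
```

A term that occurs in only one document (such as "detected") receives a higher weight than one shared across documents (such as "email"), and a term present in every document receives a weight of zero. In the pipeline the abstract describes, a term-weight matrix of this kind would then be reduced in dimension before a classical classifier is trained on it.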
