close
Skip to main content
Log in

StockAI 3.0: Ensemble fusion paradigms using novel gating mechanism in long short-term memory architectures for forecasting sentiment-based stock trends

  • Application of soft computing
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Financial time series prediction requires researchers to detect dynamic market indicator relationships while also predicting short-term market trends that are either positive or negative. Conventional LSTM (cLSTM) models face limitations when identifying multiple complex relationships between their input data points. Our work adds Price-to-Earnings (PE) and Price-to-Book (PB), Volatility Index (VIX), and Sentiment Score to the standard Open, Close, Volume features to enhance prediction accuracy levels. We introduce two enhanced Extended Long Short-Term Memory (xLSTM) architectures: (i) xLSTMcg, which implements cross-gating, and (ii) xLSTMeg, which employs exponential gating to improve information flow between feature channels. We compare categorical versus feature-based loss functions and apply ensemble techniques—bagging, boosting, and stacking—to optimize our bidirectional fused models. Furthermore, our study also evaluates the stability of models on validation dataset. The bidirectional configurations produced superior results compared to unidirectional configuration by 2% to 3%. Through the experiment, xLSTMeg model achieved 86.2% accuracy, but when combined within a stacking ensemble, yielded an accuracy of 91.2% with area-under-the-curve (AUC) of 0.95. The obtained results of xLSTMeg showed better performance than both cLSTM and xLSTMcg models. Additionally, our study meets regulatory standards by maintaining less than a 5% performance difference between experimental and external validation dataset. Integrating advanced gating mechanisms with ensemble fusion significantly enhances LSTM-based financial forecasts, delivering robust performance under varied market conditions. This framework lays a solid foundation for future market-forecasting research and supports more effective trading decisions and risk-management strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from $39.99 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
The alternative text for this image may have been generated using AI.
Fig. 2
The alternative text for this image may have been generated using AI.
Fig. 3
The alternative text for this image may have been generated using AI.
Fig. 4
The alternative text for this image may have been generated using AI.
Fig. 5
The alternative text for this image may have been generated using AI.
Fig. 6
The alternative text for this image may have been generated using AI.
Fig. 7
The alternative text for this image may have been generated using AI.
Fig. 8
The alternative text for this image may have been generated using AI.
Fig. 9
The alternative text for this image may have been generated using AI.
Fig. 10
The alternative text for this image may have been generated using AI.
Fig. 11
The alternative text for this image may have been generated using AI.
Fig. 12
The alternative text for this image may have been generated using AI.
Fig. 13
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data avaliability

The data used in this manuscript is proprietary to AtheroPoint LLC, and is not publicly available.

Abbreviations

ACCU:

Accuracy

AUC:

Area-under-the-curve

AUGU:

Augmentation

Bi:

Bi-directional

CL:

Categorical Cross-Entropy Loss

CLSTM:

Conventional Long Short-Term Memory

DL:

Deep Learning

EDL:

Ensemble Deep Learning

EML:

Ensemble Machine Learning

ET:

Extra Trees

F1:

F1-Score

FL:

Feature Loss

GB:

Gradient Boosting

GNB:

Gaussian Naïve Baise

LR:

Linear Regression

LSTM:

Long Short-Term Memory

MoE:

Margin of Error

NSE:

National Stock Exchange

P/B:

Price to Book ratio

P/E:

Price to Earnings Ratio

PA:

Power Analysis

PREC:

Precision

RF:

Random Forest

RNN:

Recurrent Neural Network

ROC:

Receiver Operator Characteristic

SENS:

Sensitivity

SML:

Solo Machine Learning

SMOTE:

Synthetic Minority Oversampling Technique

SVM:

Support Vector Machine

Uni:

Uni-directional

VADAR:

Valence Aware Dictionary and sEntiment Reasoner

VIX:

Volatility Index

xLSTMcg:

Extended Long Short-Term Memory with Cross Gates

xLSTMeg:

Extended Long Short-Term Memory with exponential Gates

L CL :

Categorical Cross-Entropy Loss

N:

Total number of samples

yᵢ:

Ground truth label for sample i

aᵢ:

Predicted probability

ŷ i :

Predicted feature value for sample

y i true :

Ground truth feature value for sample i

xₜ:

New input on time t

iₜ:

Input Gate on time t

fₜ:

Forgot Gate on time t

cₜ₋₁:

Memory before time t

Wx:

Learning weight matrices for the state x

B:

Bias

⨀:

Element wise-multiplication

cₜ:

Current memory

oₜ:

Output Gate at time t

σ:

Sigmoid Function

hₜ:

Hidden State

tan:

Tangent Function

(i):

Represent the i th cell

ĉₜ:

Candidate cell state

Cross:

Cross influence

exp:

Exponential Function

TPd:

True Positives for Deep Learning

TNd:

True Negative for Deep Learning

FPd:

False Positives for Deep Learning

FNd:

False Negative for Deep Learning

AUC:

Area-under-the-curve

∫(t):

Integral Function at time t

References

Download references

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: Y.S.; S.G.; N.G.; E.T.; and J.S.S.; resources, Y.S.; S.G. N.G.; N.N.K.; and M.A.M.; and J.S.S.; writing—original draft preparation, Y.S; V.R.; P.A.; V.K.; and S.N.; writing—review and editing Y.S.; N.G.; E.T.; and R.S.; supervision, P.A.; V.K.; S.N.; L.S.; and J.S.S.; All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Jasjit S. Suri.

Ethics declarations

Conflict of interests

The authors have no conflicts of interest to declare.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sharma, Y., Gupta, S., Gupta, N. et al. StockAI 3.0: Ensemble fusion paradigms using novel gating mechanism in long short-term memory architectures for forecasting sentiment-based stock trends. Soft Comput 29, 5803–5829 (2025). https://doi.org/10.1007/s00500-025-10901-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1007/s00500-025-10901-8

Keywords

Profiles

  1. Yuvraj Sharma
  2. Puneet Ahluwalia
  3. Vandana Kumari
  4. Luca Saba
  5. Jasjit S. Suri