We use historical market data from 700 to 1800 days in the past to forecast stock behavior for the upcoming 14 to 45 days. This timeframe is chosen to balance the need for sufficient post-COVID market context with a forecasting window that minimizes short-term volatility while still capturing market trends.
For each stock or ETF, we calculate an extensive set of features (detailed below) and build a custom predictive model. This allows for each model to capture the nuances of each stock/ETF. Each model is then backtested and saved.
Every night we introduce a new parameter setting, and retrain models for 50+ assets to provide the most up-to-date predictions. Our API then delivers these forecasts. This also allows for meta-learning, where we can analyze the performance of different models and parameters to improve our overall approach.
While our APIs are flexible and support all stocks and public ETFs, we recommend focusing on the consumer goods sector. Our findings show companies in this sector tend to align with macroeconomic trends and technical indicators, making them ideal candidates for our predictive models. Examples include:
Our models integrate a wide array of features:
We utilize XGBoost for binary classification to predict whether a stock's price will increase or decrease over the next X days. XGBoost is selected for its ability to handle non-linear relationships, robustness to outliers, and high performance on tabular data.
For predicting exact returns X days into the future, we employ the Seasonal Autoregressive Integrated Moving Average with Exogenous Regressors (SARIMAX) model.
We use cross-validation to assess model performance on unseen data, reducing the risk of overfitting.
For classification models, we evaluate:
For regression models, we assess:
By analyzing feature importance, we ensure that our models make decisions based on the most meaningful predictors, which helps maintain transparency in how forecasts are generated.
We meticulously separate training and testing datasets in our time series analyses to ensure that future data does not influence model training. This step is critical for producing reliable and unbiased predictions.
We are committed to transparency. Below, you’ll find common questions users ask about our process, along with detailed answers. If you have additional questions, feel free to reach out.
While our models are built using robust statistical methods and comprehensive datasets, they are intended as predictive tools rather than guarantees of future performance. We encourage users to consult with financial advisors and consider multiple sources of information before making any investment decisions.