Mastering the Future: Forecasting Models for Smarter Business Planning
At its core, Integrated Business Planning involves the creation of a sound forecast that turns into a constrained demand plan. This is a product of analysis, near-term goals and expert opinion and also informs the long-term strategic vision. Here we provide the different ways in which forecasts are produced across industries. But before exploring specific forecasting models, it’s crucial to recognize that the real strength of demand forecasting comes from how multiple models are integrated into a well-designed architecture. Rather than relying solely on one model, businesses can gain more accurate and actionable insights by integrating multiple modeling blocks into an ensemble framework. For instance, a Marketing Mix model can provide the right pricing elasticity that is then fed into a consumption model that can forecast customer demand. This is then augmented by estimates by experts and can then feed into a sell-in model to optimize supply chain and inventory decisions. This layered approach allows each model to focus on different aspects of the forecasting process, ensuring that the final output accounts for both internal patterns and external influences. Ultimately, it is the architecture and thoughtful combination of models that drive better forecasts, addressing the complexities of real-world business challenges more effectively than any single out-of-the-box model could.
Statistical Forecasting Methods
Statistical methods rely on historical data and mathematical models to project future demand. These methods are generally easy to implement and are widely used across industries where the demand patterns are relatively stable and predictable with clear trends and seasonality, these being retail (forecasting short-term store replenishments or regional/store-level demand of staple goods), manufacturing and energy.
Time Series Analysis: examine historical data to identify trends, cycles, and seasonality. Use Case: Forecasting sales for stable, consistent products where past data patterns are predictive of future demand. Some common techniques include:
Moving Averages: calculates the average of a subset of data points within a sliding window over time. The idea is to reduce short-term fluctuations and highlight longer-term trends
Exponential Smoothing: weighs recent data more heavily than older data, making it useful for products with stable but slightly fluctuating demand.
Holt-Winters Exponential Smoothing: form of exponential smoothing used to forecast time series data that exhibits both trend and seasonality. Works well for time series with regular patterns and seasonality such as Seasonal sales forecasting for retail or e-commerce, where demand fluctuates based on holidays or recurring events.
ARIMA (AutoRegressive Integrated Moving Average): more advanced statistical forecasting method that models the relationship between current and past values in a time series. It combines three components: AutoRegressive (AR), Integrated (I), and Moving Average (MA) to model both the trend and the pattern in the data.
Time Series Analysis with Exogenous Features: adds exogenous variables (additional external features) to the time-series data to improve forecasting accuracy. These models are useful when external factors like promotions, weather, or economic conditions influence the time series.
ARIMAX (ARIMA with Exogenous Variables): Adds external, explanatory variables (e.g., marketing spend) to the ARIMA model.
SARIMAX (Seasonal ARIMAX): Similar to ARIMAX, but with seasonal adjustments. It incorporates exogenous data like economic indicators to improve seasonal forecasting.
VAR (Vector Autoregression): A multivariate model that forecasts multiple interdependent time series simultaneously, considering how they influence each other. Useful for datasets where several variables interact, like different product categories
Machine Learning (ML)-Based Forecasting Methods
ML-based methods are increasingly popular for demand forecasting because they can analyze large datasets and detect complex patterns that statistical models may miss. This is especially useful when demand is influenced by multiple factors, such as promotions, product distribution, pricing, economic conditions, or consumer behavior. However, the quality of any ML model depends heavily on the quality of the input data. In demand forecasting, having clean, structured data is a critical requirement. While deep learning techniques can incorporate unstructured data to improve forecast accuracy, such data typically serves as a supplement rather than the primary driver of model improvement.
As a result, much of the effort goes into identifying and building high-quality data inputs. Additionally, using ensemble models often comes at the cost of explainability. Techniques like SHAP can help attribute the output to the correct drivers, but as ML models become more complex, it becomes increasingly difficult to provide clear explanations for their predictions.
Here’s a breakdown of the different ways to produce ML forecasts:
Bayesian Forecasting Models: These models use Bayesian inference to update the probability of an outcome as more information becomes available. They are ideal for time series with uncertainty or irregular patterns (new product introductions or skus with little sales history)
Prophet: Developed by Facebook, Prophet is a robust model for time series forecasting with daily, weekly, or yearly seasonality. It handles missing data and outliers well.Use Case: Retail sales forecasting with seasonality, holiday effects, and promotions.
Orbit: Another Bayesian forecasting model similar to Prophet. It’s designed for scalable Bayesian time series forecasting and is particularly suitable for datasets with irregular or sparse patterns.
Bayesian Ridge Regression: An ML extension of linear regression that includes Bayesian inference for time series prediction with uncertainty estimates
Gradient Tree Boosting Models: This category includes methods that apply decision trees in an iterative boosting framework. These models are powerful for predicting demand when multiple variables interact (e.g., promotions, prices, seasons). Use Case: Forecasting product sales based on many structured inputs like product type, historical demand, location and pricing.
Random Forests: An ensemble learning model that builds multiple decision trees and averages their predictions for more accurate results.
XGBoost (Extreme Gradient Boosting): A highly efficient, gradient-boosting algorithm that builds trees iteratively, improving each step.
LightGBM: Another fast, efficient gradient boosting algorithm similar to XGBoost, used for time series forecasting with engineered features. It is known for handling large datasets with ease.
Neural Networks: Flexible models that can handle both structured and unstructured data, learning complex patterns from large datasets. These are not widely used for time-series prediction but can be used as part of ensemble modle to augment prediction especially from unstructured data. Use Case: Forecasting sales or demand based on historical trends, customer behaviors and time-dependent data (e.g., weekly sales fluctuations).
Feedforward Neural Networks: A basic type of neural network that predicts outcomes by learning the relationships between input variables.
LSTM (Long Short-Term Memory Networks): A type of recurrent neural network (RNN) designed to capture sequential patterns in time-series data.
Collaborative and Judgmental Forecasting
These strategies integrate human expertise, market knowledge, and collaboration across stakeholders and can be used standalone or as augmentation to statistical or ML methodologies discussed above. This is how industries like manufacturing have traditionally set up where they align their production schedules with anticipated demand by bringing together data from various departments (e.g., sales, marketing, and operations) to create a consolidated demand forecast.
Delphi Method: This expert-driven forecasting method is often used in healthcare, pharmaceuticals, and industries with long product development cycles. Experts provide independent demand estimates, which are then aggregated for a consensus forecast.
Collaborative Planning, Forecasting, and Replenishment (CPFR): This method is prevalent in retail and supply chain industries, where companies collaborate with suppliers and retailers to share data and co-create demand forecasts. This helps ensure products are always available where and when they are needed.

