> For the complete documentation index, see [llms.txt](https://learn.moselle.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://learn.moselle.io/faq/forecasting/statistical-vs-ml-forecasting.md).

# What's the Difference Between Statistical and ML-Based Forecasting?

{% hint style="info" %}
**Quick Answer:** Statistical forecasting uses established mathematical models (like exponential smoothing or ARIMA) that decompose historical patterns into trend, seasonality, and noise. ML-based forecasting uses algorithms trained on large datasets to detect complex, non-linear patterns. Statistical methods are more interpretable; ML methods can outperform them at scale when data is abundant.
{% endhint %}

**Statistical forecasting** applies classical mathematical models to time series data. These models make explicit assumptions about how demand behaves — for example, that demand follows a trend that changes gradually, or that seasonal patterns repeat consistently year-over-year.

**Machine learning (ML) forecasting** uses algorithms that learn patterns from data without being explicitly programmed with those patterns. ML models can discover relationships across many variables simultaneously and do not require the same structural assumptions as statistical models.

Both approaches are used in production demand planning. The right choice — or combination — depends on your data volume, SKU complexity, and the trade-off between accuracy and interpretability.

## Key Differences

| Dimension                  | Statistical Forecasting                                     | ML-Based Forecasting                                     |
| -------------------------- | ----------------------------------------------------------- | -------------------------------------------------------- |
| **Model type**             | Explicit mathematical formulas                              | Learned from data                                        |
| **Interpretability**       | High — planners can see why the model predicts what it does | Low — "black box" behavior is harder to explain          |
| **Data requirement**       | Works with limited history (6–12 months per SKU)            | Performs better with large volumes of data               |
| **Training time**          | Fast — models fit quickly                                   | Slower — training can take hours on large datasets       |
| **Handling of complexity** | Assumes linear or structured relationships                  | Can model complex, non-linear patterns                   |
| **Best at**                | Stable, seasonal products with consistent history           | High SKU count, cross-product patterns, external signals |
| **Weakness**               | Misses complex interactions; rigid assumptions              | Opaque; can overfit; needs lots of data                  |

## Common Statistical Models

**Exponential Smoothing (ETS)** : Assigns exponentially decreasing weight to older observations. Older data influences the forecast less than recent data. Good for products with changing demand levels.

**ARIMA (AutoRegressive Integrated Moving Average)** : Models demand as a function of its own past values and past forecast errors. Powerful for stationary time series with identifiable autocorrelation patterns.

**Seasonal Decomposition (STL)** : Separates demand into trend, seasonal, and remainder components. The seasonal component is used to adjust forecasts by period.

**Croston's Method** : Designed specifically for intermittent or slow-moving demand, where many periods have zero sales.

## Common ML-Based Approaches

**Gradient Boosted Trees (e.g., XGBoost, LightGBM)** : Trains on tabular features — lag values, calendar features, product attributes — to predict demand. Strong performance at scale; widely used in demand planning competitions.

**Neural Networks / Deep Learning (e.g., LSTM, Transformer models)** : Can model sequential dependencies and learn from very long historical windows. More useful when interactions between products, channels, or external signals drive demand.

**Ensemble Methods** : Combine multiple models (statistical + ML) to produce a blended forecast. In practice, ensembles often outperform any single method because they balance each model's strengths.

## Which Approach Is Better?

Neither approach is universally superior. The evidence from forecasting research and practice:

* For **products with limited history**, statistical models often outperform ML because ML models overfit on sparse data
* For **large, diverse SKU portfolios**, ML and ensemble methods tend to outperform because they can detect cross-product patterns
* For **interpretability requirements** — when planners need to understand and trust the forecast — statistical models provide clearer explanations
* For **high-stakes planning decisions**, human-in-the-loop review matters more than the model choice

{% hint style="info" %}
The biggest driver of forecast accuracy in practice is **data quality and model inputs**, not which specific algorithm is used. Clean data, appropriate feature engineering, and correct handling of outliers and stockouts improve any model.
{% endhint %}

## How Moselle Approaches Forecasting

### Top-Down vs. Bottom-Up — and Where Each Model Type Lives

In Moselle, the choice between statistical and ML forecasting is connected to how you build your forecast:

**Top-down forecasting** starts with a total revenue or unit target set by your team and breaks it down across SKUs. Because the starting point is a human-defined number rather than a raw time series, top-down forecasts don't use statistical or ML models in the same way — they distribute a target using historical mix and proportional logic.

**Bottom-up forecasting** is where statistical and ML models do their work. Moselle analyzes your historical sales data at the SKU level and applies a model to generate a forward-looking demand signal from the ground up. This is the approach that benefits from model selection — and where the choice between statistical, probabilistic, ensemble, and deep learning methods matters.

### Moselle Recommends — But You Choose

When you build a bottom-up forecast, Moselle evaluates your SKU's data characteristics — history length, demand variability, seasonality, and sales patterns — and recommends the model most likely to perform well for your business. You don't need to become a forecasting expert to get a good starting point.

That said, all five models are available for you to explore:

| Model         | Type                     | Best For                                                               |
| ------------- | ------------------------ | ---------------------------------------------------------------------- |
| **Rome**      | Statistical (ARIMA)      | Consistent demand with predictable seasonal patterns                   |
| **Athens**    | Statistical (ETS)        | Stable, slow-growth products with 12+ months of history                |
| **Venice**    | Probabilistic (Prophet)  | Holiday and event-driven demand; handles data gaps                     |
| **Hong Kong** | Ensemble (3-model blend) | Fast-growing brands; 2+ years of history; occasional zero-sale periods |
| **London**    | Deep Learning (Chronos)  | Short history (6–9 months); complex or retail-heavy businesses         |

The best indicator for choosing between models — or validating the recommendation — is your **MAPE score** for each SKU. Moselle tracks this in the Forecast Performance Report so you can compare model accuracy over time.

{% content-ref url="<https://github.com/Moselle-io/moselle-docs/blob/master/faq/planning-and-execution/forecasting/how-to-create-a-forecast/generate-a-bottom-up-forecast.md>" %}
<https://github.com/Moselle-io/moselle-docs/blob/master/faq/planning-and-execution/forecasting/how-to-create-a-forecast/generate-a-bottom-up-forecast.md>
{% endcontent-ref %}

{% content-ref url="<https://github.com/Moselle-io/moselle-docs/blob/master/faq/analytics/reporting/moselle-reports/performance-reports/demand-forecast-performance.md>" %}
<https://github.com/Moselle-io/moselle-docs/blob/master/faq/analytics/reporting/moselle-reports/performance-reports/demand-forecast-performance.md>
{% endcontent-ref %}

## Frequently Asked Questions

### Should I ask a vendor whether they use statistical or ML forecasting?

**Answer:** Yes — but more importantly, ask how they handle forecast transparency, how planners can review and override outputs, and how accuracy is tracked over time. The model type matters less than the workflow built around it.

### Can statistical and ML models be used together?

**Answer:** Yes, and this is common in production systems. Ensemble methods blend multiple model outputs — often weighting statistical models more heavily for SKUs with short history and ML models more heavily for high-volume SKUs.

### Does ML forecasting require a data scientist to maintain?

**Answer:** Not in modern SaaS tools. ML models in production planning software are maintained by the vendor and updated automatically. End users interact with the forecast output, not the model configuration.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://learn.moselle.io/faq/forecasting/statistical-vs-ml-forecasting.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.