###### Technical Articles

# Statistical Forecasting in IBP- Manage Forecast Models.

**Introduction:**

While the overall process of statistical forecasting is not that complicated (unless you get into advanced statistical models), there are some minor details in the “**Manage Forecast Models**” app which can be confusing. Here I am sharing a simple example to demonstrate the key technical features of the app for ease of understanding.

__Example:__

For this example, one **Product-Location-Customer **combination has been considered. While this might not be the most ideal level for forecasting in majority of cases, some businesses might still like to forecast at this detailed granularity. The level of forecasting does not matter in context of this blog.

Refer Fig. 01. “**Actuals Qty.”** is the key figure that stores sales history data. Just by looking at the data, you can notice a few things- values are missing for **“MAY 2022”** & **“AUG 2022”**. You may also notice that the value “*1000*” in “**MAR 2022**” is much higher compared to rest of the data set and is most likely an ‘**outlier’**. We will validate this assumption in the later sections.

**“Manage Forecast Models”** App has multiple tabs- **GENERAL, PREPROCESSING STEPS, FORECASTING STEPS, POSTPROCESSING STEPS. **Let us look at the individual settings sequentially.

**GENERAL:** The General Tab settings are shown in Fig. 02.

**PREPROCESSING STEPS:**

Next is the “PREPROCESSING STEPS” tab. This is where we add settings to correct sales history data for ** “Missing values”** &

**Multiple preprocessing steps can be added one after another in this tab. It is important to judge the sequence in which you want to run preprocessing steps because wrong sequence might lead to undesirable results. For example, let us say you want to**

*“Outliers.”***“Substitute missing values”**in the historical data using

*“Mean”(**Fig. 03).*

So, the **“average(mean)”** of the entire historical data set will be calculated, and the missing values are substituted by that “average.” While calculating the average, ‘*1000*’ which is most probably an outlier, also gets included in the calculation which might not be desirable.

**Average**= Sum of data points/No. of data points= 1595/10= 159.5 (note that the ‘null values’ are not included as valid data points while calculating ‘mean’. So, the denominator in this case is ‘10’ and not ‘12’). See excel output below (Fig. 04)

* *The example shown above was just to highlight the impact of a probable outlier on the calculation of substituting missing values. However, to proceed further, we will replace missing values by a *‘Given Value’**, = 80 (Settings shown in Fig. 05).*

After running statistical forecasting operator, the given value of ‘80’ gets substituted in place of missing values (Fig. 06).

After ** “Missing Values”** are substituted, the next step is to check for

**and to correct them (note that there is no SAP recommended set of steps for preprocessing tasks. It depends on individual business needs. In some cases, users might want to run all combination of sequences of preprocessing tasks before deciding on the final approach).**

*“Outliers”*In this case, we are going ahead with ** “Interquartile Range test” for **“

**See settings below (Fig. 07).**

*Outlier Detection.”*As *“Substitute Missing Values”* is the predecessor step for outlier detection and the Input key figure is same in both (Actuals Qty.), the output of that step automatically becomes the input for outlier detection step.

Fig. 08 shows how IQR(Inter Quartile Range), Lower & Upper bounds are calculated. The only **“Actuals Qty.” **value which is outside of the boundaries (<3.5 or >139.5) is ‘*1000’* and hence it is detected as an outlier.

Since, the outlier correction method chosen by us is ** “Correction with tolerance excluding outliers,”** tolerance(bounds) are now recalculated for the dataset excluding outlier(s).

After running statistical forecasting, the upper bound value of ‘152’ (which is closer to 1000 than the lower bound value of -32) substitutes the value of ‘1000’.

**FORECASTING STEPS:**

This is where we define input & output key figures for forecasting and choose the forecasting algorithm.

It is important to note that even within the same forecasting model, the ** preprocessing steps** might not always have direct connection to the

**. It is**

*forecasting step***for both these steps that can establish that connection.**

__only the input key figures__For example, in Fig. 11, the main input for forecasting step is ** “Actuals Qty. Adj.”** Now, this key figure has some manually adjusted values as shown in Fig. 12.

In the preprocessing steps for **“substituting missing values”** and **“detecting & correcting outliers”**, **“Actuals Qty.”** is taken as the input key figure while for **forecasting step**, a different key figure is now considered as input. So, forecasting output will have nothing to do with either **“Actuals Qty.”** key figure values or the output of the preprocessor steps.

Based on the settings in Fig. 11. Look at the Statistical Forecasting output(Fig. 13)using the **simple average method**. The forecasting output is a simple average of **“Actuals Qty. Adj.”** key figure:

You see that everything that we did as part of the Preprocessing step has not been considered as in input to Forecasting since a different key figure was chosen as ** main input for forecasting**.

Now, let us consider the same key figure **“Actuals Qty.”** as the input to forecasting step as well (Fig. 14).

The actual input to statistical forecasting now is the final value of **“Actuals Qty.”** (corrected for both missing values & outliers). You can validate that by comparing the key figures ** “Stat Output for Outlier Correction” **&

**Both have the same values.**

*“Target Key Figure for Calculated Forecast Input.”***POSTPROCESSING STEPS:**

In this step, we choose the error measure for comparison between Sales History & Ex-post forecast. In this example, MAPE has been chosen as the error measure (Fig. 16).

The standard MAPE key figure doesn’t have a time dimension associated with it. So, a custom key figure has to be created to make it time dependent (so it can be seen in excel planning view). There are several ways of doing it. For this example, I have used the following calculation (Fig. 16). MAPE results can be seen in Fig. 15 & Fig. 17.

I hope the example shared above simplifies the understanding around “Manage Forecast Models” app. I have tried to cover how the different tabs within the app link to each other and how output of one, influences other. You can read other posts relevant to IBP Demand at these links:

SAP Integrated Business Planning for demand | SAP | SAP Blogs

All Questions in SAP Integrated Business Planning for demand | SAP Community

Please feel free to share your feedback/thoughts.

Very informative. Thanks for sharing kunal roy