When we are configuring the historical data, the settings available for that are the following:
Regularly delete old raw data
Delete old aggregated report data
Schedule old data deletion
In this faq, How do I delete historical Piwik data? (purge old logs and/or old processed reports) FAQ - Analytics Platform - Matomo, there are only the two first options, but we don’t know how to use “Schedule old data deletion” and what is the purpose about this setting.
We have set up these configurations to test how to use in a production enviroment :
Regularly delete old raw data: 2 days (we will change to six months in production)
Delete old aggregated report data: 12 months
Schedule old data deletion: week.
We also have set up the script for every one hour to auto-archiving our reports
So, What is the porpuse about Shedule old data deletion? Because if the shedule old data deletion is before one of the other configuration, will be they affected?
Moreover, in our tests, we have seen that there is no difference for the user while reading the reports: they don’t know if there are an old one that have been processed or a one directly from logs.
But we have several doubts about the reports information. Because we have to be sure to give our client the differences between the reports directly from logs or proccesed ones. Reading the faq, these are the main differences:
- Transitions report : when you are viewing the transitions report, this report is directly from logs. If you have not logs at all, you have not reports even if you have processed. But there is a bug with the counter of the views.
- Unique visitors : it is supposed to be the same as with transitions report: if there are not logs, there aren’t any report. But we detected in some reports that there are still information about unique visitors, for example, Visits over time:
And for many other reports like Device type:
Why is still visible the unique visitors? It is ok for us to make it visible if they are processing with the script, but some information in the faq make it clear that they must have logs to be visible.