We have set up these configurations to test how to use in a production enviroment :
Regularly delete old raw data: 2 days (we will change to six months in production)
Delete old aggregated report data: 12 months
Schedule old data deletion: week.
We also have set up the script for every one hour to auto-archiving our reports
So, What is the porpuse about Shedule old data deletion? Because if the shedule old data deletion is before one of the other configuration, will be they affected?
Moreover, in our tests, we have seen that there is no difference for the user while reading the reports: they don’t know if there are an old one that have been processed or a one directly from logs.
But we have several doubts about the reports information. Because we have to be sure to give our client the differences between the reports directly from logs or proccesed ones. Reading the faq, these are the main differences:
Transitions report : when you are viewing the transitions report, this report is directly from logs. If you have not logs at all, you have not reports even if you have processed. But there is a bug with the counter of the views.
Unique visitors : it is supposed to be the same as with transitions report: if there are not logs, there aren’t any report. But we detected in some reports that there are still information about unique visitors, for example, Visits over time:
Why is still visible the unique visitors? It is ok for us to make it visible if they are processing with the script, but some information in the faq make it clear that they must have logs to be visible.
I guess that this field schedule the data deletion: every two days the logs will be deleted, after running the script to auto-archiving our reports.
So, if “Schedule old data deletion” set the data deletion for every month, there will be not data older than two days because the setting above stablish the data deletion every two days. So for me, it has not sense at all.
Moreover, we have to confirmed which is the main difference between the reports directly from logs or proccesed ones:
Transitions report: bug about counter of the views.
Unique visitors: are there still visible after processing the script?
No, if you set Delete old data every to month and Delete logs older than to 2 than Matomo will run a job once a month that will go through all logs and delete those that are older than 2 days at this time.
So the Delete logs older than setting allows you to set what logs Matomo considers to be old (and therefore should be deleted, but it does not influence how often this deletion should happen.