I’ve installed and have Piwik up and running on my Linux server. I’ve imported the log files per the documentation. I am confused on how Piwik will now track users in Real-time. There doesn’t seem to be mention of this in the docs.
The docs mention “archiving” and setting a cron to do that … but what does “archiving” do? Does it pull in the new data? How does Piwik get new data and keep the reporting up-to-date?
Thanks for any answers or point me to the documentation!
I understand it won’t be perfectly real-time, but how would I import the logs daily without producing duplicates?
What does archiving do? The docs suggest doing a cron hourly … but if nothing is changing (without importing logs at least hourly) what is the point of archiving?
Archiving builds the reports out of the tracked raw data. This doesn’t make sense for log import on an hourly basis.
In order to not get duplicates you need something like automatic log rotation. Every time after the log was rotated, you can let the log importer run. That would ensure each entry is only processed once