I’ve read everything I can find on importing Apache logs into Piwik, but there are a few basic things I’m still unclear on.
Re-loading logs: To automate the loading of logs, is it safe to re-import the same log multiple times? For example, if I want to do a daily import of access.log but only run logrotate weekly, is Piwik smart enough to skip records it has already imported? Or will I end up with duplicate records in the Piwik log tables and inflated/incorrect stats? Of course, I can test this out, but someone must know the actual logic used during the import.
Minimal/Verbose Piwik website definitions: I’m thinking I might want more than one log-import Piwik website, so that I can run one with default/minimal settings and one with more complete logs (using --enable-bots --enable-static --enable-http-errors --enable-http-redirects). But is this really necessary? It seems to me it would be more efficient to load the full verbose logs once and then define filters within Piwik to show the level of detail wanted.
Excluding paths after load: Somewhat related to the previous question: if, after importing, I see various paths that I would rather hide in the visitor actions lists, is there a way to filter those out after the fact? Or do I have to purge everything for that Piwik website and reload from scratch using --exclude-path / --exclude-path-from?