I am using the piwik/misc/log-analytics/import_logs.py script to import Urchin logs into a Piwik site. It works fine, except that one crawler isn’t detected as a bot, so its hits are imported as regular visits.
I know I could exclude its IP address, but is there a way to exclude the crawler by its user agent instead?
The crawler is:
LWP::Simple/5.827 libwww-perl/5.834
Name: libwww-perl 5.834
Family: libwww-perl
Type: Library
This is a sample line from the log file:
143.132.224.66 website - [23/Oct/2012:01:40:50 +0100] "GET /info/page.shtml HTTP/1.1" 200 12915 "-" "LWP::Simple/5.827 libwww-perl/5.834" "-"
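
One possible workaround is to pre-filter the log before handing it to import_logs.py. The sketch below is only an illustration and is not part of Piwik: it assumes the user agent is always the third double-quoted field (as in the line above) and that the quotes in the real file are plain ASCII `"`. The names `user_agent_of`, `is_excluded`, `filter_lines` and `EXCLUDED_AGENT_SUBSTRINGS` are made up for this example. It may also be worth checking the script's `--help` output for a built-in option before resorting to this.

```python
import re

# Matches each double-quoted field in a log line (request, referrer, user agent, ...).
QUOTED_FIELD = re.compile(r'"([^"]*)"')

# Hypothetical exclusion list: substrings of user agents to drop (case-insensitive).
EXCLUDED_AGENT_SUBSTRINGS = ("libwww-perl",)


def user_agent_of(line):
    """Return the user agent, assumed to be the third quoted field, or None."""
    fields = QUOTED_FIELD.findall(line)
    return fields[2] if len(fields) >= 3 else None


def is_excluded(line, needles=EXCLUDED_AGENT_SUBSTRINGS):
    """True if the line's user agent contains any of the excluded substrings."""
    agent = user_agent_of(line)
    if agent is None:
        # Lines that don't match the assumed layout are kept rather than dropped.
        return False
    agent = agent.lower()
    return any(needle.lower() in agent for needle in needles)


def filter_lines(lines, needles=EXCLUDED_AGENT_SUBSTRINGS):
    """Return only the lines whose user agent is not excluded."""
    return [line for line in lines if not is_excluded(line, needles)]
```

To use it, the lines of the original log could be read, passed through `filter_lines`, written to a new file, and that file imported instead of the original.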