Matomo version: 5.0.3
MySQL version: 10.5.23-MariaDB-0+deb11u1
PHP version: 8.3.6
Cronjob: @hourly
(all fine).
I had considered, that the database size are more increased as expected. Normaly monthly 30 MB increasing. Last month 130 MB increased.
So, yet i had checked the database.
I have a mysqldump from 2024-04-01 and from 2024-05-01.
The archived data since 11/2011 are strong increased in the mysqldump from 2024-05-01 compared to the mysqldump from 2024-04-01.
Database tables: archive_blob_[year]_[month]
mysqldump mysqldump
2024-04-01 Rows Size 2024-05-01 Rows Size Difference Rows Size
2021_11 32960 11 2021_11 33468 18 2021_11 508 7
2021_12 27599 9 2021_12 28101 13 2021_12 502 4
2022_01 31278 13 2022_01 31804 19 2022_01 526 6
2022_02 25667 10 2022_02 26135 15 2022_02 468 5
2022_03 29756 10 2022_03 30266 16 2022_03 510 6
2022_04 26783 9 2022_04 27277 15 2022_04 494 6
2022_05 36019 11 2022_05 38690 17 2022_05 2671 6
2022_06 31780 10 2022_06 34526 17 2022_06 2746 7
2022_07 31573 10 2022_07 34426 17 2022_07 2853 7
2022_08 32442 12 2022_08 35474 18 2022_08 3032 6
2022_09 28875 9 2022_09 31646 14 2022_09 2771 5
2022_10 27343 9 2022_10 30220 14 2022_10 2877 5
2022_11 26116 8 2022_11 28891 12 2022_11 2775 4
2022_12 26878 8 2022_12 29759 12 2022_12 2881 4
2023_01 33821 14 2023_01 36319 19 2023_01 2498 5
2023_02 29050 9 2023_02 31267 14 2023_02 2217 5
2023_03 33571 11 2023_03 36288 17 2023_03 2717 6
2023_04 31652 10 2023_04 34247 17 2023_04 2595 7
2023_05 33864 12 2023_05 36330 18 2023_05 2466 6
2023_06 33141 11 2023_06 35415 17 2023_06 2274 6
2023_07 39598 12 2023_07 42171 18 2023_07 2573 6
2023_08 33237 11 2023_08 35462 15 2023_08 2225 4
2023_09 31637 10 2023_09 33901 14 2023_09 2264 4
2023_10 36138 12 2023_10 38570 16 2023_10 2432 4
2023_11 34245 12 2023_11 36647 16 2023_11 2402 4
2023_12 38267 12 2023_12 40775 17 2023_12 2508 5
2024_01 42349 16 2024_01 44945 21 2024_01 2596 5
2024_02 31791 12 2024_02 34450 17 2024_02 2659 5
2024_03 31323 12 2024_03 35051 18 2024_03 3728 6
In the next step, i had checked an archive_*
date range database table. The 2024-05-01mysqldump has more *_chunk_*
rows compared to the 2024-04-01 mysqldump.
Example:
mysqldump 2024-04-01
2641 Actions_actions 2 2021-11-01 2021-11-30 3 2023-04-04 14:37:49 [BLOB - 15.2 KiB]
2641 Actions_actions_url 2 2021-11-01 2021-11-30 3 2023-04-04 14:37:49 [BLOB - 687 B]
2641 Actions_actions_url_chunk_0_99 2 2021-11-01 2021-11-30 3 2023-04-04 14:37:49 [BLOB - 16.3 KiB]
2641 Actions_downloads 2 2021-11-01 2021-11-30 3 2023-04-04 14:37:49 [BLOB - 14 B]
2642 Actions_actions 2 2021-11-01 2021-11-07 2 2023-04-04 14:37:45 [BLOB - 8.9 KiB]
2642 Actions_actions_url 2 2021-11-01 2021-11-07 2 2023-04-04 14:37:45 [BLOB - 501 B]
2642 Actions_actions_url_chunk_0_99 2 2021-11-01 2021-11-07 2 2023-04-04 14:37:45 [BLOB - 7.2 KiB]
2642 Actions_downloads 2 2021-11-01 2021-11-07 2 2023-04-04 14:37:45 [BLOB - 14 B]
mysqldump 2024-05-01
10398 Actions_actions 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 211.8 KiB]
10398 Actions_actions_url 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 859 B]
10398 Actions_actions_url_chunk_0_99 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 81.4 KiB]
10398 Actions_actions_url_chunk_100_199 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 64.9 KiB]
10398 Actions_actions_url_chunk_200_299 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 52.3 KiB]
10398 Actions_actions_url_chunk_300_399 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 22.2 KiB]
10398 Actions_downloads 2 2021-11-01 2021-11-30 3 2024-04-16 10:15:21 [BLOB - 14 B]
10399 Actions_actions 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 83.6 KiB]
10399 Actions_actions_url 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 774 B]
10399 Actions_actions_url_chunk_0_99 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 37.9 KiB]
10399 Actions_actions_url_chunk_100_199 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 41.0 KiB]
10399 Actions_actions_url_chunk_200_299 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 7.8 KiB]
10399 Actions_downloads 2 2021-11-01 2021-11-07 2 2024-04-16 10:15:15 [BLOB - 14 B]
In the next step, i had checked this archive_*
data binary Blobs with PHP.
Example:
$blob_content = file_get_contents(__DIR__ . "/20240501-Actions_actions_url_chunk_300_399.bin");
$blob_content = unserialize(gzuncompress ($blob_content));
var_dump($blob_content);
/*
php blob.php > 20240501-Actions_actions_url_chunk_300_399.txt
*/
The blobs from the last mysqldump (20240501) have URLs inside, that not included in the older mysqldump (2024-04-01).
The question is: How does additional tracking data get into old archived data?